Resources for Text, Speech and Language Processing

Books and publishing houses

  1. Books
  2. Publishing houses


Information Retrieval

  1. William R. Hersh, "Information Retrieval: A Health and Biomedical Perspective (2nd edition)", Springer Verlag, 2003, ISBN 0-387-95522-4
  2. W. Bruce Croft and John Lafferty (editors), "Language Modeling for Information Retrieval", Kluwer Academic Publishers, 2003, ISBN 1-4020-1216-0, The Kluwer International Series on Information Retrieval, Volume 13
  3. Soumen Chakrabarti, "Mining the Web: Discovering Knowledge from Hypertext Data", Morgan Kaufmann Publishers, 2002, ISBN 1558607544
  4. Thorsten Joachims, "Learning to Classify Text using Support Vector Machines", Kluwer Academic Publishers, 2002
  5. Ian H. Witten, Alistair Moffat, and Timothy C. Bell, "Managing Gigabytes", Morgan Kaufmann, 1999
  6. George Chang et al. (editors), "Mining the World Wide Web", Kluwer Academic Publishers, 2001
  7. W. Bruce Croft (editor), "Advances in Information Retrieval", Kluwer Academic Publishers, 2000
  8. Gerald J. Kowalski and Mark T. Maybury, "Information Storage and Retrieval Systems Theory and Implementation", Kluwer Academic Publishers, 2000
  9. Remco C. Veltkamp, Hans Burkhardt, and Hans-Peter Kriegel (editors), "State-of-the-Art in Content-Based Image and Video Retrieval", Kluwer Academic Publishers, 2001
  10. Ludovic Lebart, Andre Salem, and Lisette Berry, "Exploring Textual Data", Kluwer Academic Publishers, 1997. Review available in Computational Linguistics, 25(1), 1999.
  11. Sandor Dominich, "Mathematical Foundations of Information Retrieval", Kluwer Academic Publishers, 2001
  12. David A. Grossman and Ophir Frieder, "Information Retrieval: Algorithms and Heuristics", Kluwer Academic Publishers, 1998
  13. Michael W. Berry and Murray Browne, "Understanding Search Engines: Mathematical Modeling and Text Retrieval", SIAM Press, 1999
  14. Dan Gusfield, "Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology", Cambridge University Press, 1997
  15. Peter W. Foltz (editor), "Quantitative Approaches to Semantic Knowledge Representations: A Special Issue of Discourse Processes", Lawrence Erlbaum Associates, 1998
  16. Ricardo Baeza-Yates and Berthier Ribeiro-Neto, "Modern Information Retrieval", Addison Wesley / ACM Press Series, 1999
  17. Christ-Jan Doedens, "Text Databases: One Database Model and Several Retrieval Languages", Rodopi Bv Editions
  18. Robert Spence, "Information Visualization", Addison Wesley / ACM Press, 2001
  19. Robert R. Korfhage, "Information Storage and Retrieval", John Wiley and Sons, 1997
  20. William Frakes and Ricardo Baeza-Yates, "Information Retrieval: Data Structures and Algorithms", Prentice Hall, 1992
  21. Tomek Strzalkowski (editor), "Natural Language Information Retrieval", Kluwer Academic Publishers, 1999

Artificial Intelligence and Machine Learning

  1. Stuart J. Russell and Peter Norvig, "Artificial Intelligence: A Modern Approach", Prentice Hall, 1994
  2. Tom M. Mitchell, "Machine Learning", McGraw-Hill, 1997
  3. L. Breiman, J.H. Friedman, R.A. Olshen and C.J. Stone, "Classification and Regression Trees", Wadsworth, Belmont, CA, 1984
  4. Huan Liu and Hiroshi Motoda (editors), "Feature Extraction, Construction and Selection: A Data Mining Perspective", Kluwer Academic Publishers, 1998, ISBN 0-7923-8196-3, The Kluwer International Series in Engineering and Computer Science, Volume 453
  5. Huan Liu and Hiroshi Motoda (editors), "Feature Selection for Knowledge Discovery and Data Mining", Kluwer Academic Publishers, 1998, ISBN 0-7923-8198-X, The Kluwer International Series in Engineering and Computer Science, Volume 454
  6. D. Paul Benjamin (editor), "Change of Representation and Inductive Bias", Kluwer Academic Publishers, 1990, ISBN 0792390555

Statistical Processing of Data

  1. David J. Hand, Heikki Mannila and Padhraic Smyth, "Principles of Data Mining", MIT Press, 2000
  2. Richard O. Duda, Peter E. Hart, and David G. Stork, "Pattern Classification (2nd edition)", John Wiley and Sons, 2000, ISBN 0471056693
  3. Michael Berthold and David J. Hand (editors), "Intelligent Data Analysis: An Introduction", Springer Verlag, 1999
  4. W.J. Krzanowski, "Principles of Multivariate Analysis (2nd edition)", Oxford University Press, 2000, ISBN 0198507089
  5. Thomas M. Cover and Joy A. Thomas, "Elements of Information Theory", John Wiley and Sons, 1991, ISBN 0471062596
  6. Douglas C. Montgomery, "Design and Analysis of Experiments (5th edition)", John Wiley and Sons, 2001
  7. Paul R. Cohen, "Empirical Methods for Artificial Intelligence", MIT Press, 1995

Natural Language Processing

  1. Peter Jackson and Isabelle Moulinier, "Natural Language Processing for Online Applications", John Benjamins, 2002, Natural Language Processing Series
  2. Daniel Jurafsky and James H. Martin, "Speech and Language Processing", Prentice Hall, 2000
  3. Christopher D. Manning and Hinrich Schuetze, "Foundations of Statistical Natural Language Processing", MIT Press, 1999
  4. James Allen, "Natural Language Understanding (2nd edition)", Addison Wesley, 1995
  5. Christiane Fellbaum (editor), "WordNet: An Electronic Lexical Database", MIT Press, 1998
  6. Susan Armstrong et al. (editors), "Natural Language Processing Using Very Large Corpora", Kluwer Academic Publishers, 1999
  7. Tony McEnery and Andrew Wilson, "Corpus Linguistics", Edinburgh University Press, 1996
  8. Graeme D. Kennedy, "An Introduction to Corpus Linguistics", 1999. Review available in Computational Linguistics, 25(2):299-301, June 1999.
  9. Eugene Charniak, "Statistical Language Learning", MIT Press, 1996. Review available in Computational Linguistics, 21(1):103-111, 1995.
  10. Robert Dale, Hermann Moisl, and Harold Somers (editors), "Handbook of Natural Language Processing", Marcel Dekker, 2000
  11. R. Harald Baayen, "Word Frequency Distributions", Kluwer Academic Publishers, 2001
  12. Fernando C.N. Pereira and Stuart M. Shieber, "PROLOG and Natural Language Analysis", CSLI Publications, 1987
  13. Michael A. Covington, "Natural Language Processing for Prolog Programmers", Prentice Hall, 1994
  14. James Pustejovsky, "The Generative Lexicon", Bradford Books, 1998
  15. Paul P. Wang, "Computing with Words", John Wiley and Sons, 2001
  16. D. A. Cruse, "Lexical Semantics", Cambridge University Press, 2001
  17. Ashwin Ram and Kenneth Moorman (editors), "Understanding Language Understanding", MIT Press, 1999
  18. Eneko Agirre and Philip Edmonds (editors), "Word Sense Disambiguation: Algorithms and Applications", Springer, 2006

Text Summarization

  1. Inderjeet Mani, "Automatic Summarization", John Benjamins, 2001
  2. Inderjeet Mani and Mark T. Maybury (editors), "Advances in Automatic Text Summarization", MIT Press, 1999

Question Answering

  1. Tomek Strzalkowski and Sanda Harabagiu, "Advances in Open-Domain Question Answering", Kluwer Academic Publishers, 2002

Logic and Linguistics

  1. Andrew Radford, "Linguistics: An Introduction", Cambridge University Press, 1999
  2. Carl Pollard and Ivan A. Sag, "Head-Driven Phrase Structure Grammar", University of Chicago Press, 1994
  3. Mary Dalrymple et al. (editors), "Formal Issues in Lexical-Functional Grammar", CSLI Publications, 2000
  4. J.F.A.K. van Benthem and Alice ter Meulen (editors), "Handbook of Logic and Language", MIT Press, 1997
  5. L.T.F. Gamut, "Logic, Language, and Meaning: Introduction to Logic (2 volumes)", University of Chicago Press, 1991
  6. Stuart M. Shieber, "An Introduction to Unification-Based Approaches to Grammar", CSLI Publications
  7. Henriette de Swart, "Introduction to Natural Language Semantics", CSLI Publications
  8. Rebecca Green, Carol A. Bean and Sung Hyon Myaeng (editors), "The Semantics of Relationships: An Interdisciplinary Perspective", Kluwer Academic Publishers, 2002, ISBN 1-4020-0568-7, Information Science and Knowledge Management, Volume 3

Speech Processing

  1. Frederick Jelinek, "Statistical Methods for Speech Recognition", MIT Press, 1999
  2. Lawrence Rabiner and Biing-Hwang Juang, "Fundamentals of Speech Recognition", Prentice Hall, 1993

Software Engineering

Well, NLP and AI are fascinating stuff and everything, but in the end it all comes down to implementing systems that actually work ... :)
  1. Brian Kernighan and Dennis Ritchie, "The C Programming Language (2nd edition)", Prentice Hall
  2. Andrew Koenig, "C Traps and Pitfalls", Addison Wesley, 1989
  3. Stanley Lippman, "The C++ Primer (3rd edition)", Addison Wesley
  4. Stanley Lippman, "Inside the C++ Object Model", Addison Wesley
  5. Scott Meyers, "Effective C++: 50 Specific Ways to Improve Your Programs and Designs (2nd edition)", Addison Wesley, 1998
  6. Scott Meyers, "More Effective C++: 35 New Ways to Improve Your Programs and Designs", Addison Wesley, 1996
  7. Scott Meyers, "Effective STL: 50 Specific Ways to Improve Your Use of the Standard Template Library", Addison Wesley, 2001
  8. Nicolai M. Josuttis, "The C++ Standard Library: A Tutorial and Reference", Addison Wesley, 1999
  9. Bjarne Stroustrup, "The C++ Programming Language (3rd edition)", Addison Wesley
  10. Erich Gamma, Richard Helm, Ralph Johnson, and John Vlissides, "Design Patterns: Elements of Reusable Object-Oriented Software", Addison Wesley
  11. William J. Brown, Raphael C. Malveau, Hays W. McCormick III, and Thomas J. Mowbray, "AntiPatterns", John Wiley and Sons
  12. John Vlissides, "Pattern Hatching: Design Patterns Applied", Addison Wesley
  13. Bertrand Meyer, "Object-Oriented Software Construction (2nd edition)", Prentice Hall
  14. Frederick Brooks, "The Mythical Man-Month (2nd edition)", Addison Wesley

Computer Science

  1. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein, "Introduction to Algorithms (2nd edition)", MIT Press, 2001
  2. John E. Hopcroft, Rajeev Motwani, and Jeffrey D. Ullman, "Introduction to Automata Theory, Languages, and Computation (2nd edition)", Addison Wesley, 2000
  3. Alfred V. Aho, Ravi Sethi, and Jeffrey D. Ullman, "Compilers: Principles, Techniques, and Tools", Addison Wesley, 1986
  4. Michael R. Garey and David S. Johnson, "Computers and Intractability: A Guide to the Theory of NP-Completeness", W.H. Freeman and Company, 1979
  5. William H. Press, Saul A. Teukolsky, William T. Vetterling, and Brian P. Flannery, "Numerical Recipes in C (2nd edition)", Cambridge University Press, 1995

Publishing houses

Most publishing houses let you browse their catalogs online, order a hard copy, or both. To order a catalog, go to the publisher's Web site and fill in the appropriate form, or send email to customer service. Some publishers (notably Kluwer and Prentice Hall) offer convenient email alerts about new books and journal issues.

Computational Linguistics, Information Retrieval, Artificial Intelligence

  1. AAAI Press
  2. Academic Press
  3. Cambridge University Press
  4. Center for the Study of Languages and Information (CSLI)
  5. Elsevier Science
  6. Kluwer Academic Publishers
  7. MIT Press
  8. Morgan Kaufmann Publishers
  9. Oxford University Press
  10. Prentice Hall
  11. Springer-Verlag
  12. University of Chicago Press

General-purpose computer science books

  1. ACM Press
  2. Addison-Wesley Longman, along with The Benjamin/Cummings Publishing Company
  3. Houghton Mifflin
  4. John Wiley and Sons
  5. Macmillan Computer Reference
  6. Marcel Dekker
  7. McGraw-Hill Book Company
  8. O'Reilly and Associates (a printed catalog is available by email)
  9. W.H. Freeman and Company

Evgeniy Gabrilovich

Last updated on July 28, 2006
