Foundations of Statistical Natural Language Processing

Front Cover
Hinrich Schütze
MIT Press, 1999 - Language Arts & Disciplines - 680 pages
25 Reviews

Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.

  

What people are saying - Write a review

User ratings

5 stars
14
4 stars
6
3 stars
2
2 stars
1
1 star
2

Review: Foundations of Statistical Natural Language Processing

User Review  - Rachid El guerrab - Goodreads

Needs more walk-through integrated examples, not just simple illustrations for specific paragraphs. It could also benefit from a discussion of NLP software and possible architectures for the domain. Read full review

Review: Foundations of Statistical Natural Language Processing

User Review  - Michael Shaw - Goodreads

A must read for anyone looking to get into NLP. Teaches from first principles, including briefly touching on information theory/entropy. I felt it was well grounded, and proceded at a good pace. No ... Read full review

Contents

Introduction
3
Mathematical Foundations
39
Linguistic Essentials
81
CorpusBased Work
117
Collocations
151
n gram Models over Sparse Data
191
Word Sense Disambiguation
229
Lexical Acquisition
265
Probabilistic Context Free Grammars
381
Probabilistic Parsing
407
Statistical Alignment and Machine Translation
463
Clustering
495
Topics in Information Retrieval
529
Text Categorization
575
Tiny Statistical Tables
609
Index
657

Markov Models
317
PartofSpeech Tagging
341

Common terms and phrases

References to this book

All Book Search results »

About the author (1999)

Christopher Manning is an Associate Professor of Computer Science and Linguistics at Stanford University. His research concentrates on probabilistic models of language and statistical natural language processing, information extraction, text understanding and text mining.

Dr Hinrich Schutze resides as Chair of Theoretical Computational Linguistics at the Institute for Natural Language Processing, University of Stuttgart,

Bibliographic information