Introduction to Chinese Natural Language Processing
This book introduces Chinese language-processing issues and techniques to readers who already have a basic background in natural language processing (NLP). Because the major difference between Chinese and Western languages lies at the word level, the book focuses primarily on Chinese morphological analysis and introduces the concept, structure, and interword semantics of Chinese words. The following topics are covered: a general introduction to Chinese NLP; Chinese characters, morphemes, and words, and the characteristics of Chinese words that have to be considered in NLP applications; Chinese word segmentation; unknown word detection; word meaning and Chinese linguistic resources; and interword semantics based on word collocation, together with NLP techniques for collocation extraction.
Table of Contents: Introduction / Words in Chinese / Challenges in Chinese Morphological Processing / Chinese Word Segmentation / Unknown Word Identification / Word Meaning / Chinese Collocations / Automatic Chinese Collocation Extraction / Appendix / References / Author Biographies
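Contents
Chapter 1: Introduction
Chapter 2: Words in Chinese
Chapter 3: Challenges in Chinese Morphological Processing
Chapter 4: Chinese Word Segmentation
Chapter 5: Unknown Word Identification
Chapter 6: Word Meaning
Chapter 7: Chinese Collocations
Chapter 8: Automatic Chinese Collocation Extraction
Appendix A
References
Author Biographies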
Other editions
Introduction to Chinese Natural Language Processing, Kam-Fai Wong, Wenjie Li, Ruifeng Xu, Zheng-sheng Zhang (2022)
Introduction to Chinese Natural Language Processing, Kam-Fai Wong, Wenjie Li, Ruifeng Xu, Zheng-sheng Zhang (2009)
Common terms and phrases
abbreviations, adjectives, adverbs, algorithm, ambiguity, antonyms, Big5, bigram, candidate, chapter, Chinese characters, Chinese collocation extraction, Chinese text, Chinese word segmentation, chunking, CILIN, co-occurrence, co-occurrence distribution, co-occurrence frequency, co-words, collocated words, Computational Linguistics, concepts, context, corpus, dictionary, dictionary-based, disyllabic compounds, encoding, English, examples, meanings, given name, grammatical, hanzi, headword, holonym, HowNet, hypernym, hyponym, identified, language processing, large number, lexical, lexicon, linguistic, location name, meanings of components, meanings of morphemes, meronym, monosyllabic, morphological processing, mutual information, n-gram collocation, name identification, named entity recognition, natural language, noun, occur, organization name, parsing, Peking University, person name, place names, POS tagging, probability, reduplication, relation, sememes, sentence, similar, Smadja, standard, statistical, structure, substitution, suffix, syllables, synonym substitution, synonyms, synsets, syntactic, traditional Chinese characters, true collocations, Type 2 collocations, Unicode, unknown word, verb, word combinations, word sequence, WordNet, Zhou, 香港