The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data

Cambridge University Press, Dec 11, 2006 - Computers
7 Reviews
Text mining is a new and exciting area of computer science research that tries to solve the crisis of information overload by combining techniques from data mining, machine learning, natural language processing, information retrieval, and knowledge management. Similarly, link detection, a rapidly evolving approach to the analysis of text that shares and builds upon many of the key elements of text mining, also provides new tools for people to better leverage their burgeoning textual data resources. The Text Mining Handbook presents a comprehensive discussion of the state of the art in text mining and link detection. In addition to providing an in-depth examination of core text mining and link detection algorithms and operations, the book examines advanced pre-processing techniques, knowledge representation considerations, and visualization approaches. Finally, the book explores current real-world, mission-critical applications of text mining and link detection in such varied fields as M&A business intelligence, genomics research, and counter-terrorism activities.
  

What people are saying

User ratings

5 stars: 4
4 stars: 1
3 stars: 2
2 stars: 0
1 star: 0

Review: The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data

User Review  - Jill - Goodreads

Very much a handbook and sometimes completely over my head, but it gave me the intro information needed for my column, and for that I'm grateful.

Contents

Core Text Mining Operations
Text Mining Preprocessing Techniques
Categorization
Clustering
Information Extraction
Probabilistic Models for Information Extraction
Preprocessing Applications Using Probabilistic
Presentation-Layer Considerations for Browsing
Visualization Approaches
Link Analysis
Text Mining Applications
DIAL: A Dedicated Information Extraction Language
Bibliography
Index

Popular passages

Page 349 - In Proceedings of the 1st International Conference on Knowledge Discovery and Data Mining, pp.
Page 361 - Explora: A Multipattern and Multistrategy Discovery Assistant. In Advances in Knowledge Discovery and Data Mining, eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Cambridge, MA: MIT Press. 11. W. Klosgen, J. Zytkow 1996. Knowledge Discovery in Databases Terminology.
Page 355 - Extension and Integration of the Gene Ontology (GO): Combining GO Vocabularies with External Vocabularies.

About the author (2006)

Dr Ronen Feldman is a Senior Lecturer in the Mathematics and Computer Science Department of Bar-Ilan University and Director of the Data and Text Mining Laboratory. Dr Feldman is co-founder, Chief Scientist and Chairman of the Board of Clearforest, Ltd., a leader in developing next-generation text mining applications for corporate and government clients. He also recently served as an Adjunct Professor at New York University's Stern School of Business. A pioneer in the areas of machine learning, data mining, and unstructured data management, he has authored or co-authored more than 70 published articles and conference papers in these areas.

Jim Sanger is a venture capitalist, applied technologist and recognized industry expert in the areas of commercial data solutions, Internet applications and IT security products. He is a partner at ABS Ventures, an independent venture firm founded in 1982 and originally associated with technology banking leader Alex Brown and Sons. Immediately before joining ABS Ventures, Mr Sanger was a Managing Director in the New York offices of DB Capital Venture Partners, the global venture capital arm of Europe's largest financial institution, Deutsche Bank. Before transferring to DB Capital in New York, Mr Sanger was Chief Technology Officer and Director of Tech Investment for Deutsche Bank's London-based corporate development and venturing group, as well as Vice President of Software Development for the E-business division of Deutsche Bank's investment banking organization. Prior to his work at Deutsche Bank, Mr Sanger held a variety of senior IT positions at Barclays Bank and Bell Atlantic Corporation (now Verizon Communications). Mr Sanger has been a board member of several thought-leading technology companies, including Inxight Software, Gomez, Inc., and Clearforest, Inc.; he has also served as an official observer to the boards of AlphaBlox (acquired by IBM in 2004), Intralinks, and Imagine Software, and as a member of the Technical Advisory Board of Qualys, Inc. He has been a speaker at leading technology conferences, including Internet World, M-Commerce World, Euromoney Seminars, and Yankee Group-sponsored events. Mr Sanger received his BA, cum laude, from the University of Pennsylvania and attended postgraduate courses in Software Engineering and Information Technology at Oxford University and the University of Liverpool. Mr Sanger is a member of the IEEE and the American Association for Artificial Intelligence (AAAI).
