Text mining application programming

Front Cover
Charles River Media, May 13, 2006 - Computers - 412 pages
1 Review
Text Mining Application Programming teaches software developers how to mine the vast amounts of information available on the Web, internal networks, and desktop files and turn it into usable data. The book helps developers understand the problems associated with managing unstructured text, and explains how to build your own mining tools using standard statistical methods from information theory, artificial intelligence, and operations research. Each of the topics covered are thoroughly explained and then a practical implementation is provided. The book begins with a brief overview of text data, where it can be found, and the typical search engines and tools used to search and gather this text. It details how to build tools for extracting and using the text, and covers the mathematics behind many of the algorithms used in building these tools. From there you'll learn how to build tokens from text, construct indexes, and detect patterns in text. You'll also find methods to extract the names of people, places, and organizations from an email, a news article, or a Web page. The next portion of the book teaches you how to find information on the Web, the structure of the Web, and how to build spiders to crawl the Web. Text categorization is also described in the context of managing email. The final part of the book covers information monitoring, summarization, and a simple Question & Answer (Q&A) system. The code used in the book is written in Perl, but knowledge of Perl is not necessary to run the software. Developers with an intermediate level of experience with Perl can customize the software. Although the book is about programming, methods are explained with English-like pseudocode and the source code is provided on the CD-ROM. After reading this book, you'll be ready to tap into the bevy of information available online in ways you never thought possible.

From inside the book

What people are saying - Write a review

Review: Text Mining Application Programming (Charles River Media Programming)

User Review  - Gary Lang - Goodreads

Great book. I wish the examples were written in .NET instead of Perl, but no matter - all the essential technologies are covered. Read full review

Related books

Contents

Introduction
1
Mathematics Background
31
Markov Models and POS Tagging 113
62
Copyright

11 other sections not shown

Common terms and phrases

References from web pages

Text Mining Application Programming | Bookwatch, The | Find ...
Text Mining Application Programming from Bookwatch, The in Arts provided free by Find Articles.
findarticles.com/ p/ articles/ mi_m0QLD/ is_2006_July/ ai_n16534148

CRM - Text Mining Application Programming
Teaches developers how to build text mining applications to manage unstructured text
www.charlesriver.com/ books/ BookDetail.aspx?productID=125195

Text Mining Application Programming is available from Bestprices ...
Text Mining Application Programming only $41.99, get the Text Mining Application Programming book From bestprices.com!
www.bestprices.com/ cgi-bin/ vlink/ 1584504609BT?id=nsession

Powell's Books - Text Mining Application Programming - With CD (06 ...
Text mining offers a way for individuals and corporations to exploit the vast amount of information available on the Internet
www.powells.com/ biblio?isbn=9781584504603

Text Mining Application Programming
textbooksrus.com Save 50-90% off New & Used Books and Textbooks. Absolute lowest prices! Free Shipping. Fast Delivery
www.textbooksrus.com/ search/ BookDetail/ ?isbn=1584504609& kbid=1067

Text Mining Application Programming (豆瓣)
第一个在"Text Mining Application Programming"的论坛里发言. 欢迎加入讨论,请先 登录或注册. 快速注册. 你的email地址: 请填写email 用于确认你的身份, ...
www.douban.com/ subject/ 2364176/

TEXT MINING APPLICATION PROGRAMMING - Manu Konchady - Comprar ...
TEXT MINING APPLICATION PROGRAMMING - Manu Konchady.
manu-konchady.comprar-livro.com.br/ livros/ 1158450460/

Text Mining Application Programming (Programming Series)
Text Mining Application Programming (Programming Series). Text Mining Application Programming (Programming Series). Purchase this Book · Purchase this Book ...
portal.acm.org/ citation.cfm?id=1137800

Text Mining Application Programming (Programming Series) -口コミ ...
Text Mining Application Programming (Programming Series)の激安・格安情報や感想、 口コミ評価など。インターネット通販(通信販売)で価格の違いを比較して最安値で...
kurabe.biz/ items/ ForeignBooks/ 1584504609

Information Systems
Great deals on computer books: Bargain hunting on over 80.000 computer books - price reductions shown in real-time! ...
www.mycompbookbee.co.uk/ cat801104.html

About the author (2006)

Manu Konchady (Oakton, VA) is a consultant working on open source text mining software. Previously, he worked at Mitre Corp. where he designed and developed software to mine the Internet. He received his Ph.D. in Information Technology from George Mason University and his articles have appeared in Dr. DobbAs Journal and Linux Journal.

Bibliographic information