Document warehousing and text mining
What developers need to know about the rapidly growing technologies of document warehousing and text mining
This unique book shows warehouse developers and managers how to build this new type of warehouse, how to organize free-form text for easy access, and, most importantly, how to exploit text mining techniques to provide timely and accurate information for decision-makers. The author covers the complete process of building and managing a document warehouse, including examples of actual implementations, a review of security issues and tools such as XML and Wide Area Information Servers and their selection criteria, and how text mining techniques are different from data mining techniques.
85 pages matching operations in this book
Results 1-3 of 85
What people are saying - Write a review
We haven't found any reviews in the usual places.
Expanding the Scope of Business Intelligence
The Role of Text Mining in Document Warehousing
Understanding the Structure of Text
28 other sections not shown
analyzing applications approach basic business intelligence chapter character set classification clustering Common Warehouse Metamodel competitive intelligence crawlers customers data mining data warehouse data warehousing defined describe developed docu document management systems document retrieval document sources document warehouse document warehousing domain Dublin Core end users example feature extraction Figure file system hierarchical IBM Intelligent Miner identify industry information retrieval interest internal Internet keywords load machine translation ment metadata migraines Miner for Text multiple number of documents operations options organization particular patterns preprocessing problem query relational database relevant repository search engines semantic sentence servers set of documents specific standard steps storage stored structure summarization target taxonomy techniques text analysis tools text mining thematic indexing tion topics transformation ument understanding UWORD vector space model weights wget words