Data Preparation for Data Mining
Data Preparation for Data Mining addresses an issue unfortunately ignored by most authorities on data mining: data preparation. Thanks largely to its perceived difficulty, data preparation has traditionally taken a backseat to the more alluring question of how best to extract meaningful knowledge. But without adequate preparation of your data, the return on the resources invested in mining is certain to be disappointing.
Dorian Pyle corrects this imbalance. A twenty-five-year veteran of what has become the data mining industry, Pyle shares his own successful data preparation methodology, offering both a conceptual overview for managers and complete technical details for IT professionals. Apply his techniques and watch your mining efforts pay off-in the form of improved performance, reduced distortion, and more valuable results.
On the enclosed CD-ROM, you'll find a suite of programs as C source code and compiled into a command-line-driven toolkit. This code illustrates how the author's techniques can be applied to arrive at an automated preparation solution that works for you. Also included are demonstration versions of three commercial products that help with data preparation, along with sample data with which you can practice and experiment.
What people are saying - Write a review
I guess that the vast field of data mining has various levels of knowledge. I bought this book with the hope that it would assist me in the data preparation of my dataset that I was going to mine for my MSc thesis.
In short, this book was the most trivial book on Data mining / preparation that I have ever read. Examples were on minute datasets that need hardly any preparation; endless pages on describing basic facts on data; and no clear real-life examples that match the level of data mining encountered in industry.
I am very disapointed and wish I did not buy this book.
Data Exploration as a Process
The Nature of the World