Data Mining: Practical Machine Learning Tools and Techniques, Second Edition
As with any burgeoning technology that enjoys commercial attention, the use of data mining is surrounded by a great deal of hype. Exaggerated reports tell of secrets that can be uncovered by setting algorithms loose on oceans of data. But there is no magic in machine learning, no hidden power, no alchemy. Instead there is an identifiable body of practical techniques that can extract useful information from raw data. This book describes these techniques and shows how they work.
The book is a major revision of the first edition that appeared in 1999. While the basic core remains the same, it has been updated to reflect the changes that have taken place over five years, and now has nearly double the references. The highlights for the new edition include thirty new technique sections; an enhanced Weka machine learning workbench, which now features an interactive interface; comprehensive information on neural networks; a new section on Bayesian networks; plus much more.
* Algorithmic methods at the heart of successful data mining—including tried and true techniques as well as leading edge methods
* Performance improvement techniques that work by transforming the input or output
* Downloadable Weka, a collection of machine learning algorithms for data mining tasks, including tools for data pre-processing, classification, regression, clustering, association rules, and visualization—in a new, interactive interface
What people are saying - Write a review
Doing it again
Training and testing learning schemes
Clustering and association rules
Unsupervised instance filters
The Knowledge Flow interface
Analyzing the results
Other editions - View all
applied association rules astigmatic attribute values Bayesian networks calculate called Chapter choose class value classifier contact lens cost coverage cross-validation data mining database dataset decision tree described distribution error rate estimate evaluate example Figure humidity hypermetrope hyperplane hyperrectangle input instance space instance-based learning involved Iris setosa Iris versicolor Iris virginica kD-tree leaf learner learning algorithms learning scheme linear models linear regression logistic regression loss function machine learning measure missing values model tree myope Na´ve Bayes node nominal attributes number of instances numeric attributes numeric prediction outcome outlook output overcast overfitting parameters particular perceptron performance petal length play possible pre-presbyopic probability problem produce pruning represent result rule set sample Section setosa simple split statistical structure subset subtree sunny support vector machines Table tear production rate techniques temperature test instance test set tion training data training instances training set true weather data weights Weka windy zero
Page ii - Technology Cynthia Maro Saracco Readings in Database Systems, Third Edition Edited by Michael Stonebraker and Joseph M. Hellerstein Understanding SQL's Stored Procedures: A Complete Guide to SQL/PSM Jim Melton Principles of Multimedia Database Systems VS Subrahmanian Principles of Database Query Processing for Advanced Applications Clement T. Yu and Weiyi Meng Advanced Database Systems Carlo Zaniolo, Stefano Ceri, Christos Faloutsos, Richard T. Snodgrass, VS Subrahmanian, and Roberto Zicari Principles...
Page i - The Morgan Kaufmann Series in Data Management Systems Series Editor: Jim Gray, Microsoft Research...
Page ii - JDBC, and Related Technologies Jim Melton and Andrew Eisenberg Database: Principles, Programming, and Performance, Second Edition Patrick and Elizabeth O'Neil The Object Data Standard: ODMG 3.0 Edited by RGG Cattell and Douglas K.
Page ii - SQL: 1999 — Understanding Relational Language Components Jim Melton and Alan R. Simon Information Visualization in Data Mining and Knowledge Discovery Edited by Usama Fayyad, Georges G. Grinstein, and Andreas Wierse Transactional Information Systems: Theory, Algorithms, and Practice of Concurrency Control and Recovery Gerhard Weikum and Gottfried Vossen Spatial Databases: With Application to...