Data Mining: Practical Machine Learning Tools and Techniques, Second Edition

Front Cover
Elsevier, Jul 13, 2005 - Computers - 560 pages

Data Mining, Second Edition, describes data mining techniques and shows how they work. The book is a major revision of the first edition that appeared in 1999. While the basic core remains the same, it has been updated to reflect the changes that have taken place over five years, and now has nearly double the references.

The highlights of this new edition include thirty new technique sections; an enhanced Weka machine learning workbench, which now features an interactive interface; comprehensive information on neural networks; a new section on Bayesian networks; and much more.

This text is designed for information systems practitioners, programmers, consultants, developers, information technology managers, specification writers as well as professors and students of graduate-level data mining and machine learning courses.

  • Algorithmic methods at the heart of successful data mining—including tried and true techniques as well as leading edge methods
  • Performance improvement techniques that work by transforming the input or output
 

Contents

Concepts instances and attributes
41
Knowledge representation
61
Part II
80
4
83
2
89
5
91
5
143
1
148
The Explorer
369
Doing it again
377
Training and testing learning schemes
384
Clustering and association rules
391
Unsupervised instance filters
400
The Knowledge Flow interface
427
Analyzing the results
440
2
449

2
163
6
187
7
285
3
345
The Weka machine learning workbench
363
Going through the code
462
computeInfoGain
480
Index
505
About the authors
525
Copyright

Common terms and phrases

Popular passages

Page ii - Technology Cynthia Maro Saracco Readings in Database Systems, Third Edition Edited by Michael Stonebraker and Joseph M. Hellerstein Understanding SQL's Stored Procedures: A Complete Guide to SQL/PSM Jim Melton Principles of Multimedia Database Systems VS Subrahmanian Principles of Database Query Processing for Advanced Applications Clement T. Yu and Weiyi Meng Advanced Database Systems Carlo Zaniolo, Stefano Ceri, Christos Faloutsos, Richard T. Snodgrass, VS Subrahmanian, and Roberto Zicari Principles...
Page ii - Management of Heterogeneous and Autonomous Database Systems Edited by Ahmed Elmagarmid, Marek Rusinkiewicz, and Amit Sheth Object-Relational DBMSs: Tracking the Next Great Wave, Second Edition Michael Stonebraker and Paul Brown with...
Page i - The Morgan Kaufmann Series in Data Management Systems Series Editor: Jim Gray, Microsoft Research...
Page ii - JDBC, and Related Technologies Jim Melton and Andrew Eisenberg Database: Principles, Programming, and Performance, Second Edition Patrick and Elizabeth O'Neil The Object Data Standard: ODMG 3.0 Edited by RGG Cattell and Douglas K.
Page ii - SQL: 1999 — Understanding Relational Language Components Jim Melton and Alan R. Simon Information Visualization in Data Mining and Knowledge Discovery Edited by Usama Fayyad, Georges G. Grinstein, and Andreas Wierse Transactional Information Systems: Theory, Algorithms, and Practice of Concurrency Control and Recovery Gerhard Weikum and Gottfried Vossen Spatial Databases: With Application to...
Page 497 - In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, IJCAI'97, pages 729-737, Nagoya, Japan, August 1997.

About the author (2005)

Ian H. Witten is a professor of computer science at the University of Waikato in New Zealand. He directs the New Zealand Digital Library research project. His research interests include information retrieval, machine learning, text compression, and programming by demonstration. He received an MA in Mathematics from Cambridge University, England; an MSc in Computer Science from the University of Calgary, Canada; and a PhD in Electrical Engineering from Essex University, England. He is a fellow of the ACM and of the Royal Society of New Zealand. He has published widely on digital libraries, machine learning, text compression, hypertext, speech synthesis and signal processing, and computer typography.

Bibliographic information