Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations

Front Cover
Elsevier Science, 2000 - Computers - 371 pages

"This is a milestone in the synthesis of data mining, data analysis, information theory, and machine learning."

-Jim Gray, Microsoft Research


This book offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. Inside, you'll learn all you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining-including both tried-and-true techniques of the past and Java-based methods at the leading edge of contemporary research. If you're involved at any level in the work of extracting usable knowledge from large collections of data, this clearly written and effectively illustrated book will prove an invaluable resource.


Complementing the authors' instruction is a fully functional platform-independent Java software system for machine learning, available for download. Apply it to the sample data sets provided to refine your data mining skills, apply it to your own data to discern meaningful patterns and generate valuable insights, adapt it for your specialized data mining applications, or use it to develop your own machine learning schemes.


Features



* Helps you select appropriate approaches to particular problems and to compare and evaluate the results of different techniques.


* Covers performance improvement techniques, including input preprocessing and combining output from different methods.


* Comes with downloadable machine learning software: use it to master the techniques covered inside, apply it to your own projects, and/or customize it to meet special needs.

From inside the book

Contents

Whats it all about?
1
Concepts instances attributes
37
Knowledge representation
57
Copyright

9 other sections not shown

Other editions - View all

Common terms and phrases

About the author (2000)

Ian H. Witten is a professor of computer science at the University of Waikato in New Zealand. He directs the New Zealand Digital Library research project. His research interests include information retrieval, machine learning, text compression, and programming by demonstration. He received an MA in Mathematics from Cambridge University, England; an MSc in Computer Science from the University of Calgary, Canada; and a PhD in Electrical Engineering from Essex University, England. He is a fellow of the ACM and of the Royal Society of New Zealand. He has published widely on digital libraries, machine learning, text compression, hypertext, speech synthesis and signal processing, and computer typography.

Bibliographic information