Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations

Front Cover
Morgan Kaufmann, 2000 - Computers - 371 pages
19 Reviews

This book offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. Inside, you'll learn all you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining-including both tried-and-true techniques of the past and Java-based methods at the leading edge of contemporary research. If you're involved at any level in the work of extracting usable knowledge from large collections of data, this clearly written and effectively illustrated book will prove an invaluable resource.


Complementing the authors' instruction is a fully functional platform-independent Java software system for machine learning, available for download. Apply it to the sample data sets provided to refine your data mining skills, apply it to your own data to discern meaningful patterns and generate valuable insights, adapt it for your specialized data mining applications, or use it to develop your own machine learning schemes.



* Helps you select appropriate approaches to particular problems and to compare and evaluate the results of different techniques.
* Covers performance improvement techniques, including input preprocessing and combining output from different methods.
* Comes with downloadable machine learning software: use it to master the techniques covered inside, apply it to your own projects, and/or customize it to meet special needs.
  

What people are saying - Write a review

User ratings

5 stars
2
4 stars
8
3 stars
5
2 stars
3
1 star
1

Review: Data Mining: Practical Machine Learning Tools and Techniques

User Review  - JDK1962 - Goodreads

I really, really wanted to like this book more than I did. After all, it was about a topic that I have great interest in, and describes a workbench application (Weka) that I can command-line access ... Read full review

Review: Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann Series in Data Management Systems)

User Review  - Kid - Goodreads

Best introductory book on Data Mining in terms of concepts and practice. Not too academically but goal-driven and data-driven, which makes readers understand it easier. WEKA is a great tool, although ... Read full review

Contents

III
1
V
2
VI
4
VII
5
VIII
7
IX
8
XI
11
XII
13
CX
169
CXII
170
CXIII
171
CXIV
172
CXV
173
CXVI
174
CXVII
175
CXVIII
177

XIII
15
XIV
16
XV
17
XVI
20
XVII
21
XVIII
22
XIX
23
XX
24
XXI
25
XXII
26
XXIII
27
XXIV
28
XXV
29
XXVI
32
XXVII
34
XXVIII
37
XXX
38
XXXI
41
XXXII
45
XXXIII
48
XXXV
49
XXXVI
51
XXXVII
52
XXXVIII
53
XXXIX
54
XL
55
XLI
57
XLIII
58
XLV
59
XLVI
63
XLVII
64
XLVIII
67
XLIX
70
L
72
LI
75
LII
76
LIII
77
LIV
78
LV
80
LVI
81
LVII
82
LVIII
85
LIX
88
LX
89
LXI
93
LXII
94
LXIII
97
LXV
98
LXVII
103
LXVIII
104
LXIX
105
LXXI
108
LXXII
111
LXXIII
112
LXXV
113
LXXVII
114
LXXIX
115
LXXX
116
LXXXI
119
LXXXII
120
LXXXIII
123
LXXXIV
125
LXXXV
127
LXXXVII
128
LXXXVIII
129
LXXXIX
133
XC
134
XCI
135
XCII
136
XCIII
137
XCIV
139
XCV
141
XCVI
144
XCVII
145
XCVIII
147
XCIX
150
C
154
CI
155
CII
157
CIII
159
CV
161
CVI
162
CVII
164
CVIII
167
CIX
168
CXIX
178
CXX
181
CXXI
184
CXXII
187
CXXIII
188
CXXIV
189
CXXV
191
CXXVI
193
CXXVIII
194
CXXX
195
CXXXI
196
CXXXII
197
CXXXIII
199
CXXXIV
200
CXXXV
201
CXXXVI
202
CXXXVIII
203
CXXXIX
204
CXLI
205
CXLII
208
CXLIII
209
CXLIV
210
CXLV
211
CXLVI
212
CXLVII
217
CXLVIII
218
CXLIX
221
CL
223
CLI
225
CLII
226
CLIII
229
CLV
232
CLVI
233
CLVII
235
CLVIII
236
CLIX
238
CLX
239
CLXI
240
CLXII
243
CLXIII
244
CLXIV
246
CLXV
247
CLXVII
248
CLXVIII
249
CLXIX
250
CLXX
251
CLXXI
254
CLXXII
258
CLXXIII
260
CLXXIV
263
CLXXV
265
CLXXVI
267
CLXXVII
271
CLXXVIII
272
CLXXX
274
CLXXXI
276
CLXXXII
277
CLXXXV
279
CLXXXVI
282
CLXXXVII
283
CLXXXVIII
286
CLXXXIX
289
CXC
294
CXCI
296
CXCII
297
CXCIII
299
CXCV
306
CXCVII
314
CXCIX
316
CC
317
CCI
321
CCIII
322
CCIV
325
CCVI
327
CCVII
329
CCVIII
331
CCX
333
CCXI
334
CCXII
335
CCXIII
336
CCXIV
339
CCXV
351
CCXVI
371
Copyright

Common terms and phrases

References to this book

All Book Search results »

About the author (2000)

Ian H. Witten is a professor of computer science at the University of Waikato in New Zealand. He directs the New Zealand Digital Library research project. His research interests include information retrieval, machine learning, text compression, and programming by demonstration. He received an MA in Mathematics from Cambridge University, England; an MSc in Computer Science from the University of Calgary, Canada; and a PhD in Electrical Engineering from Essex University, England. He is a fellow of the ACM and of the Royal Society of New Zealand. He has published widely on digital libraries, machine learning, text compression, hypertext, speech synthesis and signal processing, and computer typography. He has written several books, the latest being Managing Gigabytes (1999) and Data Mining (2000), both from Morgan Kaufmann.

Eibe Frank lives in New Zealand with his Samoan spouse and two lovely boys, but originally hails from Germany, where he received his first degree in computer science from the University of Karlsruhe. He moved to New Zealand to pursue his Ph.D. in machine learning under the supervision of Ian H. Witten, and joined the Department of Computer Science at the University of Waikato as a lecturer on completion of his studies. He is now an associate professor at the same institution. As an early adopter of the Java programming language, he laid the groundwork for the Weka software described in this book. He has contributed a number of publications on machine learning and data mining to the literature and has refereed for many conferences and journals in these areas.>

Mark A. Hall was born in England but moved to New Zealand with his parents as a young boy. He now lives with his wife and four young children in a small town situated within an hour's drive of the University of Waikato. He holds a bachelor's degree in computing and mathematical sciences and a Ph.D. in computer science, both from the University of Waikato. Throughout his time at Waikato, as a student and lecturer in computer science and more recently as a software developer and data mining consultant for Pentaho, an open-source business intelligence software company, Mark has been a core contributor to the Weka software described in this book. He has published a number of articles on machine learning and data mining and has refereed for conferences and journals in these areas.

Bibliographic information