Data Mining Methods for the Content Analyst: An Introduction to the Computational Analysis of Content

Front Cover
Routledge, Nov 12, 2012 - Language Arts & Disciplines - 120 pages
0 Reviews

With continuous advancements and an increase in user popularity, data mining technologies serve as an invaluable resource for researchers across a wide range of disciplines in the humanities and social sciences. In this comprehensive guide, author and research scientist Kalev Leetaru introduces the approaches, strategies, and methodologies of current data mining techniques, offering insights for new and experienced users alike.

Designed as an instructive reference to computer-based analysis approaches, each chapter of this resource explains a set of core concepts and analytical data mining strategies, along with detailed examples and steps relating to current data mining practices. Every technique is considered with regard to context, theory of operation and methodological concerns, and focuses on the capabilities and strengths relating to these technologies. In addressing critical methodologies and approaches to automated analytical techniques, this work provides an essential overview to a broad innovative field.

 

What people are saying - Write a review

We haven't found any reviews in the usual places.

Contents

1 Introduction
1
2 Obtaining and Preparing Data
7
3 Vocabulary Analysis
26
4 Correlation and Cooccurrence
36
5 Lexicons Entity Extraction and Geocoding
43
6 Topic Extraction
57
7 Sentiment Analysis
65
8 Similarity Categorization and Clustering
71
9 Network Analysis
86
References
97
Index
100
Copyright

Common terms and phrases

About the author (2012)

Kalev Leetaru is Senior Research Scientist for Content Analysis at the University of Illinois Institute for Computing in Humanities, Arts, and Social Science and Center Affiliate of the National Center for Supercomputing Applications. He leads a number of large initiatives centering on the application of high performance computing to grand challenge problems using massive-scale document and data archives.

Bibliographic information