Data Science: Concepts and Practice

Front Cover
Morgan Kaufmann, Nov 27, 2018 - Computers - 568 pages

Learn the basics of Data Science through an easy to understand conceptual framework and immediately practice using RapidMiner platform. Whether you are brand new to data science or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions.

Data Science has become an essential tool to extract value from data for any organization that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, engineers, and analytics professionals and for anyone who works with data.

You’ll be able to:

  1. Gain the necessary knowledge of different data science techniques to extract value from data.
  2. Master the concepts and inner workings of 30 commonly used powerful data science algorithms.
  3. Implement step-by-step data science process using using RapidMiner, an open source GUI based data science platform

Data Science techniques covered: Exploratory data analysis, Visualization, Decision trees, Rule induction, k-nearest neighbors, Na´ve Bayesian classifiers, Artificial neural networks, Deep learning, Support vector machines, Ensemble models, Random forests, Regression, Recommendation engines, Association analysis, K-Means and Density based clustering, Self organizing maps, Text mining, Time series forecasting, Anomaly detection, Feature selection and more...

  • Contains fully updated content on data science, including tactics on how to mine business data for information
  • Presents simple explanations for over twenty powerful data science techniques
  • Enables the practical use of data science algorithms without the need for programming
  • Demonstrates processes with practical use cases
  • Introduces each algorithm or technique and explains the workings of a data science algorithm in plain language
  • Describes the commonly used setup options for the open source tool RapidMiner
 

What people are saying - Write a review

We haven't found any reviews in the usual places.

Contents

1 Introduction
1
2 Data Science Process
19
3 Data Exploration
39
4 Classification
65
5 Regression Methods
165
6 Association Analysis
199
7 Clustering
221
8 Model Evaluation
263
12 Time Series Forecasting
395
13 Anomaly Detection
447
14 Feature Selection
467
15 Getting Started with RapidMiner
491
Comparison of Data Science Algorithms
523
About the Authors
531
Index
533
Praise
545

9 Text Mining
281
10 Deep Learning
307
11 Recommendation Engines
343
Back Cover
549
Copyright

Other editions - View all

Common terms and phrases

About the author (2018)

Vijay Kotu is Vice President of Analytics at ServiceNow. He leads the implementation of large-scale data platforms and services to support the company's enterprise business. He has led analytics organizations for over a decade with focus on data strategy, business intelligence, machine learning, experimentation, engineering, enterprise adoption, and building analytics talent. Prior to joining ServiceNow, he was Vice President of Analytics at Yahoo. He worked at Life Technologies and Adteractive where he led marketing analytics, created algorithms to optimize online purchasing behavior, and developed data platforms to manage marketing campaigns. He is a member of the Association of Computing Machinery and a member of the Advisory Board at RapidMiner.

Dr. Deshpande has extensive experience in working with companies ranging from startups to Fortune 5 in fields ranging from automotive, aerospace, retail, food, and manufacturing verticals delivering business analysis; designing and developing custom data products for implementing business intelligence, data science, and predictive analytics solutions. He was the Founder of SimaFore, a predictive analytics consulting company which was acquired by Soliton Inc., a provider of testing solutions for the semiconductor industry. He was also the Founding Co-chair of the annual Predictive Analytics World-Manufacturing conference. In his professional career he has worked with Ford Motor Company on their product development, with IBM at their IBM Watson Center of Competence, and with Domino’s Pizza at their data science and artificial intelligence groups. He has a Ph.D. from Carnegie Mellon and an MBA from Ross School of Business, Michigan.

Bibliographic information