Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner

Front Cover
Morgan Kaufmann, Nov 27, 2014 - Computers - 446 pages

Put Predictive Analytics into ActionLearn the basics of Predictive Analysis and Data Mining through an easy to understand conceptual framework and immediately practice the concepts learned using the open source RapidMiner tool. Whether you are brand new to Data Mining or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions. Data Mining has become an essential tool for any enterprise that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, business intelligence and data warehousing professionals and for anyone who wants to learn Data Mining.You’ll be able to:1. Gain the necessary knowledge of different data mining techniques, so that you can select the right technique for a given data problem and create a general purpose analytics process.2. Get up and running fast with more than two dozen commonly used powerful algorithms for predictive analytics using practical use cases.3. Implement a simple step-by-step process for predicting an outcome or discovering hidden relationships from the data using RapidMiner, an open source GUI based data mining tool

Predictive analytics and Data Mining techniques covered: Exploratory Data Analysis, Visualization, Decision trees, Rule induction, k-Nearest Neighbors, Naïve Bayesian, Artificial Neural Networks, Support Vector machines, Ensemble models, Bagging, Boosting, Random Forests, Linear regression, Logistic regression, Association analysis using Apriori and FP Growth, K-Means clustering, Density based clustering, Self Organizing Maps, Text Mining, Time series forecasting, Anomaly detection and Feature selection. Implementation files can be downloaded from the book companion site at www.LearnPredictiveAnalytics.com

  • Demystifies data mining concepts with easy to understand language
  • Shows how to get up and running fast with 20 commonly used powerful techniques for predictive analysis
  • Explains the process of using open source RapidMiner tools
  • Discusses a simple 5 step process for implementing algorithms that can be used for performing predictive analytics
  • Includes practical use cases and examples
 

Contents

Chapter 1 Introduction
1
Chapter 2 Data Mining Process
17
Chapter 3 Data Exploration
37
Chapter 4 Classification
63
Chapter 5 Regression Methods
165
Chapter 6 Association Analysis
195
Chapter 7 Clustering
217
Chapter 8 Model Evaluation
257
Chapter 10 Time Series Forecasting
305
Chapter 11 Anomaly Detection
329
Chapter 12 Feature Selection
347
Chapter 13 Getting Started with RapidMiner
371
Comparison of Data Mining Algorithms
407
Index
417
About the Authors
425
Copyright

Chapter 9 Text Mining
275

Other editions - View all

Common terms and phrases

About the author (2014)

Vijay Kotu is Vice President of Analytics at ServiceNow. He leads the implementation of large-scale data platforms and services to support the company's enterprise business. He has led analytics organizations for over a decade with focus on data strategy, business intelligence, machine learning, experimentation, engineering, enterprise adoption, and building analytics talent. Prior to joining ServiceNow, he was Vice President of Analytics at Yahoo. He worked at Life Technologies and Adteractive where he led marketing analytics, created algorithms to optimize online purchasing behavior, and developed data platforms to manage marketing campaigns. He is a member of the Association of Computing Machinery and a member of the Advisory Board at RapidMiner.

Dr. Deshpande has extensive experience in working with companies ranging from startups to Fortune 5 in fields ranging from automotive, aerospace, retail, food, and manufacturing verticals delivering business analysis; designing and developing custom data products for implementing business intelligence, data science, and predictive analytics solutions. He was the Founder of SimaFore, a predictive analytics consulting company which was acquired by Soliton Inc., a provider of testing solutions for the semiconductor industry. He was also the Founding Co-chair of the annual Predictive Analytics World-Manufacturing conference. In his professional career he has worked with Ford Motor Company on their product development, with IBM at their IBM Watson Center of Competence, and with Domino’s Pizza at their data science and artificial intelligence groups. He has a Ph.D. from Carnegie Mellon and an MBA from Ross School of Business, Michigan.

Bibliographic information