Contents vii
Acknowledgments xv
Preface xvii
I PART 1: AN INTRODUCTION TO VERIDICAL DATA SCIENCE 1
1 An introduction to veridical data science 3
2 The Data Science Life Cycle 23
3 Setting up your data science project 43
II PART 2: PREPARING, EXPLORING, AND DESCRIBING DATA 65
4 Data Preparation 67
5 Exploratory Data Analysis 109
6 Principal component analysis 149
7 Clustering 197
III PART 3: PREDICTION 253
8 An introduction to prediction problems 255
9 Predicting continuous responses with Least Squares 275
10 Extending the Least Squares algorithm 311
11 Predicting binary responses and logistic regression 353
12 Decision trees and random forest 403
13 Producing the final prediction results 437
14 Conclusion 473
Answers to True or False exercises 481