This book is aimed at the data scientist with some familiarity with the R and/or Python programming languages, and with some prior (perhaps spotty or ephemeral) exposure to statistics. Two of the authors came to the world of data science from the world of statistics, and have some appreciation of the contribution that statistics can make to the art of data science. At the same time, we are well aware of the limitations of traditional statistics instruction: statistics as a discipline is a century and a half old, and most statistics textbooks and courses are laden with the momentum and inertia of an ocean liner. All the methods in this book have some connection — historical or methodological — to the discipline of statistics.
Two goals underlie this book:
– To lay out, in digestible, navigable, and easily referenced form, key concepts from statistics that are relevant to data science.
– To explain which concepts are important and useful from a data science perspective, which are less so, and why.
(Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python 2nd Edition; Peter Bruce, Andrew Bruce, Peter Gedeck; O’Reilly Media ; Preface: Page xiii)