Tag Archives: Suggested Reading

Beyond the Basics – Data Science for Business (Foster Provost & Tom Fawcett).

Data Science For BusinessIn an earlier post, I recommended a short read on introductory data science and big data. That book gave a fantastic overview of the major areas and ideas governing these disciplines. However, if you are motivated to dig deeper and wrap your head around details, then Data Science for Business should be your next read. This book does a fantastic job of helping the reader understand how one should think if they are considering data science as a profession, or they want to understand all the hype. Further, much detail and time is given to the idea of the “Data Analytics Lifecycle” which governs data science projects through process and a framework.  The authors meticulously step through the various modeling techniques with solid examples and explanations.  There are sections of the book that detail some math and their derivations which may prove to be challenge if your math is rusty. However it should not present too much of an obstacle with regards to understanding the gist of what is being conveyed.  In the preface, the authors state the book is intended for business people who are working with data scientists, managing data scientists, or seeking to understand the value in data science. Also, the book is suited to developers implementing data science solutions and finally, aspiring data scientists.  I believe that this book has a role to play in one’s education in data science and that it is an appropriate read for those wishing to understand, with detail, how data science is done and what it aims to achieve.


Louis V. Frolio

2 Comments

Filed under Suggested Readings

A Great Introduction to Data Science and Big Data.

SimpleIntroDataScienceFor those of you foraying into the world of Big Data (BD) and Data Science (DS), it can be challenging to find a single resource to help paint a meaningful high-level picture of what this stuff is all about.  Personally, I always like to start with a 30,000 foot view of the challenge or endeavor before me.  I find that it helps frame the important concepts better enabling the consumption and digestion of the details to follow.  This tiered approach is especially important to the disciplines of BD and DS.  A book I read in less than 45 minutes completely satisfied my 30,000 foot criteria.  The key to this book’s success is the organic progression of each chapter, the breadth of topics introduced and its overall brevity.  The authors (in a mere 65 pages) walk you through a summary of data science, a working definition of big data, the new technologies necessitated by big data, aspects of the data analytics lifecycle, key characteristics of a data scientist and approaches to effective communication as a data scientist.  If you have an interest in DS or BD, get your hands on this book.  It provides a simple overview of the complicated disciplines of data science and big data.

Louis V. Frolio

Leave a Comment

Filed under Suggested Readings