Books

The best books on Data Science, Big Data, Data Mining, Machine Learning, Python, R, SQL, NoSQL and more.

Data Mining and Machine Learning
Real-World Active Learning

Ted Cuzzillo, 2015

Applications and Strategies for Human-in-the-loop Machine Learning.

Data Mining and Machine Learning
A Course in Machine Learning

Hal Daumé III, 2014

SQL, NoSQL, and Databases
The Little MongoDB Book
Languages: MongoDB

Karl Seguin, 2011

MongoDB is an open-source NoSQL database that is easily scalable and high-performance. It retains some similarities with relational databases, which, in my opinion, makes it a great choice for anyone approaching the NoSQL world.

Learning Languages
Advanced R
Languages: R

Hadley Wickham, 2014

Useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. Drawing on more than ten years of experience programming in R, the author illustrates the elegance, beauty, and flexibility at the heart of R.

Learning Languages
Think Python 2nd Edition
Languages: Python

Allen Downey, 2015
Allen Downey is a Professor of Computer Science at Olin College.

This hands-on guide takes you through Python a step at a time, beginning with basic programming concepts before moving on to functions, recursion, data structures, and object-oriented design. Updated to Python 3.

Distributed Computing Tools
Hadoop Tutorial as a PDF

Tutorials Point
Online Learning Resource

An introduction to Hadoop, an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines.

Distributed Computing Tools
Hadoop Illuminated

Mark Kerzner & Sujee Maniyam, 2014

'Hadoop Illuminated' is an open-source book about Apache Hadoop™. It aims to make Hadoop knowledge accessible to a wider audience, not just to the highly technical.

Learning Languages
Python for You and Me
Languages: Python

Kushal Das, 2015

This is a simple book for learning the Python programming language, intended for programmers who are new to Python.

Data Science in General
School of Data Handbook

School of Data, 2015

The School of Data Handbook is a companion text to the School of Data. It functions much like a traditional textbook, providing the detail and background theory to support the School of Data courses and challenges.

Learning Languages
R Programming
Languages: R

Wikibooks, 2014

The aim of this Wikibook is to be the place where anyone can share his or her knowledge and tricks on R. It is meant to be organized by task rather than by discipline. We try to make a cross-disciplinary book, i.e. a book that can be used by all.
