Advanced Analytics with Spark: Patterns for Learning from Data at Scale: Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills: 9781491912768: Amazon.com: Books

@machinelearnbot 

This book fills an important gap in large scale data science. Spark has emerged as the big data platform of choice for data scientists both from the ease of use as well as the performance / optimization point of view. In a few lines of Scala code, Spark allows you to write iterative algorithms that scale out very well. For a data scientist who wants to explore large scale data sets, Spark is a great starting point (this is incredible progress in the Spark community given the project is just about 4 years old). However, Spark itself is moving fast and maturing with time, and Spark and Scala as well as distributed algorithms are typically not in the arsenal of many data scientists today.