Machine Learning with Spark - Tackle Big Data with Powerful Spark Machine Learning Algorithms: Nick Pentreath: 9781783288519: Amazon.com: Books

@machinelearnbot 

This book is a nice introduction to using the Apache Spark framework. It assumes no prior knowledge of either Hadoop, Spark or machine learning itself (although the latter is covered at quite a rapid pace in places so some background would likely be helpful!). The code examples are presented in Python and (mainly) Scala, with examples that are reasonably well-described. The overall tone of the book is clear and the chapters progress in a logical order, with a fairly rapid journey through the main machine learning techniques from a Spark perspective. Later chapters were particularly interesting, covering text mining and more complex methods (e.g. Some of the example data sets feel a little'tired' (movie ratings data yet again - or perhaps I've just read too many machine learning books), but otherwise this is a good book and comes recommended.