Jose Quesada
Machine Learning with Scala on Spark by Jose Quesada
This video was recorded at Scala Days Berlin 2016 follow us on Twitter @ScalaDays or visit our website for more information http://scaladays.org Abstract: What new superpowers does it give me? The machine learning libraries in Apache Spark are an impressive piece of software engineering, and are maturing rapidly. At Data Science Retreat we've taken a real-world dataset and worked through the stages of building a predictive model -- exploration, data cleaning, feature engineering, and model fitting -- in several different frameworks. We'll show what it's like to work with Spark.ml, and compare it to other widely used frameworks (in R and python) along several dimensions: ease of use, productivity, feature set, and performance.