Azure/mmlspark
MMLSpark provides a number of deep learning and data science tools for Apache Spark, including seamless integration of Spark Machine Learning pipelines with Microsoft Cognitive Toolkit (CNTK) and OpenCV, enabling you to quickly create powerful, highly-scalable predictive and analytical models for large image and text datasets. MMLSpark requires Scala 2.11, Spark 2.1, and either Python 2.7 or Python 3.5 . See our notebooks for all examples. Below is an excerpt from a simple example of using a pre-trained CNN to classify images in the CIFAR-10 dataset. See other sample notebooks as well as the MMLSpark documentation for Scala and PySpark.
Jun-24-2017, 17:30:22 GMT