Yahoo! CaffeOnSpark: Distributed Deep Learning on Big Data Clusters

Apr-6-2016, 06:18:02 GMT–#artificialintelligence

Deep learning (DL) is a critical capability required by Yahoo product teams (ex. Flickr, Image Search) to gain intelligence from massive amounts of online data. Many existing DL frameworks require a separated cluster for deep learning, and multiple programs have to be created for a typical machine learning pipeline (see Figure 1). The separated clusters require large datasets to be transferred among them, and introduce unwanted system complexity and latency for end-to-end learning. As discussed in our earlier Tumblr post, we believe that deep learning should be conducted in the same cluster along with existing data processing pipelines to support feature engineering and traditional (non-deep) machine learning.

artificial intelligence, caffeonspark, machine learning, (11 more...)

#artificialintelligence

Apr-6-2016, 06:18:02 GMT

News Web Page

Add feedback

Industry:
- Information Technology > Services (0.59)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found