[1609.04836v1] On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima • /r/MachineLearning

Sep-20-2016, 07:05:21 GMT–@machinelearnbot

Have you seen the work of Friedlander and Schmidt, and my follow up paper (shameless plug, toot toot)? Though our analysis is restricted to convex functions, there is also a notion of "sharpness" of minima which is appears as the condition number of the problem.

artificial intelligence, large-batch training, machine learning, (3 more...)

@machinelearnbot

Sep-20-2016, 07:05:21 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found