[R] Be Careful What You Backpropagate: A Case For Linear Output Activations & Gradient Boosting • r/MachineLearning

Jul-14-2017, 02:05:07 GMT–@machinelearnbot

I'm a little bewildered here. The paper says: "Note, that the softmax is not included in the table for the very simple reason that it gave miserable results on this NN configuration." Softmax with cross-entropy loss is the de facto output setup in fully connected networks (FCNs). They don't specify whether that test used CE or MSE error, but even if it used MSE (as a later experiment does), that just speaks to how poorly designed their network is (392-50-10 neurons is truly weird). The idea bears some resemblance to momentum, where we gradually speed things up when the error gradients are consistent.
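To make the momentum comparison concrete, here is a minimal sketch of classical momentum on a scalar parameter. It is not the paper's method. The function name `momentum_steps` and the values `lr=0.1` and `beta=0.9` are assumptions picked for illustration. It only shows that the step size grows while successive gradients agree in sign and stays small when they alternate.

```python
def momentum_steps(gradients, lr=0.1, beta=0.9):
    """Return the update applied at each step by classical momentum.

    Hypothetical helper for illustration: `lr` and `beta` are assumed values.
    """
    velocity = 0.0
    updates = []
    for g in gradients:
        # Accumulate a decaying sum of past gradients.
        velocity = beta * velocity + g
        updates.append(lr * velocity)
    return updates


# Gradients that always point the same way: the steps keep growing.
consistent = momentum_steps([1.0] * 5)

# Gradients that flip sign every step: the steps largely cancel out.
alternating = momentum_steps([1.0, -1.0, 1.0, -1.0, 1.0])

print([round(u, 4) for u in consistent])
print([round(u, 4) for u in alternating])
```

With consistent gradients the accumulated velocity approaches `1 / (1 - beta)` times the raw gradient, which is the "gradually speed things up" behaviour the comment alludes to.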

artificial intelligence, machine learning, social media
