The idea is ridiculously simple (perhaps why it is effective?): randomly skip layers while training • /r/MachineLearning

Apr-3-2016, 16:15:28 GMT–#artificialintelligence

The idea is ridiculously simple (perhaps why it is effective?): I don't understand the claim "Remember all the narratives we told about how depth learns hierarchical representations, and higher level representations -- those higher level representations don't seem to matter so much after all.". The net has over 100 layers!?! I imagine that this also works reasonably well in the RNN encoder in an encoder/decoder framework. I wonder if it also applies to generative RNNs.

artificial intelligence, skip layer, social media, (4 more...)

#artificialintelligence

Apr-3-2016, 16:15:28 GMT

News Web Page

Add feedback

Industry:
- Media > News (0.40)

Technology:
- Information Technology
  - Artificial Intelligence (1.00)
  - Communications > Social Media (0.91)