Reviews: DropMax: Adaptive Variational Softmax

Neural Information Processing Systems 

This paper proposes applying dropout to the output softmax layer during supervised training of neural network classifiers. The dropout probabilities are adapted per example: they are computed as a function of the classifier's penultimate layer, so that layer produces both the class logits and the gating for those logits. The model combines ideas from adaptive dropout (Ba and Frey, NIPS 2013) and variational dropout (Kingma et al., NIPS 2015). The key problem being solved is how to perform inference to obtain the optimal dropout probabilities.
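To make the gating mechanism concrete, here is a minimal NumPy sketch of the forward pass as I understand it: the penultimate features feed two linear heads, one for the logits and one for per-class retain probabilities, and sampled Bernoulli masks gate the softmax. All weight names (`W_logit`, `W_gate`, etc.) are illustrative and not from the paper.

```python
import numpy as np

def dropmax_forward(h, W_logit, b_logit, W_gate, b_gate, rng):
    """Illustrative sketch of a DropMax-style forward pass.

    h: (batch, features) penultimate-layer activations.
    The same features produce both the class logits and the
    per-example, per-class retain probabilities.
    """
    z = h @ W_logit + b_logit                             # class logits
    rho = 1.0 / (1.0 + np.exp(-(h @ W_gate + b_gate)))    # retain probs in (0, 1)
    m = (rng.random(rho.shape) < rho).astype(float)       # sampled Bernoulli gates
    e = m * np.exp(z - z.max(axis=-1, keepdims=True))     # masked exp-logits
    e_sum = e.sum(axis=-1, keepdims=True)
    # Fall back to uniform if every class was dropped for an example
    # (a guard added here for the sketch, not part of the paper).
    p = np.where(e_sum > 0, e / np.maximum(e_sum, 1e-12), 1.0 / z.shape[-1])
    return p
```

The gating means the softmax is normalized only over the retained classes, which is what distinguishes this from standard dropout applied to the logits themselves.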