Efficient Exact Gradient Update for training Deep Networks with Very Large Sparse Targets
Pascal Vincent, Alexandre de Brébisson, Xavier Bouthillier
Neural Information Processing Systems
An important class of problems involves training deep neural networks with sparse prediction targets of very high dimension D. These occur naturally in e.g.
Feb-7-2025, 11:02:57 GMT