Goto

Collaborating Authors

 Africa





AdaptiveStochasticVarianceReduction forNon-convexFinite-SumMinimization

Neural Information Processing Systems

Toourknowledge, ADASPIDER isthefirstparameterfree non-convex variance-reduction method in the sense that it does not require the knowledge of problem-dependent parameters, such as smoothness constant L,targetaccuracyฯตoranybound ongradient norms.




MaskTune: MitigatingSpuriousCorrelationsby ForcingtoExplore

Neural Information Processing Systems

This workproposesMaskTune, a masking strategy that prevents over-reliance on spurious (or a limited number of) features. MaskTuneforces the trained model to explore new features during asingleepochfinetuning bymasking previously discoveredfeatures.MaskTune, unlike earlier approaches for mitigating shortcut learning, does not require any supervision, suchasannotating spurious features orlabels forsubgroup samples in a dataset.


high

Neural Information Processing Systems

We show it depends on the precise way in which the limit is taken, and in particular on how the quantityofdata,thehiddenlayerwidth,&thelearningratescalesasd .