Goto

Collaborating Authors

 timescale





Parameters as interacting particles: long time convergence and asymptotic error scaling of neural networks

Grant Rotskoff, Eric Vanden-Eijnden

Neural Information Processing Systems

Theperformance ofneural networksonhigh-dimensional datadistributions suggests that it may be possible to parameterize a representation of agiven highdimensional function with controllably small errors, potentially outperforming standard interpolation methods. We demonstrate, both theoretically and numerically, that this is indeed the case. We map the parameters of a neural network to a system of particles relaxing with an interaction potential determined by the lossfunction.



UnderstandingAdaptive, MultiscaleTemporal IntegrationInDeepSpeechRecognitionSystems

Neural Information Processing Systems

A central challenge of representing natural signals, such as speech and music, is that they are structured across many different timescales (Chomsky and Halle, 1968; Lerdahl and Jackendoff, 1985; Hickokand Poeppel, 2007).


a57ecd54d4df7d999bd9c5e3b973ec75-Supplemental.pdf

Neural Information Processing Systems

Wecanseethis as the slope of the update function changes (middle row of Figure 1), these green lines correspond tothelocations givenbythearrowsinthetoprow.