Noise-induced degeneration in online learning

Sato, Yuzuru, Tsutsui, Daiji, Fujiwara, Akio

arXiv.org Machine Learning 

Gradient descent is the simplest optimisation algorithm, represented by gradient dynamics in a potential. When the input data are finite, gradient descent dynamics fluctuate due to finite-size effects; the resulting algorithm is called stochastic gradient descent. In this paper, we study the stability of stochastic gradient descent dynamics from the viewpoint of dynamical systems theory. Learning is characterised as nonautonomous dynamics driven by uncertain external input, and as multi-scale dynamics consisting of slow memory dynamics and fast system dynamics. When the uncertain input sequences are modelled by stochastic processes, the dynamics of learning are described by a random dynamical system. In contrast to the traditional Fokker-Planck approaches [5, 15], the random dynamical system approach enables the study not only of stationary distributions and global statistics, but also of the pathwise structure of stochastic dynamics. Based on nonautonomous and random dynamical system theory, stability and bifurcation in machine learning can be analysed.
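The view of stochastic gradient descent as a random dynamical system can be illustrated with a minimal sketch: at each step a data point is drawn at random from a finite sample, so the update map itself is random. The quadratic per-sample loss, the data, and the learning rate below are illustrative assumptions, not taken from the paper.

```python
import random

random.seed(0)
data = [random.gauss(1.0, 0.5) for _ in range(20)]  # finite input data
eta = 0.1                                           # learning rate

def grad(w, x):
    # gradient of the assumed per-sample loss (w - x)^2 / 2
    return w - x

w = 5.0
for step in range(200):
    x = random.choice(data)   # uncertain external input (random driving)
    w -= eta * grad(w, x)     # fast system dynamics driven by the input

# w fluctuates around the empirical mean of the data,
# i.e. the minimum of the averaged potential
print(abs(w - sum(data) / len(data)) < 1.0)
```

Here the stationary behaviour is not a fixed point but a fluctuating trajectory near the potential minimum, which is the kind of pathwise structure the random dynamical system approach makes accessible.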
