On the biological plausibility of orthogonal initialisation for solving gradient instability in deep neural networks

Oct-27-2022–arXiv.org Artificial Intelligence

Initialising the synaptic weights of artificial neural networks (ANNs) with orthogonal matrices is known to alleviate vanishing and exploding gradient problems. A major objection against such initialisation schemes is that they are deemed biologically implausible as they mandate factorization techniques that are difficult to attribute to a neurobiological process. This paper presents two initialisation schemes that allow a network to naturally evolve its weights to form orthogonal matrices, provides theoretical analysis that pre-training orthogonalisation always converges, and empirically confirms that the proposed schemes outperform randomly initialised recurrent and feedforward networks.

artificial intelligence, machine learning, matrix, (19 more...)

arXiv.org Artificial Intelligence

Oct-27-2022

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - Lebanon (0.04)
- North America > United States
  - California > San Diego County
    - San Diego (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)

Genre:
- Research Report (0.50)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.88)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found