Neuromodulation Gated Transformer
Knowles, Kobe, Bensemann, Joshua, Benavides-Prado, Diana, Yogarajan, Vithya, Witbrock, Michael, Dobbie, Gillian, Chen, Yang
–arXiv.org Artificial Intelligence
We introduce a novel architecture, the Neuromodulation Gated Transformer (NGT), which implements neuromodulation in transformers via a multiplicative effect. We compare it to baselines and show that it results in the best average performance on the SuperGLUE benchmark validation sets. Cellular neuromodulation is a biological mechanism involving neurons, where their intrinsic properties are continuously modified in a context-dependent manner according to stimuli, i.e., biochemicals called neuromodulators (Bargmann & Marder, 2013; Marder et al., 2014; Shine et al., 2021; Vecoven et al., 2020); it allows for the regulation of a population of neurons (Katz & Edwards, 1999). It has achieved notable success in the continual learning domain (Beaulieu et al., 2020; Ellefsen et al., 2015; Velez & Clune, 2017). Transformers (Vaswani et al., 2017) are architectures that eliminate recurrence by relying entirely on attention.
arXiv.org Artificial Intelligence
May-11-2023
- Country:
- North America > United States > Minnesota (0.29)
- Genre:
- Research Report (0.65)
- Industry:
- Technology: