Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

Rae, Jack, Hunt, Jonathan J., Danihelka, Ivo, Harley, Timothy, Senior, Andrew W., Wayne, Gregory, Graves, Alex, Lillicrap, Timothy

Feb-14-2020, 14:27:23 GMT–Neural Information Processing Systems

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows --- limiting their applicability to real-world domains. Here, we present an end-to-end differentiable memory access scheme, which we call Sparse Access Memory (SAM), that retains the representational power of the original approaches whilst training efficiently with very large memories. We show that SAM achieves asymptotic lower bounds in space and time complexity, and find that an implementation runs $1,\!000\times$ faster and with $3,\!000\times$ less physical memory than non-sparse models.

scaling memory-augmented neural network, sparse read and write

Neural Information Processing Systems

Feb-14-2020, 14:27:23 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.43)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.65)
  - Natural Language (0.63)