NLP Tutorials -- Part 20: Compressive Transformer

Jun-3-2022, 07:00:32 GMT–#artificialintelligence

Welcome back to another improvement on the Transformer (Attention Is All You Need) architecture -- the Compressive Transformer. Like Transformer-XL, it models long sequences efficiently by keeping a memory of past activations. Instead of discarding the oldest memories, however, it compresses them into a smaller, coarser memory, so a long context can be kept with a lower memory requirement than a vanilla Transformer attending over the same span. The image below depicts how the memory is compressed. This draws a loose parallel with the human brain, which remembers so much by compressing and storing information intelligently. This sure seems interesting, doesn't it?
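To make the compression step concrete, here is a minimal sketch in plain Python. It assumes mean pooling as the compression function (one of the simpler options, chosen here only for illustration), and the names `mean_pool`, `update_memories`, `mem_size`, `cmem_size` and `rate` are hypothetical, not taken from the tutorial. States pushed out of the regular memory are averaged in groups of `rate` and appended to the compressed memory instead of being thrown away.

```python
from typing import List, Tuple

Vector = List[float]


def mean_pool(states: List[Vector], rate: int) -> List[Vector]:
    """Average each group of `rate` consecutive states into a single state."""
    pooled = []
    for start in range(0, len(states), rate):
        group = states[start:start + rate]
        dim = len(group[0])
        pooled.append([sum(v[i] for v in group) / len(group) for i in range(dim)])
    return pooled


def update_memories(
    memory: List[Vector],
    compressed_memory: List[Vector],
    new_states: List[Vector],
    mem_size: int,
    cmem_size: int,
    rate: int,
) -> Tuple[List[Vector], List[Vector]]:
    """Append new states; compress whatever overflows the regular memory.

    Assumes mem_size >= 1 and cmem_size >= 1 (illustrative sketch only).
    """
    memory = memory + new_states
    overflow = max(0, len(memory) - mem_size)
    evicted, memory = memory[:overflow], memory[overflow:]
    if evicted:
        # The oldest states are compressed rather than discarded.
        compressed_memory = compressed_memory + mean_pool(evicted, rate)
        # The compressed memory is itself bounded: keep only the newest slots.
        compressed_memory = compressed_memory[-cmem_size:]
    return memory, compressed_memory


# Feed four segments of two 1-dimensional states each: 0.0, 1.0, ..., 7.0.
memory: List[Vector] = []
cmem: List[Vector] = []
for step in range(4):
    segment = [[float(step * 2 + i)] for i in range(2)]
    memory, cmem = update_memories(
        memory, cmem, segment, mem_size=4, cmem_size=3, rate=2
    )

print(memory)  # the four most recent states, uncompressed
print(cmem)    # older states, two averaged into each slot
```

With a compression rate of 2, every two evicted states occupy one slot, so the same number of stored vectors covers a longer stretch of the past at coarser resolution. A real model would apply this per layer to hidden-state tensors and could learn the compression function rather than fixing it to an average.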

compressive transformer, transformer, transformer-xl