PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture
–arXiv.org Artificial Intelligence
We investigate the role of attention and memory in complex reasoning tasks. We analyze Transformer-based self-attention as a model and extend it with memory. By studying a synthetic visual reasoning test, we refine the taxonomy of reasoning tasks. Incorporating self-attention with ResNet50, we enhance feature maps using feature-based and spatial attention, achieving efficient solving of challenging visual reasoning tasks. Our findings contribute to understanding the attentional needs of SVRT tasks. Additionally, we propose GAMR, a cognitive architecture combining attention and memory, inspired by active vision theory. GAMR outperforms other architectures in sample efficiency, robustness, and compositionality, and shows zero-shot generalization on new reasoning tasks.
arXiv.org Artificial Intelligence
Jun-28-2023
- Country:
- Africa > Middle East
- Algeria > In Salah Province > In Salah (0.04)
- Asia
- India
- Chandigarh (0.04)
- Karnataka > Bengaluru (0.04)
- West Bengal > Kharagpur (0.04)
- Japan > Honshū
- Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Middle East > Qatar
- India
- Europe
- Austria (0.04)
- Belgium > Flanders
- Flemish Brabant > Leuven (0.04)
- France > Occitanie
- Haute-Garonne > Toulouse (0.04)
- Norway > Norwegian Sea (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- North America
- Mexico > Puebla (0.04)
- United States
- California > Los Angeles County
- Los Angeles (0.14)
- New York (0.04)
- Pennsylvania (0.04)
- Texas > Travis County
- Austin (0.04)
- California > Los Angeles County
- Africa > Middle East
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Problem Solving (1.00)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (1.00)
- Natural Language > Large Language Model (0.86)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence