PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture

Jun-28-2023–arXiv.org Artificial Intelligence

We investigate the role of attention and memory in complex reasoning tasks. We analyze Transformer-based self-attention as a model and extend it with memory. By studying a synthetic visual reasoning test, we refine the taxonomy of reasoning tasks. Incorporating self-attention with ResNet50, we enhance feature maps using feature-based and spatial attention, achieving efficient solving of challenging visual reasoning tasks. Our findings contribute to understanding the attentional needs of SVRT tasks. Additionally, we propose GAMR, a cognitive architecture combining attention and memory, inspired by active vision theory. GAMR outperforms other architectures in sample efficiency, robustness, and compositionality, and shows zero-shot generalization on new reasoning tasks.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

Jun-28-2023

arXiv.org PDF

Add feedback

Country:
- Africa > Middle East
  - Algeria > In Salah Province > In Salah (0.04)
- Asia
  - India
    - Chandigarh (0.04)
    - Karnataka > Bengaluru (0.04)
    - West Bengal > Kharagpur (0.04)
  - Japan > Honshū
    - Tōhoku > Fukushima Prefecture > Fukushima (0.04)
  - Middle East > Qatar
    - Ad-Dawhah > Doha (0.04)
- Europe
  - Austria (0.04)
  - Belgium > Flanders
    - Flemish Brabant > Leuven (0.04)
  - France > Occitanie
    - Haute-Garonne > Toulouse (0.04)
  - Norway > Norwegian Sea (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
- North America
  - Mexico > Puebla (0.04)
  - United States
    - California > Los Angeles County
      - Los Angeles (0.14)
    - New York (0.04)
    - Pennsylvania (0.04)
    - Texas > Travis County
      - Austin (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Problem Solving (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Statistical Learning (1.00)
  - Natural Language > Large Language Model (0.86)
  - Representation & Reasoning (1.00)
  - Vision (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found