Hierarchical Representations for Spatio-Temporal Visual Attention Modeling and Understanding

Aug-9-2023–arXiv.org Artificial Intelligence

Thesis concerns the study and development of hierarchical representations for spatio-temporal visual attention modeling and understanding in video sequences. More specifically, we propose two computational models for visual attention. First, we present a generative probabilistic model for context-aware visual attention modeling and understanding. Secondly, we develop a deep network architecture for visual attention modeling, which first estimates top-down spatio-temporal visual attention, and ultimately serves for modeling attention in the temporal domain. The first part of the thesis introduces our first proposal: a generative probabilistic framework for spatio-temporal visual attention modeling and understanding.

artificial intelligence, machine learning, natural language, (23 more...)

arXiv.org Artificial Intelligence

Aug-9-2023

arXiv.org PDF

Add feedback

Country:
- Antarctica (0.04)
- Africa > Mozambique (0.04)
- North America > United States
  - Indiana (0.04)
  - New York > New York County
    - New York City (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.13)
  - Illinois > Cook County
    - Chicago (0.04)
  - California
    - Los Angeles County > Los Angeles (0.04)
    - Santa Clara County
      - Palo Alto (0.04)
      - Mountain View (0.04)
- Europe
  - Russia (0.04)
  - Greece (0.04)
  - France (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
    - Cambridgeshire > Cambridge (0.04)
    - Greater London > London
      - Wimbledon (0.04)
  - Spain > Galicia
    - Madrid (0.04)
  - Romania > București - Ilfov Development Region
    - Municipality of Bucharest > Bucharest (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
- Asia
  - Russia (0.04)
  - Middle East
    - Jordan (0.04)
    - Israel (0.04)
  - China > Guangxi Province
    - Nanning (0.04)

Genre:
- Research Report > New Finding (1.00)
- Overview (1.00)

Industry:
- Media > Film (1.00)
- Transportation (0.67)
- Government (0.67)
- Education (0.67)
- Leisure & Entertainment
  - Sports (1.00)
  - Games (0.67)
- Health & Medicine > Therapeutic Area
  - Neurology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Vision > Image Understanding (1.00)
  - Natural Language > Text Processing (1.00)
  - Cognitive Science > Problem Solving (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.92)
  - Machine Learning
    - Statistical Learning (1.00)
    - Performance Analysis > Accuracy (1.00)
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models
      - Directed Networks > Bayesian Learning (1.00)
      - Undirected Networks > Markov Models (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found