Hierarchical Representations for Spatio-Temporal Visual Attention Modeling and Understanding
Fernández-Torres, Miguel-Ángel
–arXiv.org Artificial Intelligence
Thesis concerns the study and development of hierarchical representations for spatio-temporal visual attention modeling and understanding in video sequences. More specifically, we propose two computational models for visual attention. First, we present a generative probabilistic model for context-aware visual attention modeling and understanding. Secondly, we develop a deep network architecture for visual attention modeling, which first estimates top-down spatio-temporal visual attention, and ultimately serves for modeling attention in the temporal domain. The first part of the thesis introduces our first proposal: a generative probabilistic framework for spatio-temporal visual attention modeling and understanding.
arXiv.org Artificial Intelligence
Aug-9-2023
- Country:
- Africa > Mozambique (0.04)
- Antarctica (0.04)
- Asia
- China > Guangxi Province
- Nanning (0.04)
- Middle East
- Russia (0.04)
- China > Guangxi Province
- Europe
- France (0.04)
- Greece (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Romania > București - Ilfov Development Region
- Municipality of Bucharest > Bucharest (0.04)
- Russia (0.04)
- Spain > Galicia
- Madrid (0.04)
- United Kingdom
- England
- Cambridgeshire > Cambridge (0.04)
- Greater London > London
- Wimbledon (0.04)
- Oxfordshire > Oxford (0.04)
- North Sea > Southern North Sea (0.04)
- England
- North America > United States
- California
- Los Angeles County > Los Angeles (0.04)
- Santa Clara County
- Mountain View (0.04)
- Palo Alto (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Indiana (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.13)
- New York > New York County
- New York City (0.04)
- California
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Education (0.67)
- Government (0.92)
- Health & Medicine > Therapeutic Area
- Neurology (1.00)
- Leisure & Entertainment
- Media > Film (1.00)
- Transportation (0.67)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Problem Solving (1.00)
- Machine Learning
- Learning Graphical Models
- Directed Networks > Bayesian Learning (1.00)
- Undirected Networks > Markov Models (0.67)
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Statistical Learning (1.00)
- Learning Graphical Models
- Natural Language > Text Processing (1.00)
- Representation & Reasoning > Uncertainty
- Bayesian Inference (0.92)
- Vision > Image Understanding (1.00)
- Information Technology > Artificial Intelligence