Challenges and Trends in Egocentric Vision: A Survey
Li, Xiang; Qiu, Heqian; Wang, Lanxiao; Zhang, Hanwen; Qi, Chenghao; Han, Linfeng; Xiong, Huiyu; Li, Hongliang
arXiv.org Artificial Intelligence
With the rapid development of artificial intelligence technologies and wearable devices, egocentric vision understanding has emerged as a new and challenging research direction, attracting growing attention from both academia and industry. Egocentric vision captures visual and multimodal data through cameras or sensors worn on the human body, offering a unique perspective that simulates human visual experience. This paper provides a comprehensive survey of research on egocentric vision understanding, systematically analyzing the components of egocentric scenes and categorizing the tasks into four main areas: subject understanding, object understanding, environment understanding, and hybrid understanding. We explore in detail the sub-tasks within each category and summarize the field's main current challenges and trends. Furthermore, this paper presents an overview of high-quality egocentric vision datasets, offering valuable resources for future research. By summarizing recent advances, we anticipate broad applications of egocentric vision technologies in fields such as augmented reality, virtual reality, and embodied intelligence, and propose future research directions based on the latest developments in the field.
Mar-19-2025
- Country:
  - Oceania > New Zealand (0.04)
  - North America > United States
  - Europe
    - United Kingdom > England
      - Greater London > London (0.04)
    - Switzerland > Zürich
      - Zürich (0.14)
    - Netherlands > North Holland
      - Amsterdam (0.04)
    - Italy
    - United Kingdom > England
  - Asia
    - India (0.04)
    - South Korea > Gwangju
      - Gwangju (0.04)
    - Singapore > Central Region
      - Singapore (0.04)
    - Japan > Honshū
      - Kantō > Tokyo Metropolis Prefecture
        - Tokyo (0.04)
      - Chūbu > Ishikawa Prefecture
        - Kanazawa (0.04)
      - Kantō > Tokyo Metropolis Prefecture
    - China
      - Sichuan Province > Chengdu (0.04)
      - Shaanxi Province > Xi'an (0.04)
      - Hong Kong (0.04)
- Genre:
  - Overview (1.00)
  - Research Report > Promising Solution (0.45)
  - Instructional Material > Course Syllabus & Notes (0.45)
- Industry:
  - Leisure & Entertainment (1.00)
  - Information Technology (1.00)
  - Media > Film (0.67)
  - Health & Medicine
    - Therapeutic Area (0.67)
    - Consumer Health (0.67)
  - Education > Educational Setting
    - Online (0.45)
- Technology:
  - Information Technology
    - Sensing and Signal Processing > Image Processing (1.00)
    - Hardware (1.00)
    - Data Science > Data Mining (1.00)
    - Communications > Networks (1.00)
    - Human Computer Interaction > Interfaces
      - Virtual Reality (1.00)
    - Artificial Intelligence
      - Robots (1.00)
      - Cognitive Science (1.00)
      - Natural Language > Large Language Model (0.93)
      - Representation & Reasoning > Agents (0.92)
      - Vision
        - Image Understanding (1.00)
        - Face Recognition (1.00)
      - Machine Learning
        - Statistical Learning (1.00)
        - Neural Networks > Deep Learning (1.00)
  - Information Technology