AITopics | bidirectional interaction

Lightweight Vision Transformer with Bidirectional Interaction

Neural Information Processing Systems, Feb-9-2026, 18:35:27 GMT

Recent advancements in vision backbones have significantly improved their performance by simultaneously modeling images' local and global contexts.

artificial intelligence, machine learning, transformer

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > Middle East > Saudi Arabia > Northern Borders Province > Arar (0.04)
Asia > China > Anhui Province > Hefei (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Lightweight Vision Transformer with Bidirectional Interaction

Neural Information Processing Systems, Dec-24-2025, 11:18:41 GMT

Recent advancements in vision backbones have significantly improved their performance by simultaneously modeling images' local and global contexts. However, the bidirectional interaction between these two contexts, which is important in the human visual system, has not been well explored and exploited.

bidirectional interaction, lightweight vision transformer, name change

Technology: Information Technology > Artificial Intelligence (0.39)

3170de57bc1899315b97712043d8bb22-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 09:54:12 GMT

interaction, transformer, vision transformer

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > Middle East > Saudi Arabia > Northern Borders Province > Arar (0.04)
Asia > China > Anhui Province > Hefei (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Lightweight Vision Transformer with Bidirectional Interaction

Neural Information Processing Systems, Oct-11-2024, 00:06:29 GMT

Recent advancements in vision backbones have significantly improved their performance by simultaneously modeling images' local and global contexts. However, the bidirectional interaction between these two contexts, which is important in the human visual system, has not been well explored and exploited. This paper proposes a Fully Adaptive Self-Attention (FASA) mechanism for vision transformers to model the local and global information, as well as the bidirectional interaction between them, in context-aware ways. Specifically, FASA employs self-modulated convolutions to adaptively extract local representations while utilizing self-attention in a down-sampled space to extract global representations. Subsequently, it conducts a bidirectional adaptation process between the local and global representations to model their interaction.
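
The abstract above names three steps: a self-modulated convolution for local context, self-attention in a down-sampled space for global context, and a bidirectional adaptation between the two. The toy sketch below illustrates one possible reading of those steps on a 1D token sequence using only the Python standard library. It is not the paper's implementation: the sigmoid gating, the fixed smoothing kernel, the average-pool down-sampling, the use of full-resolution queries, the absence of learned projections, and every function name are assumptions made for illustration.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def local_branch(tokens, kernel):
    """Assumed 'self-modulated' conv: a per-channel 1D convolution whose
    output is gated by a sigmoid of the input token itself."""
    n, d = len(tokens), len(tokens[0])
    half = len(kernel) // 2
    out = []
    for i in range(n):
        row = []
        for c in range(d):
            acc = 0.0
            for k, w in enumerate(kernel):
                j = i + k - half
                if 0 <= j < n:  # zero padding at the sequence borders
                    acc += w * tokens[j][c]
            row.append(acc * sigmoid(tokens[i][c]))
        out.append(row)
    return out

def downsample(tokens, stride):
    """Average-pool consecutive groups of `stride` tokens."""
    d = len(tokens[0])
    pooled = []
    for s in range(0, len(tokens), stride):
        group = tokens[s:s + stride]
        pooled.append([sum(t[c] for t in group) / len(group) for c in range(d)])
    return pooled

def global_branch(tokens, stride):
    """Softmax attention with full-resolution queries and down-sampled
    keys/values (an assumption; no learned projections are used)."""
    d = len(tokens[0])
    kv = downsample(tokens, stride)
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in kv]
        m = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        out.append([sum(e / z * v[c] for e, v in zip(exps, kv)) for c in range(d)])
    return out

def fasa_like_block(tokens, kernel=(0.25, 0.5, 0.25), stride=2):
    """Hypothetical block: run both branches, gate each by the other, then sum."""
    loc = local_branch(tokens, kernel)
    glo = global_branch(tokens, stride)
    # Assumed bidirectional adaptation: each branch is gated by the other one.
    loc_adapted = [[l * sigmoid(g) for l, g in zip(lr, gr)] for lr, gr in zip(loc, glo)]
    glo_adapted = [[g * sigmoid(l) for l, g in zip(lr, gr)] for lr, gr in zip(loc, glo)]
    return [[a + b for a, b in zip(ar, br)] for ar, br in zip(loc_adapted, glo_adapted)]

# Six tokens with three channels each; the output keeps the same shape.
tokens = [[0.1 * (i + c) for c in range(3)] for i in range(6)]
out = fasa_like_block(tokens)
```

Down-sampling the keys and values is what keeps the global branch cheap in this sketch: with stride `s`, each query attends to roughly `N / s` pooled tokens instead of all `N`.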

bidirectional interaction, lightweight vision transformer, transformer

Technology: Information Technology > Artificial Intelligence > Vision (0.78)

On the Emergence of Symmetrical Reality

Zhang, Zhenliang, Zhang, Zeyu, Jiao, Ziyuan, Su, Yao, Liu, Hangxin, Wang, Wei, Zhu, Song-Chun

arXiv.org Artificial Intelligence, Jan-26-2024

Artificial intelligence (AI) has revolutionized human cognitive abilities and facilitated the development of new AI entities capable of interacting with humans in both physical and virtual environments. Despite the existence of virtual reality, mixed reality, and augmented reality for several years, integrating these technical fields remains a formidable challenge due to their disparate application directions. The advent of AI agents, capable of autonomous perception and action, further compounds this issue by exposing the limitations of traditional human-centered research approaches. It is imperative to establish a comprehensive framework that accommodates the dual perceptual centers of humans and AI agents in both physical and virtual worlds. In this paper, we introduce the symmetrical reality framework, which offers a unified representation encompassing various forms of physical-virtual amalgamations. This framework enables researchers to better comprehend how AI agents can collaborate with humans and how distinct technical pathways of physical-virtual integration can be consolidated from a broader perspective. We then delve into the coexistence of humans and AI, demonstrating a prototype system that exemplifies the operation of symmetrical reality systems for specific tasks, such as pouring water. Subsequently, we propose an instance of an AI-driven active assistance service that illustrates the potential applications of symmetrical reality. This paper aims to offer beneficial perspectives and guidance for researchers and practitioners in different fields, thus contributing to the ongoing research about human-AI coexistence in both physical and virtual environments.

agent, ai agent, virtual world

arXiv.org Artificial Intelligence

2401.15132

Country:

Asia > China (0.04)
North America > United States > Hawaii (0.04)
North America > Canada (0.04)

Genre:

Overview (0.68)
Instructional Material (0.67)
Research Report (0.63)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Towards an AI assistant for human grid operators

Marot, Antoine, Rozier, Alexandre, Dussartre, Matthieu, Crochepierre, Laure, Donnot, Benjamin

arXiv.org Machine Learning, Dec-3-2020

Power systems are becoming more complex to operate in the digital age. As a result, real-time decision-making is getting more challenging, as human operators have to deal with more information, more uncertainty, more applications and more coordination. While supervision has been the primary means of helping them make decisions over the last decades, it cannot reasonably scale up anymore. There is a great need to rethink the human-machine interface under more unified and interactive frameworks. Taking advantage of the latest developments in human-machine interaction and artificial intelligence, we share the vision of a new assistant framework relying on a hypervision interface and greater bidirectional interactions. We review the known principles of decision-making that drive the assistant design and the supporting assistance functions we present. Finally, we share some guidelines for making progress towards the development of such an assistant.

information, interaction, operator

arXiv.org Machine Learning

2012.02026

Country: