agent navigation
Language and Visual Entity Relationship Graph for Agent Navigation
Vision-and-Language Navigation (VLN) requires an agent to navigate in a real-world environment following natural language instructions. From both the textual and visual perspectives, we find that the relationships among the scene, its objects, and directional cues are essential for the agent to interpret complex instructions and correctly perceive the environment. To capture and utilize these relationships, we propose a novel Language and Visual Entity Relationship Graph for modelling the inter-modal relationships between text and vision, and the intra-modal relationships among visual entities. We propose a message passing algorithm for propagating information between language elements and visual entities in the graph, which we then combine to determine the next action to take. Experiments show that by taking advantage of these relationships we are able to improve over the state of the art. On the Room-to-Room (R2R) benchmark, our method achieves the new best performance on the test unseen split with a success rate weighted by path length (SPL) of 52%. On the Room-for-Room (R4R) dataset, our method significantly improves the previous best from 13% to 34% on success weighted by normalized dynamic time warping.
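To make the mechanism described above concrete, the sketch below shows one round of language-conditioned message passing over visual entity nodes (scene, object, and direction). This is not the authors' implementation; the class name, tensor shapes, single-step update, and the use of a GRU cell are all illustrative assumptions about how such a graph step could be wired up.

```python
# Minimal sketch (assumed design, not the paper's code): one round of
# language-conditioned message passing over visual entity nodes.
import torch
import torch.nn as nn


class EntityRelationGraphStep(nn.Module):
    def __init__(self, lang_dim: int, vis_dim: int, hid_dim: int):
        super().__init__()
        self.lang_attn = nn.Linear(vis_dim, lang_dim)       # scores instruction tokens per node
        self.msg = nn.Linear(vis_dim + lang_dim, hid_dim)    # builds a message per node
        self.update = nn.GRUCell(hid_dim, vis_dim)            # node update from aggregated messages

    def forward(self, lang_feats, vis_nodes, adj):
        # lang_feats: (L, lang_dim) instruction token features
        # vis_nodes:  (N, vis_dim) scene / object / direction node features
        # adj:        (N, N) float 0/1 adjacency among visual nodes
        # 1) inter-modal step: each visual node attends over the instruction
        scores = self.lang_attn(vis_nodes) @ lang_feats.t()            # (N, L)
        lang_ctx = torch.softmax(scores, dim=-1) @ lang_feats          # (N, lang_dim)
        # 2) intra-modal step: neighbours exchange language-grounded messages
        messages = self.msg(torch.cat([vis_nodes, lang_ctx], dim=-1))  # (N, hid_dim)
        agg = (adj @ messages) / adj.sum(dim=-1, keepdim=True).clamp(min=1)
        # 3) update node states; refined nodes would then score candidate actions
        return self.update(agg, vis_nodes)
```

In this reading, step 1 captures the inter-modal (text-to-vision) relationships and steps 2-3 the intra-modal relationships among visual entities; the refined node features would feed whatever action-scoring head the agent uses.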
Review for NeurIPS paper: Language and Visual Entity Relationship Graph for Agent Navigation
Weaknesses: - The proposed method is tailored for VLN, which may limit its generalization to other domains (it is not new for other vision-and-language tasks). If the same h_t and u are fed into the three attentions, how could different contexts be learned? There seems to be something wrong, either in the technique or in the notation. Moreover, VLN models may be sensitive to hyper-parameter tuning; it would be better if the authors could report the mean and standard deviation over multiple runs. In what cases would the proposed model fail?
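On the reviewer's question about the three attentions: one common resolution is that each attention module carries its own projection weights, so the same query h_t and memory u can still yield different contexts. The sketch below illustrates this point only; the layer names, dimensions, and the scene/object/direction split are assumptions for illustration, not the paper's notation.

```python
# Sketch: the same query (h_t) and memory (u) fed into three separately
# parameterized soft attentions can still produce different contexts,
# because each attention learns its own projection. Names/shapes are illustrative.
import torch
import torch.nn as nn


class SoftAttention(nn.Module):
    def __init__(self, query_dim: int, mem_dim: int):
        super().__init__()
        self.proj = nn.Linear(query_dim, mem_dim, bias=False)

    def forward(self, h_t, u):
        # h_t: (B, query_dim) agent state; u: (B, L, mem_dim) instruction features
        scores = torch.bmm(u, self.proj(h_t).unsqueeze(-1)).squeeze(-1)  # (B, L)
        weights = torch.softmax(scores, dim=-1)
        return torch.bmm(weights.unsqueeze(1), u).squeeze(1)             # (B, mem_dim)


# Three attentions with independent parameters (e.g. scene / object / direction):
attn_scene, attn_object, attn_direction = (SoftAttention(512, 512) for _ in range(3))
h_t, u = torch.randn(2, 512), torch.randn(2, 40, 512)
ctx_s, ctx_o, ctx_d = attn_scene(h_t, u), attn_object(h_t, u), attn_direction(h_t, u)
```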
Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-Scene
Human-robot collaboration is an essential research topic in artificial intelligence (AI), enabling researchers to devise cognitive AI systems and affording an intuitive means for users to interact with the robot. Communication, in particular, plays a central role. To date, prior studies in embodied agent navigation have only demonstrated that communication can be facilitated by instructions in natural language. Nevertheless, a plethora of other forms of communication remains unexplored. In fact, human communication originated in gestures and is oftentimes delivered through multimodal cues, e.g. "go there" with a pointing gesture.