goal location
- Europe > Switzerland > Zürich > Zürich (0.14)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
- (2 more...)
- Law (0.67)
- Information Technology (0.67)
- Leisure & Entertainment > Games (0.46)
- Government (0.46)
- North America > United States > Oregon (0.04)
- North America > United States > Massachusetts (0.04)
- North America > Canada (0.04)
e7663e974c4ee7a2b475a4775201ce1f-Supplemental-Conference.pdf
The key challenge in making this connection is grounding the skills, so that each skill corresponds to a specific goal-conditioned policy. We start by recalling the definition of the discounted state occupancy measure (Eq. 3): p(s_{t+} = s_g) = (1 − γ) Σ_{t=0}^∞ γ^t p(s_t = s_g). On the second line, we changed the bounds of the summation to start at 0, and changed the terms inside the summation accordingly. On the third line, we applied linearity of expectation to move the summation inside the expectation. On the fourth line, we applied linearity of expectation again to move the t = 0 term inside the expectation. Finally, we substituted the definition of r_g(s, a) to obtain the desired result. This result means that we are doing policy improvement with approximate Q-values.
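As an illustration of the definition above, the discounted state occupancy measure can be estimated directly from a sampled rollout by discounting indicator events of reaching the goal state. The rollout, state names, and discount below are toy values of ours, not from the source:

```python
def discounted_occupancy(rollout, goal, gamma):
    """(1 - gamma) * sum_t gamma^t * 1[s_t == goal] for a single rollout."""
    return (1 - gamma) * sum(
        gamma ** t for t, s in enumerate(rollout) if s == goal
    )

# Toy deterministic rollout that visits the goal at t = 1 and t = 3,
# so the closed form is (1 - 0.9) * (0.9 + 0.9**3).
rollout = ["s0", "g", "s2", "g", "s4"]
occ = discounted_occupancy(rollout, "g", gamma=0.9)
```

Averaging this quantity over many sampled rollouts gives a Monte Carlo estimate of p(s_{t+} = s_g) for the policy that generated them.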
- North America > United States > Arizona > Maricopa County > Phoenix (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > France > Hauts-de-France > Nord > Lille (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe > France (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
A Ergodic
As alluded to in Section 3, the formulation discussed in this paper is suitable for reversible environments. While the weight for entropy is automatically adjusted using a dual variable, a similar scheme can be followed to relabel the demonstration set. First, we describe the reward function and the success metric corresponding to each environment; in each case, the success metric is the same as the reward function.
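The fragment above mentions that the entropy weight is adjusted automatically via a dual variable. The exact scheme used in the source is not recoverable from this excerpt, but a common SAC-style dual update (all names below are ours) raises the weight when policy entropy falls below a target and lowers it otherwise:

```python
def update_log_alpha(log_alpha, policy_entropy, target_entropy, lr=1e-3):
    """One dual ascent step on the log entropy temperature.

    The effective weight is exp(log_alpha); it grows when the policy's
    entropy is below the target and shrinks when it is above.
    """
    return log_alpha + lr * (target_entropy - policy_entropy)
```

Parameterizing the weight through its logarithm keeps it positive without any explicit projection step.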
Enhancing PIBT via Multi-Action Operations
Yukhnevich, Egor, Andreychuk, Anton
PIBT is a rule-based Multi-Agent Path Finding (MAPF) solver, widely used as a low-level planner or action sampler in many state-of-the-art approaches. Its primary advantage lies in its exceptional speed, enabling action selection for thousands of agents within milliseconds by considering only the immediate next timestep. However, this short-horizon design leads to poor performance in scenarios where agents have an orientation and must perform time-consuming rotation actions. In this work, we present an enhanced version of PIBT that addresses this limitation by incorporating multi-action operations. We detail the modifications introduced to improve PIBT's performance while preserving its hallmark efficiency. Furthermore, we demonstrate how our method, when combined with a graph-guidance technique and large neighborhood search optimization, achieves state-of-the-art performance in the online LMAPF-T setting.
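For readers unfamiliar with the base algorithm this paper extends: PIBT plans one timestep at a time, letting each agent greedily pick its best neighbor while pushing lower-priority occupants out of the way via priority inheritance. The following is a simplified single-timestep sketch of that core rule (without rotations, priority updates, or the multi-action operations the paper adds); the function names and toy graph are ours:

```python
def pibt_step(positions, goals, neighbors, dist, priority):
    """One PIBT timestep: returns a dict mapping each agent to its next vertex.

    positions/goals: agent -> vertex; neighbors(v): adjacent vertices;
    dist(v, g): distance heuristic; priority: agents, highest first.
    """
    nxt = {}  # decided moves (tentative reservations during recursion)
    occupant = {v: a for a, v in positions.items()}

    def plan(agent, parent_vertex):
        # Candidates: neighbors plus staying put, best-first for the goal.
        cands = sorted(neighbors(positions[agent]) + [positions[agent]],
                       key=lambda v: dist(v, goals[agent]))
        for v in cands:
            if v in nxt.values():        # vertex already reserved
                continue
            if v == parent_vertex:       # would swap with the pushing agent
                continue
            nxt[agent] = v
            blocker = occupant.get(v)
            if blocker is not None and blocker != agent and blocker not in nxt:
                # Priority inheritance: push the occupant out first.
                if not plan(blocker, positions[agent]):
                    del nxt[agent]
                    continue
            return True
        nxt[agent] = positions[agent]    # all options failed: stay put
        return False

    for agent in priority:
        if agent not in nxt:
            plan(agent, None)
    return nxt

# Toy example: path graph 0 - 1 - 2. High-priority "A" wants to reach 2;
# "B" sits in the way at its own goal and gets pushed aside for one step.
NEI = {0: [1], 1: [0, 2], 2: [1]}
moves = pibt_step(
    positions={"A": 0, "B": 1},
    goals={"A": 2, "B": 1},
    neighbors=lambda v: NEI[v],
    dist=lambda v, g: abs(v - g),
    priority=["A", "B"],
)
# moves == {"A": 1, "B": 2}: "B" is displaced by priority inheritance.
```

The one-step horizon visible here is exactly the limitation the abstract describes: an agent that must rotate before translating cannot express that multi-action commitment in a single-step rule.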
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- Asia > Russia (0.04)
Collaborative Task Assignment, Sequencing and Multi-agent Path-finding
Bai, Yifan, Kotpalliwar, Shruti, Kanellakis, Christoforos, Nikolakopoulos, George
In this article, we address the problem of collaborative task assignment, sequencing, and multi-agent pathfinding (TSPF), where a team of agents must visit a set of task locations without collisions while minimizing flowtime. TSPF incorporates agent-task compatibility constraints and ensures that all tasks are completed. We propose a Conflict-Based Search with Task Sequencing (CBS-TS), an optimal and complete algorithm that alternates between finding new task sequences and resolving conflicts in the paths of current sequences. CBS-TS uses a mixed-integer linear program (MILP) to optimize task sequencing and employs Conflict-Based Search (CBS) with Multi-Label A* (MLA*) for collision-free path planning within a search forest. By invoking MILP for the next-best sequence only when needed, CBS-TS efficiently limits the search space, enhancing computational efficiency while maintaining optimality. We compare the performance of our CBS-TS against Conflict-based Steiner Search (CBSS), a baseline method that, with minor modifications, can address the TSPF problem. Experimental results demonstrate that CBS-TS outperforms CBSS in most testing scenarios, achieving higher success rates and consistently optimal solutions, whereas CBSS achieves near-optimal solutions in some cases. The supplementary video is available at https://youtu.be/QT8BYgvefmU.
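CBS, which CBS-TS builds on, resolves collisions by detecting vertex and edge (swap) conflicts between single-agent timed paths and branching on them. A minimal conflict check, a generic sketch with names of ours rather than the paper's code, might look like:

```python
def first_conflict(path_a, path_b):
    """Return the first vertex or edge (swap) conflict between two timed
    paths, or None. Paths are vertex lists indexed by timestep; an agent
    that has finished is assumed to wait at its final vertex.
    """
    at = lambda p, t: p[min(t, len(p) - 1)]
    horizon = max(len(path_a), len(path_b))
    for t in range(horizon):
        if at(path_a, t) == at(path_b, t):
            return ("vertex", at(path_a, t), t)
        if (t + 1 < horizon
                and at(path_a, t) == at(path_b, t + 1)
                and at(path_a, t + 1) == at(path_b, t)):
            return ("edge", (at(path_a, t), at(path_a, t + 1)), t)
    return None
```

In a CBS-style search, the returned conflict would spawn two child nodes, each constraining one of the two agents away from the conflicting vertex or edge at that timestep.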