AITopics

2506.14975

Country: North America > United States > California (0.28)

Genre: Research Report (1.00)

Industry: Transportation > Air (0.88)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
(5 more...)

Simulation to Rules: A Dual-VLM Framework for Formal Visual Planning

Hao, Yilun, Chen, Yongchao, Fan, Chuchu, Zhang, Yang

Vision Language Models (VLMs) show strong potential for visual planning but struggle with precise spatial and long-horizon reasoning. In contrast, Planning Domain Definition Language (PDDL) planners excel at long-horizon formal planning, but cannot interpret visual inputs. Recent works combine these complementary advantages by enabling VLMs to turn visual planning problems into PDDL files for formal planning. However, while VLMs can generate PDDL problem files satisfactorily, they struggle to accurately generate the PDDL domain files, which describe all the planning rules. As a result, prior methods rely on human experts to predefine domain files or on constant environment access for refinement. We propose VLMFP, a Dual-VLM-guided framework that can autonomously generate both PDDL problem and domain files for formal visual planning. VLMFP introduces two VLMs to ensure reliable PDDL file generation: A SimVLM that simulates action consequences based on input rule descriptions, and a GenVLM that generates and iteratively refines PDDL files by comparing the PDDL and SimVLM execution results. VLMFP unleashes multiple levels of generalizability: The same generated PDDL domain file works for all the different instances under the same problem, and VLMs generalize to different problems with varied appearances and rules. We evaluate VLMFP with 6 grid-world domains and test its generalization to unseen instances, appearance, and game rules. On average, SimVLM accurately describes 95.5%, 82.6% of scenarios, simulates 85.5%, 87.8% of action sequence, and judges 82.4%, 85.6% goal reaching for seen and unseen appearances, respectively. With the guidance of SimVLM, VLMFP can generate PDDL files to reach 70.0%, 54.1% valid plans for unseen instances in seen and unseen appearances, respectively. Project page: https://sites.google.com/view/vlmfp.

artificial intelligence, planning & scheduling, vlmfp, (16 more...)

2510.03182

Genre:

Workflow (0.68)
Research Report (0.50)

Industry: Leisure & Entertainment > Games (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Optimal Smooth Coverage Trajectory Planning for Quadrotors in Cluttered Environment

Li, Duanjiao, Chen, Yun, Zhang, Ying, Yao, Junwen, Huang, Dongyue, Zhang, Jianguo, Ding, Ning

In recent years, with the rapid development of manufacturing industries, unmanned systems have found widespread applications across various fields. Among them, quadro-tors have been increasingly utilized in industrial applications such as aerial photography and surveying [1]. As electricity consumption continues to rise, the frequency of power grid maintenance has also increased. Given the high risks and costs associated with manual inspections, the importance of utilizing unmanned systems for autonomous power grid inspections has become increasingly evident [2], as shown in Fig 1. Substations, as critical components of the power grid system, play an essential role in ensuring seamless inspection across modules within the same facility or between different facilities. The units scheduled for inspection can be abstracted as a series of access points, with drones acting as agents tasked with visiting these points.

artificial intelligence, machine learning, trajectory, (15 more...)

2510.03169

Country: Asia > China > Guangdong Province (0.14)

Genre: Research Report (0.40)

Industry:

Energy > Power Industry (1.00)
Media > Photography (0.88)

Technology:

Information Technology > Artificial Intelligence > Robots (0.70)
Information Technology > Artificial Intelligence > Machine Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.49)

Trepella, Stefano, Martini, Mauro, Pérez-Higueras, Noé, Ostuni, Andrea, Caballero, Fernando, Merino, Luis, Chiaberge, Marcello

Metrics vs Surveys: Can Quantitative Measures Replace Human Surveys in Social Robot Navigation? A Correlation Analysis

Abstract-- Social, also called human-aware, navigation is a key challenge for the integration of mobile robots into human environments. The evaluation of such systems is complex, as factors such as comfort, safety, and legibility must be considered. Human-centered assessments, typically conducted through surveys, provide reliable insights but are costly, resource-intensive, and difficult to reproduce or compare across systems. Alternatively, numerical social navigation metrics are easy to compute and facilitate comparisons, yet the community lacks consensus on a standard set of metrics. This work explores the relationship between numerical metrics and human-centered evaluations to identify potential correlations. If specific quantitative measures align with human perceptions, they could serve as standardized evaluation tools, reducing the dependency on surveys. Our results indicate that while current metrics capture some aspects of robot navigation behavior, important subjective factors remain insufficiently represented and new metrics are necessary. Human-aware robot navigation is a key research area for integrating mobile robots into human environments [1], [2]. Beyond the classical challenges of path planning and obstacle avoidance, human-aware navigation must address qualitative aspects of social interaction, such as comfort, predictability, and personal space, which are difficult to capture with mathematical models [3], [4].

artificial intelligence, machine learning, metric, (18 more...)

2510.02941

Country: Europe (0.93)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.69)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.47)

ERUPT: An Open Toolkit for Interfacing with Robot Motion Planners in Extended Reality

Ngui, Isaac, McBeth, Courtney, Santos, André, He, Grace, Mimnaugh, Katherine J., Motes, James D., Soares, Luciano, Morales, Marco, Amato, Nancy M.

We propose the Extended Reality Universal Planning Toolkit (ERUPT), an extended reality (XR) system for interactive motion planning. Our system allows users to create and dynamically reconfigure environments while they plan robot paths. In immersive three-dimensional XR environments, users gain a greater spatial understanding. XR also unlocks a broader range of natural interaction capabilities, allowing users to grab and adjust objects in the environment similarly to the real world, rather than using a mouse and keyboard with the scene projected onto a two-dimensional computer screen. Our system integrates with MoveIt, a manipulation planning framework, allowing users to send motion planning requests and visualize the resulting robot paths in virtual or augmented reality. We provide a broad range of interaction modalities, allowing users to modify objects in the environment and interact with a virtual robot. Our system allows operators to visualize robot motions, ensuring desired behavior as it moves throughout the environment, without risk of collisions within a virtual space, and to then deploy planned paths on physical robots in the real world.

artificial intelligence, planning & scheduling, robot, (20 more...)

2510.02464

Country: North America > United States > Illinois (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.58)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.47)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.37)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.37)

Masters, Charlie, Vellanki, Advaith, Shangguan, Jiangbo, Kultys, Bart, Gilmore, Jonathan, Moore, Alastair, Albrecht, Stefano V.

Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge

While agentic AI has advanced in automating individual tasks, managing complex multi-agent workflows remains a challenging problem. This paper presents a research vision for autonomous agentic systems that orchestrate collaboration within dynamic human-AI teams. We propose the Autonomous Manager Agent as a core challenge: an agent that decomposes complex goals into task graphs, allocates tasks to human and AI workers, monitors progress, adapts to changing conditions, and maintains transparent stakeholder communication. We formalize workflow management as a Partially Observable Stochastic Game and identify four foundational challenges: (1) compositional reasoning for hierarchical decomposition, (2) multi-objective optimization under shifting preferences, (3) coordination and planning in ad hoc teams, and (4) governance and compliance by design. To advance this agenda, we release MA-Gym, an open-source simulation and evaluation framework for multi-agent workflow orchestration. Evaluating GPT-5-based Manager Agents across 20 workflows, we find they struggle to jointly optimize for goal completion, constraint adherence, and workflow runtime - underscoring workflow management as a difficult open problem. We conclude with organizational and ethical implications of autonomous management systems.

large language model, machine learning, manager agent, (18 more...)

2510.02557

Country:

North America > United States (0.46)
Europe > United Kingdom > England > Greater London > London (0.15)

Genre: Workflow (1.00)

Industry:

Law (1.00)
Government (1.00)
Banking & Finance (0.93)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Neural Information Processing SystemsOct-3-2025, 08:57:22 GMT

7f2be1b45d278ac18804b79207a24c53-Paper.pdf

belief space, macro-action policy, open-loop action, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.53)

Neural Information Processing SystemsOct-3-2025, 07:23:12 GMT

1e1d184167ca7676cf665225e236a3d2-Reviews.html

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper presented a method to enable the generation of robust plans with partially specified domain models. The motivation of this research topic is well stated. The main contribution of this work is the formalization of the notion of plan robustness with respect to an incomplete domain model. The paper is clearly written and should the general interest for the broad NIPS audience.

domain model, incomplete domain model, robustness, (13 more...)

Neural Information Processing Systems

Country: North America > United States > Nevada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.48)