AITopics | Planning & Scheduling

Collaborating Authors

Planning & Scheduling

"Planning is the process of generating (possibly partial) representations of future behavior prior to the use of such plans to constrain or control that behavior. The outcome is usually a set of actions, with temporal and other constraints on them, for execution by some agent or agents. As a core aspect of human intelligence, planning has been studied since the earliest days of AI and cognitive science. Planning research has led to many useful tools for real-world applications, and has yielded significant insights into the organization of behavior and the nature of reasoning about actions."
– Planning entry by Austin Tate in the MIT Encyclopedia of Cognitive Science.

News Overviews Instructional Materials AI-Alerts Classics

ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models

Merler, Matteo, Dainese, Nicola, Alakuijala, Minttu, Bonetta, Giovanni, Ferrazzi, Pietro, Tian, Yu, Magnini, Bernardo, Marttinen, Pekka

arXiv.org Artificial IntelligenceMay-20-2025

Integrating Large Language Models with symbolic planners is a promising direction for obtaining verifiable and grounded plans compared to planning in natural language, with recent works extending this idea to visual domains using Vision-Language Models (VLMs). However, rigorous comparison between VLM-grounded symbolic approaches and methods that plan directly with a VLM has been hindered by a lack of common environments, evaluation protocols and model coverage. We introduce ViPlan, the first open-source benchmark for Visual Planning with symbolic predicates and VLMs. ViPlan features a series of increasingly challenging tasks in two domains: a visual variant of the classic Blocksworld planning problem and a simulated household robotics environment. We benchmark nine open-source VLM families across multiple sizes, along with selected closed models, evaluating both VLM-grounded symbolic planning and using the models directly to propose actions. We find symbolic planning to outperform direct VLM planning in Blocksworld, where accurate image grounding is crucial, whereas the opposite is true in the household robotics tasks, where commonsense knowledge and the ability to recover from errors are beneficial. Finally, we show that across most models and methods, there is no significant benefit to using Chain-of-Thought prompting, suggesting that current VLMs still struggle with visual reasoning.

large language model, machine learning, simple 0, (20 more...)

arXiv.org Artificial Intelligence

2505.1318

Country: North America > Mexico (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

RIFLES: Resource-effIcient Federated LEarning via Scheduling

Alosaime, Sara, Jhumka, Arshad

arXiv.org Artificial IntelligenceMay-20-2025

--Federated Learning (FL) is a privacy-preserving machine learning technique that allows decentralized collaborative model training across a set of distributed clients, by avoiding raw data exchange. A fundamental component of FL is the selection of a subset of clients in each round for model training by a central server . Current selection strategies are myopic in nature in that they are based on past or current interactions, often leading to inefficiency issues such as straggling clients. In this paper, we address this serious shortcoming by proposing the RIFLES approach that builds a novel availability forecasting layer to support the client selection process. We make the following contributions: (i) we formalise the sequential selection problem and reduce it to a scheduling problem and show that the problem is NP-complete, (ii) leveraging heartbeat messages from clients, RIFLES build an availability prediction layer to support (long term) selection decisions, (iii) we propose a novel adaptive selection strategy to support efficient learning and resource usage. T o circumvent the inherent exponential complexity, we present RIFLES, a heuristic that leverages clients' historical availability data by using a CNN-LSTM time series forecasting model, allowing the server to predict the optimal participation times of clients, thereby enabling informed selection decisions. By comparing against other FL techniques, we show that RIFLES provide significant improvement by between 10%- 50% on a variety of metrics such as accuracy and test loss. T o the best of our knowledge, it is the first work to investigate FL as a scheduling problem.

artificial intelligence, machine learning, rifle, (19 more...)

arXiv.org Artificial Intelligence

2505.13169

Genre: Research Report (1.00)

Industry:

Information Technology (0.68)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.86)

Add feedback

Constraint-Aware Diffusion Guidance for Robotics: Real-Time Obstacle Avoidance for Autonomous Racing

Ma, Hao, Bodmer, Sabrina, Carron, Andrea, Zeilinger, Melanie, Muehlebach, Michael

arXiv.org Artificial IntelligenceMay-20-2025

Diffusion models hold great potential in robotics due to their ability to capture complex, high-dimensional data distributions. However, their lack of constraint-awareness limits their deployment in safety-critical applications. We propose Constraint-Aware Diffusion Guidance (CoDiG), a data-efficient and general-purpose framework that integrates barrier functions into the denoising process, guiding diffusion sampling toward constraint-satisfying outputs. CoDiG enables constraint satisfaction even with limited training data and generalizes across tasks. We evaluate our framework in the challenging setting of miniature autonomous racing, where real-time obstacle avoidance is essential. Real-world experiments show that CoDiG generates safe outputs efficiently under dynamic conditions, highlighting its potential for broader robotic applications. A demonstration video is available at https://youtu.be/KNYsTdtdxOU.

artificial intelligence, machine learning, trajectory, (17 more...)

arXiv.org Artificial Intelligence

2505.13131

Country: Europe (0.28)

Genre:

Research Report (0.64)
Instructional Material (0.46)

Industry: Leisure & Entertainment > Sports > Motorsports (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.48)

Add feedback

Geofenced Unmanned Aerial Robotic Defender for Deer Detection and Deterrence (GUARD)

Temesgen, Ebasa, Jerez, Mario, Brown, Greta, Wilson, Graham, Divakarla, Sree Ganesh Lalitaditya, Boelter, Sarah, Nelson, Oscar, McPherson, Robert, Gini, Maria

arXiv.org Artificial IntelligenceMay-19-2025

--Wildlife-induced crop damage, particularly from deer, threatens agricultural productivity. Traditional deterrence methods often fall short in scalability, responsiveness, and adaptability to diverse farmland environments. This paper presents an integrated unmanned aerial vehicle (UA V) system designed for autonomous wildlife deterrence, developed as part of the Farm Robotics Challenge. Our system combines a YOLO-based real-time computer vision module for deer detection, an energy-efficient coverage path planning algorithm for efficient field monitoring, and an autonomous charging station for continuous operation of the UA V . In collaboration with a local Minnesota farmer, the system is tailored to address practical constraints such as terrain, infrastructure limitations, and animal behavior . The solution is evaluated through a combination of simulation and field testing, demonstrating robust detection accuracy, efficient coverage, and extended operational time. Crop damage caused by wildlife, particularly deer incursions, represents a challenge for modern agriculture. Deer damage to crops is responsible for disagreements among farmers, hunters, and the Department of Natural Resources over how the deer population should be controlled [1].

artificial intelligence, deterrence, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2505.1077

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > Iowa (0.04)
North America > United States > New Jersey (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Infrastructure & Services (1.00)
Food & Agriculture > Agriculture (1.00)
Transportation > Ground > Road (0.57)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.48)

Add feedback

Vaiage: A Multi-Agent Solution to Personalized Travel Planning

Liu, Binwen, Ge, Jiexi, Wang, Jiamin

arXiv.org Artificial IntelligenceMay-19-2025

Planning trips is a cognitively intensive task involving conflicting user preferences, dynamic external information, and multi-step temporal-spatial optimization. Traditional platforms often fall short - they provide static results, lack contextual adaptation, and fail to support real-time interaction or intent refinement. Our approach, Vaiage, addresses these challenges through a graph-structured multi-agent framework built around large language models (LLMs) that serve as both goal-conditioned recommenders and sequential planners. LLMs infer user intent, suggest personalized destinations and activities, and synthesize itineraries that align with contextual constraints such as budget, timing, group size, and weather. Through natural language interaction, structured tool use, and map-based feedback loops, Vaiage enables adaptive, explainable, and end-to-end travel planning grounded in both symbolic reasoning and conversational understanding. To evaluate Vaiage, we conducted human-in-the-loop experiments using rubric-based GPT-4 assessments and qualitative feedback. The full system achieved an average score of 8.5 out of 10, outperforming the no-strategy (7.2) and no-external-API (6.8) variants, particularly in feasibility. Qualitative analysis indicated that agent coordination - especially the Strategy and Information Agents - significantly improved itinerary quality by optimizing time use and integrating real-time context. These results demonstrate the effectiveness of combining LLM reasoning with symbolic agent coordination in open-ended, real-world planning tasks.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2505.10922

Country: North America > United States > California (0.30)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Consumer Health (1.00)
Consumer Products & Services > Travel (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

AutoCam: Hierarchical Path Planning for an Autonomous Auxiliary Camera in Surgical Robotics

Banks, Alexandre, Moore, Randy, Zaman, Sayem Nazmuz, Abdelaal, Alaa Eldin, Salcudean, Septimiu E.

arXiv.org Artificial IntelligenceMay-16-2025

--Incorporating an autonomous auxiliary camera into robot-assisted minimally invasive surgery (RAMIS) enhances spatial awareness and eliminates manual viewpoint control. Existing path planning methods for auxiliary cameras track two-dimensional surgical features but do not simultaneously account for camera orientation, workspace constraints, and robot joint limits. This study presents AutoCam: an automatic auxiliary camera placement method to improve visualization in RAMIS. Implemented on the da Vinci Research Kit, the system uses a priority-based, workspace-constrained control algorithm that combines heuristic geometric placement with nonlinear optimization to ensure robust camera tracking. A user study (N=6) demonstrated that the system maintained 99.84% visibility of a salient feature and achieved a pose error of 4.36 2.11 degrees and 1.95 5.66 mm. The controller was computationally efficient, with a loop time of 6.8 12.8 ms. An additional pilot study (N=6), where novices completed a Fundamentals of Laparoscopic Surgery training task, suggests that users can teleoperate just as effectively from AutoCam's viewpoint as from the endoscope's while still benefiting from AutoCam's improved visual coverage of the scene. These results indicate that an auxiliary camera can be autonomously controlled using the da Vinci patient-side manipulators to track a salient feature, laying the groundwork for new multi-camera visualization methods in RAMIS. OBOT assisted minimally invasive surgery (RAMIS) has been adopted in over 60 countries [1] and is shown to reduce postoperative blood loss, shorten hospitalization times, and enable tremor filtering and enhanced dexterity [2], [3]. Most surgical robots, including the da Vinci (Intuitive Surgical, Inc.) and Hugo (Medtronic, Inc.) systems, have a single endoscopic camera (ECM) restricted to rotate about the remote center of motion (RCM) at the incision site [4]. Having only one viewpoint with limited maneuverability compromises global awareness of the surgical scene [5] and impedes surgical workflow when the endoscope is occluded [4], [6], [7]. This work was supported by the NSERC Canada Graduate Scholarships, the NSERC Discovery Grant, and the C.A. Laszlo Biomedical Engineering Chair held by Professor Salcudean. A. Banks and R. Moore contributed equally to this work. Salcudean are with the University of British Columbia, V ancouver, BC V6T 1Z4, Canada. A. E. Abdelaal is with Stanford University, Stanford, CA 94305, United States.

artificial intelligence, constraint, planning & scheduling, (18 more...)

arXiv.org Artificial Intelligence

2505.10398

Country:

North America > Canada > British Columbia > Vancouver (0.24)
North America > United States > California > Santa Clara County > Stanford (0.24)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Surgery (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.84)

Add feedback

Air-Ground Collaboration for Language-Specified Missions in Unknown Environments

Cladera, Fernando, Ravichandran, Zachary, Hughes, Jason, Murali, Varun, Nieto-Granda, Carlos, Hsieh, M. Ani, Pappas, George J., Taylor, Camillo J., Kumar, Vijay

arXiv.org Artificial IntelligenceMay-15-2025

As autonomous robotic systems become increasingly mature, users will want to specify missions at the level of intent rather than in low-level detail. Language is an expressive and intuitive medium for such mission specification. However, realizing language-guided robotic teams requires overcoming significant technical hurdles. Interpreting and realizing language-specified missions requires advanced semantic reasoning. Successful heterogeneous robots must effectively coordinate actions and share information across varying viewpoints. Additionally, communication between robots is typically intermittent, necessitating robust strategies that leverage communication opportunities to maintain coordination and achieve mission objectives. In this work, we present a first-of-its-kind system where an unmanned aerial vehicle (UAV) and an unmanned ground vehicle (UGV) are able to collaboratively accomplish missions specified in natural language while reacting to changes in specification on the fly. We leverage a Large Language Model (LLM)-enabled planner to reason over semantic-metric maps that are built online and opportunistically shared between an aerial and a ground robot. We consider task-driven navigation in urban and rural areas. Our system must infer mission-relevant semantics and actively acquire information via semantic mapping. In both ground and air-ground teaming experiments, we demonstrate our system on seven different natural-language specifications at up to kilometer-scale navigation.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2505.09108

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
Europe > Italy (0.04)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.64)

Industry:

Government > Military (0.89)
Information Technology > Robotics & Automation (0.66)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
(2 more...)

Add feedback

Efficiently Manipulating Clutter via Learning and Search-Based Reasoning

Huang, Baichuan

arXiv.org Artificial IntelligenceMay-15-2025

This thesis presents novel algorithms to advance robotic object rearrangement, a critical task for autonomous systems in applications like warehouse automation and household assistance. Addressing challenges of high-dimensional planning, complex object interactions, and computational demands, our work integrates deep learning for interaction prediction, tree search for action sequencing, and parallelized computation for efficiency. Key contributions include the Deep Interaction Prediction Network (DIPN) for accurate push motion forecasting (over 90% accuracy), its synergistic integration with Monte Carlo Tree Search (MCTS) for effective non-prehensile object retrieval (100% completion in specific challenging scenarios), and the Parallel MCTS with Batched Simulations (PMBS) framework, which achieves substantial planning speed-up while maintaining or improving solution quality. The research further explores combining diverse manipulation primitives, validated extensively through simulated and real-world experiments.

machine learning, reinforcement learning, simulation, (19 more...)

arXiv.org Artificial Intelligence

2505.08853

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
Europe > Netherlands > South Holland > Dordrecht (0.04)

Genre:

Workflow (1.00)
Research Report > New Finding (0.67)

Industry:

Leisure & Entertainment > Games (1.00)
Health & Medicine (1.00)
Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Robots > Manipulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
(3 more...)

Add feedback

HMR-ODTA: Online Diverse Task Allocation for a Team of Heterogeneous Mobile Robots

Verma, Ashish, Gautam, Avinash, Duhan, Tanishq, Shekhawat, V. S., Mohan, Sudeept

arXiv.org Artificial IntelligenceMay-14-2025

Coordinating time-sensitive deliveries in environments like hospitals poses a complex challenge, particularly when managing multiple online pickup and delivery requests within strict time windows using a team of heterogeneous robots. Traditional approaches fail to address dynamic rescheduling or diverse service requirements, typically restricting robots to single-task types. This paper tackles the Multi-Pickup and Delivery Problem with Time Windows (MPDPTW), where autonomous mobile robots are capable of handling varied service requests. The objective is to minimize late delivery penalties while maximizing task completion rates. To achieve this, we propose a novel framework leveraging a heterogeneous robot team and an efficient dynamic scheduling algorithm that supports dynamic task rescheduling. Users submit requests with specific time constraints, and our decentralized algorithm, Heterogeneous Mobile Robots Online Diverse Task Allocation (HMR-ODTA), optimizes task assignments to ensure timely service while addressing delays or task rejections. Extensive simulations validate the algorithm's effectiveness. For smaller task sets (40-160 tasks), penalties were reduced by nearly 63%, while for larger sets (160-280 tasks), penalties decreased by approximately 50%. These results highlight the algorithm's effectiveness in improving task scheduling and coordination in multi-robot systems, offering a robust solution for enhancing delivery performance in structured, time-critical environments.

artificial intelligence, optimization problem, planning & scheduling, (17 more...)

arXiv.org Artificial Intelligence

2505.08419

Country: Asia (0.46)

Genre: Research Report > Promising Solution (0.46)

Industry:

Transportation > Freight & Logistics Services (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Robots > Locomotion (0.81)

Add feedback

PierGuard: A Planning Framework for Underwater Robotic Inspection of Coastal Piers

Wang, Pengyu, Lin, Hin Wang, Li, Jialu, Wang, Jiankun, Shi, Ling, Meng, Max Q. -H.

arXiv.org Artificial IntelligenceMay-14-2025

Using underwater robots instead of humans for the inspection of coastal piers can enhance efficiency while reducing risks. A key challenge in performing these tasks lies in achieving efficient and rapid path planning within complex environments. Sampling-based path planning methods, such as Rapidly-exploring Random Tree* (RRT*), have demonstrated notable performance in high-dimensional spaces. In recent years, researchers have begun designing various geometry-inspired heuristics and neural network-driven heuristics to further enhance the effectiveness of RRT*. However, the performance of these general path planning methods still requires improvement when applied to highly cluttered underwater environments. In this paper, we propose PierGuard, which combines the strengths of bidirectional search and neural network-driven heuristic regions. We design a specialized neural network to generate high-quality heuristic regions in cluttered maps, thereby improving the performance of the path planning. Through extensive simulation and real-world ocean field experiments, we demonstrate the effectiveness and efficiency of our proposed method compared with previous research. Our method achieves approximately 2.6 times the performance of the state-of-the-art geometric-based sampling method and nearly 4.9 times that of the state-of-the-art learning-based sampling method. Our results provide valuable insights for the automation of pier inspection and the enhancement of maritime safety. The updated experimental video is available in the supplementary materials.

artificial intelligence, machine learning, path planning, (16 more...)

arXiv.org Artificial Intelligence

2505.07845

Country:

Asia (1.00)
North America > United States (0.67)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback