AITopics | reach-avoid problem

Current reinforcement-learning methods cannot directly learn policies that solve the minimum-cost reach-avoid problem, which minimizes cumulative cost subject to the constraints of reaching the goal and avoiding unsafe states, because the structure of this optimization problem is incompatible with those methods. Instead, a surrogate problem is solved in which all objectives are combined into a weighted sum. However, this surrogate objective yields suboptimal policies that do not directly minimize the cumulative cost. In this work, we propose RC-PPO, a reinforcement-learning-based method for solving the minimum-cost reach-avoid problem by using connections to Hamilton-Jacobi reachability. Empirical results demonstrate that RC-PPO learns policies with goal-reaching rates comparable to existing methods while achieving up to 57% lower cumulative costs on a suite of minimum-cost reach-avoid benchmarks in the MuJoCo simulator. The project page can be found at https://oswinso.xyz/rcppo.

artificial intelligence, machine learning, reinforcement learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

3750e99b522bd36a099d2e8b9f0550c7-Paper-Conference.pdf

Neural Information Processing Systems, Oct-11-2025, 00:17:28 GMT

cumulative cost, minimum-cost reach-avoid problem, reach-avoid problem, (13 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Energy (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

d2f6f1dfbf9cd89a78c5a58ef0dec245-Paper-Conference.pdf

Neural Information Processing Systems, Oct-9-2025, 08:18:14 GMT

artificial intelligence, machine learning, reinforcement learning, (21 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(4 more...)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

2f89a23a19d1617e7fb16d4f7a049ce2-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 09:35:38 GMT

algorithm, decision tree, reachable state, (15 more...)

Country:

Europe > Netherlands > South Holland > Delft (0.04)
Europe > Denmark > North Jutland > Aalborg (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Austria (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Solving Minimum-Cost Reach Avoid using Reinforcement Learning

Neural Information Processing Systems, May-26-2025, 21:14:08 GMT

Current reinforcement-learning methods cannot directly learn policies that solve the minimum-cost reach-avoid problem, which minimizes cumulative cost subject to the constraints of reaching the goal and avoiding unsafe states, because the structure of this optimization problem is incompatible with those methods. Instead, a surrogate problem is solved in which all objectives are combined into a weighted sum. However, this surrogate objective yields suboptimal policies that do not directly minimize the cumulative cost. In this work, we propose RC-PPO, a reinforcement-learning-based method for solving the minimum-cost reach-avoid problem by using connections to Hamilton-Jacobi reachability. Empirical results demonstrate that RC-PPO learns policies with goal-reaching rates comparable to existing methods while achieving up to 57% lower cumulative costs on a suite of minimum-cost reach-avoid benchmarks in the MuJoCo simulator.

artificial intelligence, machine learning, reinforcement learning, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Solving Minimum-Cost Reach Avoid using Reinforcement Learning

So, Oswin; Ge, Cheng; Fan, Chuchu

arXiv.org Artificial Intelligence, Oct-29-2024

Current reinforcement-learning methods cannot directly learn policies that solve the minimum-cost reach-avoid problem, which minimizes cumulative cost subject to the constraints of reaching the goal and avoiding unsafe states, because the structure of this optimization problem is incompatible with those methods. Instead, a surrogate problem is solved in which all objectives are combined into a weighted sum. However, this surrogate objective yields suboptimal policies that do not directly minimize the cumulative cost. In this work, we propose RC-PPO, a reinforcement-learning-based method for solving the minimum-cost reach-avoid problem by using connections to Hamilton-Jacobi reachability. Empirical results demonstrate that RC-PPO learns policies with goal-reaching rates comparable to existing methods while achieving up to 57% lower cumulative costs on a suite of minimum-cost reach-avoid benchmarks in the MuJoCo simulator. The project page can be found at https://oswinso.xyz/rcppo/.

machine learning, reach-avoid problem, reinforcement learning, (14 more...)

2410.226

Genre: Research Report > New Finding (0.34)

Industry: Energy (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Guaranteed Reach-Avoid for Black-Box Systems through Narrow Gaps via Neural Network Reachability

Chung, Long Kiu; Jung, Wonsuhk; Pullabhotla, Srivatsank; Shinde, Parth; Sunil, Yadu; Kota, Saihari; Batista, Luis Felipe Wolf; Pradalier, Cédric; Kousik, Shreyas

arXiv.org Artificial Intelligence, Sep-19-2024

In the classical reach-avoid problem, autonomous mobile robots are tasked to reach a goal while avoiding obstacles. However, it is difficult to provide guarantees on the robot's performance when the obstacles form a narrow gap and the robot is a black-box (i.e. the dynamics are not known analytically, but interacting with the system is cheap). To address this challenge, this paper presents NeuralPARC. The method extends the authors' prior Piecewise Affine Reach-avoid Computation (PARC) method to systems modeled by rectified linear unit (ReLU) neural networks, which are trained to represent parameterized trajectory data demonstrated by the robot. NeuralPARC computes the reachable set of the network while accounting for modeling error, and returns a set of states and parameters with which the black-box system is guaranteed to reach the goal and avoid obstacles. Through numerical experiments, NeuralPARC is shown to outperform PARC in generating provably-safe extreme vehicle drift parking maneuvers, as well as enabling safety on an autonomous surface vehicle (ASV) subjected to large disturbances and controlled by a deep reinforcement learning (RL) policy.

artificial intelligence, machine learning, neuralparc, (17 more...)

2409.13195

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > Portugal > Porto > Porto (0.04)
Europe > Netherlands > South Holland > Delft (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Transportation > Air (0.83)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
