AITopics

Country: Asia > China (0.15)

Genre: Research Report (0.93)

Industry:

Transportation > Ground > Road (0.96)
Information Technology (0.71)
Energy (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Sensing and Signal Processing (0.93)
(3 more...)

Neural Information Processing Systems - Feb-8-2026, 00:33:34 GMT

286a371d8a0a559281f682f8fbf89834-Paper-Conference.pdf

autonomous driving, prediction, trajectory, (16 more...)

Country:

Asia > China > Shanghai > Shanghai (0.06)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.93)

Industry:

Transportation > Ground > Road (0.96)
Information Technology (0.71)
Energy (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
(3 more...)

Neural Information Processing Systems - Feb-7-2026, 16:55:59 GMT

Thompson Sampling Efficiently Learns to Control Diffusion Processes

Despite its simplicity, guaranteeing efficiency and whether sampling the actions from the posterior could lead to unbounded future trajectories is unknown.

artificial intelligence, arxiv preprint, machine learning, (18 more...)

Country:

North America > United States > California > Santa Clara County > Stanford (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

arXiv.org Artificial Intelligence - Dec-11-2025

COVLM-RL: Critical Object-Oriented Reasoning for Autonomous Driving Using VLM-Guided Reinforcement Learning

Li, Lin, Cai, Yuxin, Fang, Jianwu, Xue, Jianru, Lv, Chen

End-to-end autonomous driving frameworks face persistent challenges in generalization, training efficiency, and interpretability. While recent methods leverage Vision-Language Models (VLMs) through supervised learning on large-scale datasets to improve reasoning, they often lack robustness in novel scenarios. Conversely, reinforcement learning (RL)-based approaches enhance adaptability but remain data-inefficient and lack transparent decision-making. To address these limitations, we propose COVLM-RL, a novel end-to-end driving framework that integrates Critical Object-oriented (CO) reasoning with VLM-guided RL. Specifically, we design a Chain-of-Thought (CoT) prompting strategy that enables the VLM to reason over critical traffic elements and generate high-level semantic decisions, effectively transforming multi-view visual inputs into structured semantic decision priors. These priors reduce the input dimensionality and inject task-relevant knowledge into the RL loop, accelerating training and improving policy interpretability. However, bridging high-level semantic guidance with continuous low-level control remains non-trivial. To this end, we introduce a consistency loss that encourages alignment between the VLM's semantic plans and the RL agent's control outputs, enhancing interpretability and training stability. Experiments conducted in the CARLA simulator demonstrate that COVLM-RL significantly improves the success rate by 30% in trained driving environments and by 50% in previously unseen environments, highlighting its strong generalization capability.

machine learning, reinforcement learning, vlm, (20 more...)

2512.09349

Country: Asia > China (0.14)

Genre: Research Report (0.64)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (0.63)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(3 more...)

Neural Information Processing Systems - Nov-13-2025, 16:31:06 GMT

298f587406c914fad5373bb689300433-Paper.pdf

artificial intelligence, machine learning, predictive control, (18 more...)

Country:

North America > United States > California > Los Angeles County > Pasadena (0.05)
Asia > China > Beijing > Beijing (0.04)

Industry:

Energy > Renewable (0.68)
Energy > Power Industry (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

Carrizosa-Rendon, Alvaro, Zhou, Jian, Frisk, Erik, Puig, Vicenc, Nejjari, Fatiha

Behavior-Aware Online Prediction of Obstacle Occupancy using Zonotopes

arXiv.org Artificial Intelligence - Oct-24-2025

Abstract: Predicting the motion of surrounding vehicles is key to safe autonomous driving, especially in unstructured environments without prior information. This paper proposes a novel online method to accurately predict the occupancy sets of surrounding vehicles based solely on motion observations. The approach is divided into two stages: first, an Extended Kalman Filter and a Linear Programming (LP) problem are used to estimate a compact zonotopic set of control actions; then, a reachability analysis propagates this set to predict future occupancy. The effectiveness of the method has been validated through simulations in an urban environment, showing accurate and compact predictions without relying on prior assumptions or prior training data.

I. INTRODUCTION

Autonomous driving has generated great research interest given the expected benefits, such as reducing accidents, optimizing traffic efficiency and energy management [1]. However, ensuring safety remains a major challenge, particularly in urban environments, where multiple agents interact dynamically [2]. Predicting the motion of surrounding vehicles (SVs) is critical to designing safe motion planning and control strategies for autonomous vehicles.
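As a rough illustration of the second stage named in the abstract (propagating a zonotopic set of control actions through a reachability step), the sketch below pushes a zonotope through one step of a linear model x+ = A x + B u. The linear double-integrator model, the function names, and all numbers are assumptions made for illustration only; they are not taken from the paper, which may use a different vehicle model and set representation details.

```python
import numpy as np

def propagate_zonotope(center, generators, A, B, u_center, u_generators):
    """One reachability step of x+ = A x + B u for zonotopic x and u.

    A zonotope is {center + generators @ xi : |xi_i| <= 1}. A linear map acts
    on the center and on every generator, and the Minkowski sum of two
    zonotopes concatenates their generator columns.
    """
    new_center = A @ center + B @ u_center
    new_generators = np.hstack([A @ generators, B @ u_generators])
    return new_center, new_generators

def interval_hull(center, generators):
    """Tightest axis-aligned box containing the zonotope."""
    radius = np.abs(generators).sum(axis=1)
    return center - radius, center + radius

# Hypothetical 1-D double integrator (state = [position, velocity]), dt = 1 s.
A = np.array([[1.0, 1.0], [0.0, 1.0]])
B = np.array([[0.5], [1.0]])

state_center = np.array([0.0, 1.0])
state_generators = np.array([[0.1, 0.0], [0.0, 0.2]])

# Assumed zonotopic set of accelerations: u in [-0.5, 0.5].
u_center = np.array([0.0])
u_generators = np.array([[0.5]])

next_center, next_generators = propagate_zonotope(
    state_center, state_generators, A, B, u_center, u_generators
)
lower, upper = interval_hull(next_center, next_generators)
```

Repeating the step over a prediction horizon grows the generator matrix by one block per step, which is why practical implementations usually add an order-reduction step; that part is omitted here.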

artificial intelligence, control action, machine learning, (18 more...)

2510.20437

Country: Europe (0.14)

Genre:

Workflow (0.93)
Research Report (0.82)

Industry: Transportation > Ground > Road (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.75)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Kraipatthanapong, Nutkritta, Thathong, Natthaphat, Suksawas, Pannita, Klunklin, Thanunnut, Vongthonglua, Kritin, Attahakul, Krit, Aueawatthanaphisut, Aueaphum

Lyapunov-Aware Quantum-Inspired Reinforcement Learning for Continuous-Time Vehicle Control: A Feasibility Study

arXiv.org Artificial Intelligence - Oct-22-2025

This paper presents a novel Lyapunov-Based Quantum Reinforcement Learning (LQRL) framework that integrates quantum policy optimization with Lyapunov stability analysis for continuous-time vehicle control. The proposed approach combines the representational power of variational quantum circuits (VQCs) with a stability-aware policy gradient mechanism to ensure asymptotic convergence and safe decision-making under dynamic environments. The vehicle longitudinal control problem was formulated as a continuous-state reinforcement learning task, where the quantum policy network generates control actions subject to Lyapunov stability constraints. Simulation experiments were conducted in a closed-loop adaptive cruise control scenario using a quantum-inspired policy trained under stability feedback. The results demonstrate that the LQRL framework successfully embeds Lyapunov stability verification into quantum policy learning, enabling interpretable and stability-aware control performance. Although transient overshoot and Lyapunov divergence were observed under aggressive acceleration, the system maintained bounded state evolution, validating the feasibility of integrating safety guarantees within quantum reinforcement learning architectures. The proposed framework provides a foundational step toward provably safe quantum control in autonomous systems and hybrid quantum-classical optimization domains.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

2510.18852

Country: Asia (0.28)

Genre: Research Report > New Finding (0.49)

Industry:

Transportation (0.58)
Energy (0.49)
Automobiles & Trucks (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Trende, Mattias, Ögren, Petter

Least Restrictive Hyperplane Control Barrier Functions

arXiv.org Artificial Intelligence - Oct-22-2025

Control Barrier Functions (CBFs) can provide provable safety guarantees for dynamic systems. However, finding a valid CBF for a system of interest is often non-trivial, especially if the shape of the unsafe region is complex and the CBFs are of higher order. A common solution to this problem is to make a conservative approximation of the unsafe region in the form of a line/hyperplane, and use the corresponding conservative Hyperplane-CBF when deciding on safe control actions. In this letter, we note that conservative constraints are only a problem if they prevent us from doing what we want. Thus, instead of first choosing a CBF and then choosing a safe control with respect to the CBF, we optimize over a combination of CBFs and safe controls to get as close as possible to our desired control, while still having the safety guarantee provided by the CBF. We call the corresponding CBF the least restrictive Hyperplane-CBF. Finally, we also provide a way of creating a smooth parameterization of the CBF-family for the optimization, and illustrate the approach on a double integrator dynamical system with acceleration constraints, moving through a group of arbitrarily shaped static and moving obstacles.
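The idea of optimizing jointly over the hyperplane and the control can be sketched for a planar double integrator and a single circular obstacle. The code below is a coarse grid search over hyperplane normals, not the smooth parameterization the abstract mentions, and it ignores acceleration limits. The second-order barrier condition, the gains `alpha1` and `alpha2`, the function names, and the numbers are all assumptions made for illustration.

```python
import numpy as np

def project_to_halfspace(u_des, a, b):
    """Closest control to u_des satisfying a @ u >= b (closed-form QP solution)."""
    slack = b - a @ u_des
    if slack <= 0.0:
        return u_des.copy()
    return u_des + slack * a / (a @ a)

def hyperplane_constraint(n, p, v, obstacle, radius, alpha1=1.0, alpha2=1.0):
    """Second-order barrier condition for h = n @ (p - obstacle) - radius.

    With p' = v and v' = u, define psi1 = h' + alpha1 * h. Requiring
    psi1' + alpha2 * psi1 >= 0 gives the affine constraint a @ u >= b.
    """
    h = n @ (p - obstacle) - radius
    psi1 = n @ v + alpha1 * h
    b = -(alpha1 + alpha2) * (n @ v) - alpha1 * alpha2 * h
    return n, b, h, psi1

def least_restrictive_control(u_des, p, v, obstacle, radius, num_angles=72):
    """Grid search over unit normals; keep the one that changes u_des least."""
    best = None
    for theta in np.linspace(0.0, 2.0 * np.pi, num_angles, endpoint=False):
        n = np.array([np.cos(theta), np.sin(theta)])
        a, b, h, psi1 = hyperplane_constraint(n, p, v, obstacle, radius)
        # Only hyperplanes whose barrier conditions already hold are candidates.
        if h < 0.0 or psi1 < 0.0:
            continue
        u = project_to_halfspace(u_des, a, b)
        cost = np.linalg.norm(u - u_des)
        if best is None or cost < best[0]:
            best = (cost, u, n)
    return best  # None if no candidate hyperplane is valid at this state
```

For any unit normal `n`, the half-space `n @ (p - obstacle) >= radius` lies outside the disk, so each candidate is a conservative approximation of the unsafe region; the search simply picks the candidate that interferes least with the desired control at the current state.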

artificial intelligence, control barrier function, obstacle, (14 more...)

2510.18643

Genre: Research Report (0.40)

Industry: Transportation (0.69)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

arXiv.org Machine Learning - Oct-17-2025

Active Measuring in Reinforcement Learning With Delayed Negative Effects

Gao, Daiqi, Xu, Ziping, Rawashdeh, Aseel, Klasnja, Predrag, Murphy, Susan A.

Measuring states in reinforcement learning (RL) can be costly in real-world settings and may negatively influence future outcomes. We introduce the Actively Observable Markov Decision Process (AOMDP), where an agent not only selects control actions but also decides whether to measure the latent state. The measurement action reveals the true latent state but may have a negative delayed effect on the environment. We show that this reduced uncertainty may provably improve sample efficiency and increase the value of the optimal policy despite these costs. We formulate an AOMDP as a periodic partially observable MDP and propose an online RL algorithm based on belief states. To approximate the belief states, we further propose a sequential Monte Carlo method to jointly approximate the posterior of unknown static environment parameters and unobserved latent states. We evaluate the proposed algorithm in a digital health application, where the agent decides when to deliver digital interventions and when to assess users' health status through surveys.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

2510.14315

Genre: Research Report (1.00)

Industry: Health & Medicine > Consumer Health (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Oddo, Girolamo, Nuca, Roberto, Parsani, Matteo

VeMo: A Lightweight Data-Driven Approach to Model Vehicle Dynamics

arXiv.org Artificial Intelligence - Oct-10-2025

Abstract: Developing a dynamic model for a high-performance vehicle is a complex problem that requires extensive structural information about the system under analysis. This information is often unavailable to those who did not design the vehicle and represents a typical issue in autonomous driving applications, which are frequently developed on top of existing vehicles; therefore, vehicle models are developed under conditions of information scarcity. This paper proposes a lightweight encoder-decoder model based on Gated Recurrent Unit layers to correlate the vehicle's future state with its past states, measured onboard, and the control actions the driver performs. The results demonstrate that the model achieves a maximum mean relative error below 2.6% in extreme dynamic conditions. It also shows good robustness when subject to noisy input data across the frequency components of interest. Furthermore, despite being entirely data-driven and free from physical constraints, the model exhibits physical consistency in the output signals, such as longitudinal and lateral accelerations, yaw rate, and the vehicle's longitudinal velocity.

In the automotive sector, developing a representative vehicle dynamics model is a complex and multifaceted challenge [1]-[3]. Numerous nonlinear factors influence vehicle dynamics, including tire characteristics, suspension geometry, aerodynamics, drivetrain effects, and external environmental factors, such as road surface grip conditions and climatic effects (e.g., wind). Accurately capturing these effects in a computational model requires high-fidelity multibody simulation software and a profound understanding of the vehicle system.

artificial intelligence, machine learning, publisher, (19 more...)

2510.07447

Country: Asia (0.46)

Genre: Research Report > New Finding (0.66)

Industry:

Automobiles & Trucks (1.00)
Transportation > Ground > Road (0.48)
Leisure & Entertainment > Sports > Motorsports (0.46)
Information Technology > Robotics & Automation (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)