Goto

Collaborating Authors

 vehicle


HCRMP: ALLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

Neural Information Processing Systems

Integrating the understanding and reasoning capabilities of Large Language Models (LLM) with the self-learning capabilities of Reinforcement Learning (RL) enables more reliable driving performance under complex driving conditions. There has been a lot of work exploring LLM-Dominated RL methods in the field of autonomous driving motion planning. These methods, which utilize LLM to directly generate policies or provide decisive instructions during policy learning of RL agent, are centrally characterized by an over-reliance on LLM outputs. However, LLM outputs are susceptible to hallucinations. Evaluations show that state-of-theart LLM indicates a non-hallucination rate of only approximately 57.95% when assessed on essential driving-related tasks. Thus, in these methods, hallucinations from the LLM can directly jeopardize the performance of driving policies.


AGC-Drive: ALarge-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios

Neural Information Processing Systems

By sharing information across multiple agents, collaborative perception helps autonomous vehicles mitigate occlusions and improve overall perception accuracy. While most previous work focus on vehicle-to-vehicle and vehicle-to-infrastructure collaboration, with limited attention to aerial perspectives provided by UAVs, which uniquely offer dynamic, top-down views to alleviate occlusions and monitor large-scale interactive environments. A major reason for this is the lack of highquality datasets for aerial-ground collaborative scenarios. To bridge this gap, we present AGC-Drive, the first large-scale real-world dataset for Aerial-Ground Cooperative 3D perception. The data collection platform consists of two vehicles, each equipped with five cameras and one LiDAR sensor, and one UAV carrying a forward-facing camera and a LiDAR sensor, enabling comprehensive multi-view and multi-agent perception.


Traffic Sign Invisible Recognition ResultUVLight PPUVLamp STOP PFluorescentInk

Neural Information Processing Systems

Recently, traffic sign recognition (TSR) systems have become a prominent target for physical adversarial attacks. These attacks typically rely on conspicuous stickers and projections, or using invisible light and acoustic signals that can be easily blocked. In this paper, we introduce a novel attack medium, i.e., fluorescent ink, to design a stealthy and effective physical adversarial patch, namely FIPatch, to advance the state-of-the-art. Specifically, we first model the fluorescence effect in the digital domain to identify the optimal attack settings, which guide the realworld fluorescence parameters. By applying a carefully designed fluorescence perturbation to the target sign, the attacker can later trigger a fluorescent effect using invisible ultraviolet light, causing the TSR system to misclassify the sign and potentially leading to traffic accidents. We conducted a comprehensive evaluation to investigate the effectiveness of FIPatch, which shows a success rate of 98.31% in low-light conditions. Furthermore, our attack successfully bypasses five popular defenses and achieves a success rate of 96.72%.


MTL-KD: Multi-Task Learning Via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver

Neural Information Processing Systems

Multi-Task Learning (MTL) in Neural Combinatorial Optimization (NCO) is a promising approach for training a unified model capable of solving multiple Vehicle Routing Problem (VRP) variants. However, existing Reinforcement Learning (RL)-based multi-task methods can only train light decoder models on small-scale problems, exhibiting limited generalization ability when solving large-scale problems. To overcome this limitation, this work introduces a novel multi-task learning method driven by knowledge distillation (MTL-KD), which enables efficient training of heavy decoder models with strong generalization ability. The proposed MTL-KD method transfers policy knowledge from multiple distinct RL-based single-task models to a single heavy decoder model, facilitating label-free training and effectively improving the model's generalization ability across diverse tasks. In addition, we introduce a flexible inference strategy termed Random Reordering Re-Construction (R3C), which is specifically adapted for diverse VRP tasks and further boosts the performance of the multi-task model. Experimental results on 6 seen and 10 unseen VRP variants with up to 1,000 nodes indicate that our proposed method consistently achieves superior performance on both uniform and real-world benchmarks, demonstrating robust generalization abilities.


Learning to Insert for Constructive Neural Vehicle Routing Solver

Neural Information Processing Systems

Neural Combinatorial Optimisation (NCO) is a promising learning-based approach for solving Vehicle Routing Problems (VRPs) without extensive manual design. While existing constructive NCO methods typically follow an appending-based paradigm that sequentially adds unvisited nodes to partial solutions, this rigid approach often leads to suboptimal results. To overcome this limitation, we explore the idea of the insertion-based paradigm and propose Learning to Construct with Insertion-based Paradigm (L2C-Insert), a novel learning-based method for constructive NCO. Unlike traditional approaches, L2C-Insert builds solutions by strategically inserting unvisited nodes at any valid position in the current partial solution, which can significantly enhance the flexibility and solution quality. The proposed framework introduces three key components: a novel model architecture for precise insertion position prediction, an efficient training scheme for model optimization, and an advanced inference technique that fully exploits the insertion paradigm's flexibility. Extensive experiments on both synthetic and real-world instances of the Travelling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) demonstrate that L2C-Insert consistently achieves superior performance across various problem sizes.


Rivian faces a class action lawsuit over self-driving in its early vehicles

Engadget

Plaintiffs claim the company overstated the capabilities of the R1T and R1S. Rivian has been sued on allegations that it made misleading statements about the self-driving capabilities of its R1T truck and R1S SUV. According to the class action complaint brought by Rivian customers, the first-generation models of these vehicles are not capable of the offering the self-driving potential that the company had promised. The plaintiffs argued that Rivian represented that those early models would be capable of level 3 autonomous driving, meaning the vehicle would be able to steer, accelerate and break without driver action. In reality, Rivian manufactured its Gen 1 Vehicles without the hardware, cameras, sensors, and compute to enable hands-free driving and/or Level 3 autonomous operation, the complaint states.


HMARL-CBF - Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems

Neural Information Processing Systems

We address the problem of safe policy learning in multi-agent safety-critical autonomous systems. In such systems, it is necessary for each agent to meet the safety requirements at all times while also cooperating with other agents to accomplish the task. Toward this end, we propose a safe Hierarchical Multi-Agent Reinforcement Learning (HMARL) approach based on Control Barrier Functions (CBFs). Our proposed hierarchical approach decomposes the overall reinforcement learning problem into two levels -- learning joint cooperative behavior at the higher level and learning safe individual behavior at the lower or agent level, conditioned on the high-level policy. Specifically, we propose a skill-based HMARL-CBF algorithm in which the higher-level problem involves learning a joint policy over the skills for all the agents, and the lower-level problem involves learning policies to execute the skills safely with CBFs. We validate our approach in challenging environment scenarios, whereby a large number of agents have to safely navigate through conflicting road networks. Compared with existing state-of-the-art methods, our approach significantly improves the safety, achieving a near-perfect ( 95%) success/safety rate while improving performance across all the environments 1.


Waymo Recalls Robotaxis Over Risk They'll Drive at Speed Into Freeway Construction Zones

WIRED

The company's latest recall of 3,871 vehicles follows incidents of its autonomous cars "prioritizing other hazards" or failing to recognize closed construction zones altogether. Waymo has filed its fourth safety recall since February 2024, after its driverless cars were caught entering closed freeway-construction zones. The recall, filed with the National Highway Traffic Safety Administration (NHTSA) on June 17, appears to affect Waymo's entire US fleet, covering 3,871 vehicles running Waymo's 5th Generation automated driving system (ADS). NHTSA estimates 100 precent of the affected units carry the defect, which is outlined in the filed safety recall report as "under certain circumstances, the AV may enter and drive at speed in freeway-construction zones due to inappropriately prioritizing the avoidance of other freeway hazards and/or failing to recognize the construction zone." Waymo started offering highway rides in late 2025, and the underlying problem appears to be a failure of priority logic.


Waymo recalls over 3,800 robotaxis that might drive onto closed freeways

Engadget

The company is working on a fix and has restricted freeway driving. Waymo is recalling over 3,800 of its self-driving taxis due to a software issue that could cause them to enter closed freeway construction zones at speed, according to a National Highway Traffic Safety Admininstration (NHTSA) bulletin seen by Reuters . The company is reportedly working on a fix and has restricted freeway driving, the NHTSA safety notice states. It's not known if Waymo had an incident that prompted the recall. We identified an area of improvement regarding performance around freeway construction zones, Waymo told Engadget in a statement.