pedestrian
- North America > Canada > Ontario (0.04)
- Asia > South Korea > Seoul > Seoul (0.04)
- Asia > South Korea > Gyeongsangnam-do > Changwon (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Vision (0.99)
- Information Technology > Artificial Intelligence > Machine Learning (0.97)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.95)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > Canada (0.04)
- Africa > Central African Republic > Ombella-M'Poko > Bimbo (0.04)
- Transportation (0.68)
- Information Technology (0.68)
Supplementary Material: Progressive Coordinate Transforms for Monocular 3D Object Detection
In this supplementary material, we provide additional experimental results and qualitative visualizations. Specifically, we demonstrate the impacts of using different off-the-shelf models in Sec. 2, We show that our proposed PCT method achieves consistent improvements with all configurations. Sec. 3. We present both successful predictions and failure cases. In this section, we firstly show the additional results on "Pedestrian" and "Cyclist" categories.Then, At last, we would like to demonstrate the impacts of using different 2D detectors and depth estimators on the performance of 3D detection, which is the first two steps of the coordinate-based methods as mentioned in Sec. 3 of the main submission. Non-rigid structures and various shape make it more challenging for monocular 3D detection to accurately detect "Pedestrian" and "Cyclist".
Valeo Near-Field: a novel dataset for pedestrian intent detection
Musabini, Antonyo, Benmokhtar, Rachid, Bhanushali, Jagdish, Galizzi, Victor, Luvison, Bertrand, Perrotton, Xavier
This paper presents a novel dataset aimed at detecting pedestrians' intentions as they approach an ego-vehicle. The dataset comprises synchronized multi-modal data, including fisheye camera feeds, lidar laser scans, ultrasonic sensor readings, and motion capture-based 3D body poses, collected across diverse real-world scenarios. Key contributions include detailed annotations of 3D body joint positions synchronized with fisheye camera images, as well as accurate 3D pedestrian positions extracted from lidar data, facilitating robust benchmarking for perception algorithms. W e release a portion of the dataset along with a comprehensive benchmark suite, featuring evaluation metrics for accuracy, efficiency, and scalability on embedded systems. By addressing real-world challenges such as sensor occlusions, dynamic environments, and hardware constraints, this dataset offers a unique resource for developing and evaluating state-of-the-art algorithms in pedestrian detection, 3D pose estimation and 4D trajectory and intention prediction. Additionally, we provide baseline performance metrics using custom neural network architectures and suggest future research directions to encourage the adoption and enhancement of the dataset. This work aims to serve as a foundation for researchers seeking to advance the capabilities of intelligent vehicles in near-field scenarios.
- North America > United States > California > San Mateo County > San Mateo (0.04)
- Europe > France > Île-de-France (0.04)
- Automobiles & Trucks > Parts Supplier (0.76)
- Transportation > Ground > Road (0.68)
ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition
Park, Minjeong, Park, Hongbeen, Kim, Jinkyu
The Pedestrian Attribute Recognition (PAR) task aims to identify various detailed attributes of an individual, such as clothing, accessories, and gender. To enhance PAR performance, a model must capture features ranging from coarse-grained global attributes (e.g., for identifying gender) to fine-grained local details (e.g., for recognizing accessories) that may appear in diverse regions. Recent research suggests that body part representation can enhance the model's robustness and accuracy, but these methods are often restricted to attribute classes within fixed horizontal regions, leading to degraded performance when attributes appear in varying or unexpected body locations. In this paper, we propose Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition, dubbed as ViTA-PAR, to enhance attribute recognition through specialized multimodal prompting and vision-language alignment. We introduce visual attribute prompts that capture global-to-local semantics, enabling diverse attribute representations. To enrich textual embeddings, we design a learnable prompt template, termed person and attribute context prompting, to learn person and attributes context. Finally, we align visual and textual attribute features for effective fusion. ViTA-PAR is validated on four PAR benchmarks, achieving competitive performance with efficient inference. We release our code and model at https://github.com/mlnjeongpark/ViTA-PAR.
Safe Driving in Occluded Environments
Wang, Zhuoyuan, Jia, Tongyao, Rajborirug, Pharuj, Ramesh, Neeraj, Okuda, Hiroyuki, Suzuki, Tatsuya, Kar, Soummya, Nakahira, Yorie
Abstract--Ensuring safe autonomous driving in the presence of occlusions poses a significant challenge in its policy design. While existing model-driven control techniques based on set invariance can handle visible risks, occlusions create latent risks in which safety-critical states are not observable. Data-driven techniques also struggle to handle latent risks because direct mappings from risk-critical objects in sensor inputs to safe actions cannot be learned without visible risk-critical objects. Motivated by these challenges, in this paper, we propose a probabilistic safety certificate for latent risk. Our key technical enabler is the application of probabilistic invariance: It relaxes the strict observability requirements imposed by set-invariance methods that demand the knowledge of risk-critical states. The proposed techniques provide linear action constraints that confine the latent risk probability within tolerance. Such constraints can be integrated into model predictive controllers or embedded in data-driven policies to mitigate latent risks. The proposed method is tested using the CARLA simulator and compared with a few existing techniques. The theoretical and empirical analysis jointly demonstrate that the proposed methods assure long-term safety in real-time control in occluded environments without being overly conservative and with transparency to exposed risks. ISUAL occlusions impose significant challenges in the policy design of autonomous driving.
- North America > United States > California > Santa Barbara County > Santa Barbara (0.14)
- Asia > China > Hong Kong (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- (6 more...)
- Personal (0.93)
- Research Report (0.82)
- Transportation > Ground > Road (1.00)
- Information Technology (1.00)
- Energy (1.00)
- Automobiles & Trucks (1.00)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
- North America > Canada > Ontario (0.04)
- Asia > South Korea > Seoul > Seoul (0.04)
- Asia > South Korea > Gyeongsangnam-do > Changwon (0.04)
AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback
Yang, Yunhao, Hong, Junyuan, Perin, Gabriel Jacob, Fan, Zhiwen, Yin, Li, Wang, Zhangyang, Topcu, Ufuk
Large language models (LLMs) can translate natural language instructions into executable action plans for robotics, autonomous driving, and other domains. Yet, deploying LLM-driven planning in the physical world demands strict adherence to safety and regulatory constraints, which current models often violate due to hallucination or weak alignment. Traditional data-driven alignment methods, such as Direct Preference Optimization (DPO), require costly human labeling, while recent formal-feedback approaches still depend on resource-intensive fine-tuning. In this paper, we propose LAD-VF, a fine-tuning-free framework that leverages formal verification feedback for automated prompt engineering. By introducing a formal-verification-informed text loss integrated with LLM-AutoDiff, LAD-VF iteratively refines prompts rather than model parameters. This yields three key benefits: (i) scalable adaptation without fine-tuning; (ii) compatibility with modular LLM architectures; and (iii) interpretable refinement via auditable prompts. Experiments in robot navigation and manipulation tasks demonstrate that LAD-VF substantially enhances specification compliance, improving success rates from 60% to over 90%. Our method thus presents a scalable and interpretable pathway toward trustworthy, formally-verified LLM-driven control systems.
- North America > United States > Texas > Travis County > Austin (0.14)
- South America > Brazil > São Paulo (0.04)
- Oceania > New Zealand (0.04)
- (4 more...)
- Research Report (0.64)
- Workflow (0.47)
#IJCAI2025 distinguished paper: Combining MORL with restraining bolts to learn normative behaviour
Image provided by the authors – generated using Gemini. For many of us, artificial intelligence (AI) has become part of everyday life, and the rate at which we assign previously human roles to AI systems shows no signs of slowing down. AI systems are the crucial ingredients of many technologies -- e.g., self-driving cars, smart urban planning, digital assistants -- across a growing number of domains. At the core of many of these technologies are autonomous agents -- systems designed to act on behalf of humans and make decisions without direct supervision. In order to act effectively in the real world, these agents must be capable of carrying out a wide range of tasks despite possibly unpredictable environmental conditions, which often requires some form of machine learning (ML) for achieving adaptive behaviour.
Can a mobile robot learn from a pedestrian model to prevent the sidewalk salsa?
Siebinga, Olger, Abbink, David
Pedestrians approaching each other on a sidewalk sometimes end up in an awkward interaction known as the "sidewalk salsa": they both (repeatedly) deviate to the same side to avoid a collision. This provides an interesting use case to study interactions between pedestrians and mobile robots because, in the vast majority of cases, this phenomenon is avoided through a negotiation based on implicit communication. Understanding how it goes wrong and how pedestrians end up in the sidewalk salsa will therefore provide insight into the implicit communication. This understanding can be used to design safe and acceptable robotic behaviour. In a previous attempt to gain this understanding, a model of pedestrian behaviour based on the Communication-Enabled Interaction (CEI) framework was developed that can replicate the sidewalk salsa. However, it is unclear how to leverage this model in robotic planning and decision-making since it violates the assumptions of game theory, a much-used framework in planning and decision-making. Here, we present a proof-of-concept for an approach where a Reinforcement Learning (RL) agent leverages the model to learn how to interact with pedestrians. The results show that a basic RL agent successfully learned to interact with the CEI model. Furthermore, a risk-averse RL agent that had access to the perceived risk of the CEI model learned how to effectively communicate its intention through its motion and thereby substantially lowered the perceived risk, and displayed effort by the modelled pedestrian. These results show this is a promising approach and encourage further exploration.
- Europe > Netherlands > South Holland > Delft (0.05)
- Europe > Netherlands > North Brabant > Eindhoven (0.04)
- Europe > Austria > Vienna (0.04)
- Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)