AITopics | pedestrian

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada (0.04)
Africa > Central African Republic > Ombella-M'Poko > Bimbo (0.04)

Genre: Research Report (0.46)

Industry:

Transportation (0.68)
Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Neural Information Processing SystemsFeb-14-2026, 06:40:42 GMT

d09bf41544a3365a46c9077ebb5e35c3-AuthorFeedback.pdf

agent, interaction, sequence, (15 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.30)

Neural Information Processing SystemsFeb-11-2026, 14:26:49 GMT

4d6a000c216974f59e597bc878cd6325-Paper-Datasets_and_Benchmarks.pdf

artificial intelligence, dataset, machine learning, (12 more...)

Country:

North America > Canada > Ontario (0.04)
Asia > South Korea > Seoul > Seoul (0.04)
Asia > South Korea > Gyeongsangnam-do > Changwon (0.04)

Industry: Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Vision (0.99)
Information Technology > Artificial Intelligence > Machine Learning (0.97)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.95)

Neural Information Processing SystemsFeb-9-2026, 07:17:41 GMT

6f3ef77ac0e3619e98159e9b6febf557-Supplemental.pdf

depth estimator, detection, detector, (15 more...)

Country: South America > Brazil (0.05)

Technology: Information Technology > Artificial Intelligence > Vision (0.69)

Musabini, Antonyo, Benmokhtar, Rachid, Bhanushali, Jagdish, Galizzi, Victor, Luvison, Bertrand, Perrotton, Xavier

Valeo Near-Field: a novel dataset for pedestrian intent detection

arXiv.org Artificial IntelligenceOct-20-2025

This paper presents a novel dataset aimed at detecting pedestrians' intentions as they approach an ego-vehicle. The dataset comprises synchronized multi-modal data, including fisheye camera feeds, lidar laser scans, ultrasonic sensor readings, and motion capture-based 3D body poses, collected across diverse real-world scenarios. Key contributions include detailed annotations of 3D body joint positions synchronized with fisheye camera images, as well as accurate 3D pedestrian positions extracted from lidar data, facilitating robust benchmarking for perception algorithms. W e release a portion of the dataset along with a comprehensive benchmark suite, featuring evaluation metrics for accuracy, efficiency, and scalability on embedded systems. By addressing real-world challenges such as sensor occlusions, dynamic environments, and hardware constraints, this dataset offers a unique resource for developing and evaluating state-of-the-art algorithms in pedestrian detection, 3D pose estimation and 4D trajectory and intention prediction. Additionally, we provide baseline performance metrics using custom neural network architectures and suggest future research directions to encourage the adoption and enhancement of the dataset. This work aims to serve as a foundation for researchers seeking to advance the capabilities of intelligent vehicles in near-field scenarios.

artificial intelligence, dataset, machine learning, (20 more...)

2510.15673

Country:

North America > United States > California > San Mateo County > San Mateo (0.04)
Europe > France > Île-de-France (0.04)

Genre: Research Report (0.64)

Industry:

Automobiles & Trucks > Parts Supplier (0.76)
Transportation > Ground > Road (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.67)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.60)

Park, Minjeong, Park, Hongbeen, Kim, Jinkyu

ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition

arXiv.org Artificial IntelligenceOct-17-2025

The Pedestrian Attribute Recognition (PAR) task aims to identify various detailed attributes of an individual, such as clothing, accessories, and gender. To enhance PAR performance, a model must capture features ranging from coarse-grained global attributes (e.g., for identifying gender) to fine-grained local details (e.g., for recognizing accessories) that may appear in diverse regions. Recent research suggests that body part representation can enhance the model's robustness and accuracy, but these methods are often restricted to attribute classes within fixed horizontal regions, leading to degraded performance when attributes appear in varying or unexpected body locations. In this paper, we propose Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition, dubbed as ViTA-PAR, to enhance attribute recognition through specialized multimodal prompting and vision-language alignment. We introduce visual attribute prompts that capture global-to-local semantics, enabling diverse attribute representations. To enrich textual embeddings, we design a learnable prompt template, termed person and attribute context prompting, to learn person and attributes context. Finally, we align visual and textual attribute features for effective fusion. ViTA-PAR is validated on four PAR benchmarks, achieving competitive performance with efficient inference. We release our code and model at https://github.com/mlnjeongpark/ViTA-PAR.

machine learning, natural language, recognition, (16 more...)

2506.01411

Country: Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceOct-16-2025

Safe Driving in Occluded Environments

Wang, Zhuoyuan, Jia, Tongyao, Rajborirug, Pharuj, Ramesh, Neeraj, Okuda, Hiroyuki, Suzuki, Tatsuya, Kar, Soummya, Nakahira, Yorie

Abstract--Ensuring safe autonomous driving in the presence of occlusions poses a significant challenge in its policy design. While existing model-driven control techniques based on set invariance can handle visible risks, occlusions create latent risks in which safety-critical states are not observable. Data-driven techniques also struggle to handle latent risks because direct mappings from risk-critical objects in sensor inputs to safe actions cannot be learned without visible risk-critical objects. Motivated by these challenges, in this paper, we propose a probabilistic safety certificate for latent risk. Our key technical enabler is the application of probabilistic invariance: It relaxes the strict observability requirements imposed by set-invariance methods that demand the knowledge of risk-critical states. The proposed techniques provide linear action constraints that confine the latent risk probability within tolerance. Such constraints can be integrated into model predictive controllers or embedded in data-driven policies to mitigate latent risks. The proposed method is tested using the CARLA simulator and compared with a few existing techniques. The theoretical and empirical analysis jointly demonstrate that the proposed methods assure long-term safety in real-time control in occluded environments without being overly conservative and with transparency to exposed risks. ISUAL occlusions impose significant challenges in the policy design of autonomous driving.

artificial intelligence, machine learning, vehicle, (17 more...)

2510.13114

Country:

North America > United States > California > Santa Barbara County > Santa Barbara (0.14)
Asia > China > Hong Kong (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(6 more...)

Genre:

Personal (0.93)
Research Report (0.82)

Industry:

Transportation > Ground > Road (1.00)
Information Technology (1.00)
Energy (1.00)
Automobiles & Trucks (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing SystemsOct-8-2025, 15:52:24 GMT

SiT Dataset: Socially Interactive Pedestrian Trajectory Dataset for Social Navigation Robots

To ensure secure and dependable mobility in environments shared by humans and robots, social navigation robots should possess the capability to accurately perceive and predict the trajectories of nearby pedestrians.

artificial intelligence, dataset, proceedings, (11 more...)

Country:

North America > Canada > Ontario (0.04)
Asia > South Korea > Seoul > Seoul (0.04)
Asia > South Korea > Gyeongsangnam-do > Changwon (0.04)

Industry: Information Technology (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.95)

arXiv.org Artificial IntelligenceSep-24-2025

AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback

Yang, Yunhao, Hong, Junyuan, Perin, Gabriel Jacob, Fan, Zhiwen, Yin, Li, Wang, Zhangyang, Topcu, Ufuk

Large language models (LLMs) can translate natural language instructions into executable action plans for robotics, autonomous driving, and other domains. Yet, deploying LLM-driven planning in the physical world demands strict adherence to safety and regulatory constraints, which current models often violate due to hallucination or weak alignment. Traditional data-driven alignment methods, such as Direct Preference Optimization (DPO), require costly human labeling, while recent formal-feedback approaches still depend on resource-intensive fine-tuning. In this paper, we propose LAD-VF, a fine-tuning-free framework that leverages formal verification feedback for automated prompt engineering. By introducing a formal-verification-informed text loss integrated with LLM-AutoDiff, LAD-VF iteratively refines prompts rather than model parameters. This yields three key benefits: (i) scalable adaptation without fine-tuning; (ii) compatibility with modular LLM architectures; and (iii) interpretable refinement via auditable prompts. Experiments in robot navigation and manipulation tasks demonstrate that LAD-VF substantially enhances specification compliance, improving success rates from 60% to over 90%. Our method thus presents a scalable and interpretable pathway toward trustworthy, formally-verified LLM-driven control systems.

large language model, machine learning, specification, (18 more...)

2509.18384

Country:

North America > United States > Texas > Travis County > Austin (0.14)
South America > Brazil > São Paulo (0.04)
Oceania > New Zealand (0.04)
(4 more...)

Genre:

Research Report (0.64)
Workflow (0.47)

Industry: Transportation > Ground > Road (0.38)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

AIHubSep-4-2025, 08:38:07 GMT

#IJCAI2025 distinguished paper: Combining MORL with restraining bolts to learn normative behaviour

Image provided by the authors – generated using Gemini. For many of us, artificial intelligence (AI) has become part of everyday life, and the rate at which we assign previously human roles to AI systems shows no signs of slowing down. AI systems are the crucial ingredients of many technologies -- e.g., self-driving cars, smart urban planning, digital assistants -- across a growing number of domains. At the core of many of these technologies are autonomous agents -- systems designed to act on behalf of humans and make decisions without direct supervision. In order to act effectively in the real world, these agents must be capable of carrying out a wide range of tasks despite possibly unpredictable environmental conditions, which often requires some form of machine learning (ML) for achieving adaptive behaviour.

agent, artificial intelligence, obligation, (15 more...)

AIHub

Country: Europe > Austria > Vienna (0.05)

Industry: Transportation > Ground > Road (0.36)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.57)