AITopics | safety property

2512.0227

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > California > Santa Clara County > Santa Clara (0.04)

Genre: Research Report > New Finding (0.54)

Industry:

Transportation > Air (0.48)
Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.46)

Neural Information Processing SystemsNov-20-2025, 15:27:09 GMT

Efficient Formal Safety Analysis of Neural Networks

Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, Suman Jana

Such mistakes can have disastrous and even potentially fatal consequences.

artificial intelligence, machine learning, neural network, (19 more...)

Country:

North America > United States > Massachusetts (0.04)
North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)

Industry:

Transportation > Ground > Road (0.69)
Automobiles & Trucks (0.69)
Information Technology > Robotics & Automation (0.47)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing SystemsNov-17-2025, 02:39:51 GMT

Efficient Formal Safety Analysis of Neural Networks

Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, Suman Jana

Such mistakes can have disastrous and even potentially fatal consequences.

artificial intelligence, machine learning, neural network, (19 more...)

Country:

North America > United States > Massachusetts (0.04)
North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)

Industry:

Transportation > Ground > Road (0.69)
Automobiles & Trucks (0.69)
Information Technology > Robotics & Automation (0.47)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing SystemsNov-13-2025, 22:38:05 GMT

A Safely Imitating a Neural Policy

Here we provide proofs of the theoretical results from Section 3.2 and extend the discussion of a few

artificial intelligence, benchmark, machine learning, (17 more...)

Industry: Transportation (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Manino, Edoardo, Farias, Bruno, Menezes, Rafael Sá, Shmarov, Fedor, Cordeiro, Lucas C.

Floating-Point Neural Network Verification at the Software Level

arXiv.org Artificial IntelligenceOct-28-2025

The behaviour of neural network components must be proven correct before deployment in safety-critical systems. Unfortunately, existing neural network verification techniques cannot certify the absence of faults at the software level. In this paper, we show how to specify and verify that neural networks are safe, by explicitly reasoning about their floating-point implementation. In doing so, we construct NeuroCodeBench 2.0, a benchmark comprising 912 neural network verification examples that cover activation functions, common layers, and full neural networks of up to 170K parameters. Our verification suite is written in plain C and is compatible with the format of the International Competition on Software Verification (SV-COMP). Thanks to it, we can conduct the first rigorous evaluation of eight state-of-the-art software verifiers on neural network code. The results show that existing automated verification tools can correctly solve an average of 11% of our benchmark, while producing around 3% incorrect verdicts. At the same time, a historical analysis reveals that the release of our benchmark has already had a significantly positive impact on the latter.

artificial intelligence, machine learning, verifier, (18 more...)

2510.23389

Country:

Europe > Austria > Vienna (0.14)
Europe > United Kingdom > England > Greater Manchester > Manchester (0.04)
North America > United States > New York > New York County > New York City (0.04)
(12 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Government (0.67)
Information Technology > Security & Privacy (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Adam, Mustafa, Anisi, David A., Ribeiro, Pedro

A Verification Methodology for Safety Assurance of Robotic Autonomous Systems

arXiv.org Artificial IntelligenceOct-16-2025

Autonomous robots deployed in shared human environments, such as agricultural settings, require rigorous safety assurance to meet both functional reliability and regulatory compliance. These systems must operate in dynamic, unstructured environments, interact safely with humans, and respond effectively to a wide range of potential hazards. This paper presents a verification workflow for the safety assurance of an autonomous agricultural robot, covering the entire development life-cycle, from concept study and design to runtime verification. The outlined methodology begins with a systematic hazard analysis and risk assessment to identify potential risks and derive corresponding safety requirements. A formal model of the safety controller is then developed to capture its behaviour and verify that the controller satisfies the specified safety properties with respect to these requirements. The proposed approach is demonstrated on a field robot operating in an agricultural setting. The results show that the methodology can be effectively used to verify safety-critical properties and facilitate the early identification of design issues, contributing to the development of safer robots and autonomous systems.

artificial intelligence, requirement, verification, (16 more...)

doi: 10.1007/978-3-032-01486-3_23

2506.19622

Country:

Europe > United Kingdom > England > North Yorkshire > York (0.04)
Europe > Switzerland (0.04)

Genre:

Research Report (0.70)
Workflow (0.49)

Industry:

Government (0.35)
Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.89)

Sayed, Abdelrahman Sayed, Meyer, Pierre-Jean, Ghazel, Mohamed

Bridging Neural ODE and ResNet: A Formal Error Bound for Safety Verification

arXiv.org Artificial IntelligenceOct-14-2025

A neural ordinary differential equation (neural ODE) is a machine learning model that is commonly described as a continuous-depth generalization of a residual network (ResNet) with a single residual block, or conversely, the ResNet can be seen as the Euler discretization of the neural ODE. These two models are therefore strongly related in a way that the behaviors of either model are considered to be an approximation of the behaviors of the other. In this work, we establish a more formal relationship between these two models by bounding the approximation error between two such related models. The obtained error bound then allows us to use one of the models as a verification proxy for the other, without running the verification tools twice: if the reachable output set expanded by the error bound satisfies a safety property on one of the models, this safety property is then guaranteed to be also satisfied on the other model. This feature is fully reversible, and the initial safety verification can be run indifferently on either of the two models. This novel approach is illustrated on a numerical example of a fixed-point attractor system modeled as a neural ODE.

artificial intelligence, machine learning, neural ode, (16 more...)

2506.03227

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > New York (0.04)
Europe > France (0.04)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Neural Information Processing SystemsOct-9-2025, 14:09:55 GMT

A Safely Imitating a Neural Policy

Here we provide proofs of the theoretical results from Section 3.2 and extend the discussion of a few

artificial intelligence, benchmark, machine learning, (17 more...)

Industry: Transportation (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Gross, Dennis, Spieker, Helge, Gotlieb, Arnaud

Verifying Memoryless Sequential Decision-making of Large Language Models

arXiv.org Artificial IntelligenceOct-9-2025

We introduce a tool for rigorous and automated verification of large language model (LLM)- based policies in memoryless sequential decision-making tasks. Given a Markov decision process (MDP) representing the sequential decision-making task, an LLM policy, and a safety requirement expressed as a PCTL formula, our approach incrementally constructs only the reachable portion of the MDP guided by the LLM's chosen actions. Each state is encoded as a natural language prompt, the LLM's response is parsed into an action, and reachable successor states by the policy are expanded. The resulting formal model is checked with Storm to determine whether the policy satisfies the specified safety property. In experiments on standard grid world benchmarks, we show that open source LLMs accessed via Ollama can be verified when deterministically seeded, but generally underperform deep reinforcement learning baselines. Our tool natively integrates with Ollama and supports PRISM-specified tasks, enabling continuous benchmarking in user-specified sequential decision-making tasks and laying a practical foundation for formally verifying increasingly capable LLMs.

large language model, machine learning, reinforcement learning, (16 more...)

2510.06756

Country:

North America > United States (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Transportation (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)

Wang, Xuekang, Zhu, Shengyu, Cheng, Xueqi

Speculative Safety-Aware Decoding

arXiv.org Artificial IntelligenceSep-30-2025

Despite extensive efforts to align Large Language Models (LLMs) with human values and safety rules, jailbreak attacks that exploit certain vulnerabilities continuously emerge, highlighting the need to strengthen existing LLMs with additional safety properties to defend against these attacks. However, tuning large models has become increasingly resource intensive and may have difficulty ensuring consistent performance. We introduce Speculative Safety-Aware Decoding (SSD), a lightweight decoding-time approach that equips LLMs with the desired safety property while accelerating inference. We assume that there exists a small language model that possesses this desired property. SSD integrates speculative sampling during decoding and leverages the match ratio between the small and composite models to quantify jailbreak risks. This enables SSD to dynamically switch between decoding schemes to prioritize utility or safety, to handle the challenge of different model capacities. The output token is then sampled from a new distribution that combines the distributions of the original and the small models. Experimental results show that SSD successfully equips the large model with the desired safety property, and also allows the model to remain helpful to benign queries. Furthermore, SSD accelerates the inference time, thanks to the speculative sampling design.

large language model, machine learning, natural language, (17 more...)

2508.17739

Country:

North America > United States (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)