AITopics | domain randomization

Collaborating Authors

domain randomization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer

Neural Information Processing SystemsJun-13-2026, 01:56:06 GMT

Deploying reinforcement learning (RL) safely in the real world is challenging, as policies trained in simulators must face the inevitable . Robust safe RL techniques are provably safe, however difficult to scale, while domain randomization is more practical yet prone to unsafe behaviors. We address this gap by proposing SPiDR, short for Sim-to-real via Pessimistic Domain Randomization--a scalable algorithm with provable guarantees for safe sim-to-real transfer. SPiDR uses domain randomization to incorporate the uncertainty about the sim-to-real gap into the safety constraints, making it versatile and highly compatible with existing training pipelines. Through extensive experiments on sim-to-sim benchmarks and two distinct real-world robotic platforms, we demonstrate that SPiDR effectively ensures safety despite the sim-to-real gap while maintaining strong performance.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.41)

Add feedback

05b63fa06784b71aab3939004e0f0a0d-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 08:15:33 GMT

artificial intelligence, domain randomization, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning

Neural Information Processing SystemsApr-24-2026, 08:15:29 GMT

Many real-world domains require safe decision making in uncertain environments. In this work, we introduce a deep reinforcement learning framework for approaching this important problem. We consider a distribution over transition models, and apply a risk-averse perspective towards model uncertainty through the use of coherent distortion risk measures. We provide robustness guarantees for this framework by showing it is equivalent to a specific class of distributionally robust safe reinforcement learning problems. Unlike existing approaches to robustness in deep reinforcement learning, however, our formulation does not involve minimax optimization. This leads to an efficient, model-free implementation of our approach that only requires standard data collection from a single training environment. In experiments on continuous control tasks with safety constraints, we demonstrate that our framework produces robust performance and safety at deployment time across a range of perturbed test environments.

machine learning, model uncertainty, reinforcement learning, (15 more...)

Neural Information Processing Systems

Industry: Education (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

d3e2d61af1e9612ddecd099144e50404-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 03:17:29 GMT

agent, hyperparameter, samplr, (15 more...)

Neural Information Processing Systems

Country:

Europe > Italy (0.04)
Europe > Germany (0.04)
Asia > Singapore (0.04)

Industry: Leisure & Entertainment > Sports (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

Add feedback

A Properties of coherent distortion risk measures

Neural Information Processing SystemsFeb-7-2026, 10:14:08 GMT

The properties of coherent risk measures also lead to a useful dual representation. Let ρ be a proper, real-valued coherent risk measure. See Shapiro et al. [42] for a general treatment of this result. Therefore, we have that the RAMU safe RL problem in (3) is equivalent to (6).B.3 Proof of Corollary 1 Fix ϵ > 0 and consider ( s, a) S A . Safety constraints and environment perturbations In all of our experiments, we consider the problem of optimizing a task objective while satisfying a safety constraint.

artificial intelligence, domain randomization, machine learning, (13 more...)

Neural Information Processing Systems

Country: North America > United States (0.07)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning James Queeney

Neural Information Processing SystemsFeb-7-2026, 10:14:04 GMT

machine learning, model uncertainty, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Sim2Swim: Zero-Shot Velocity Control for Agile AUV Maneuvering in 3 Minutes

Fosso, Lauritz Rismark, Amundsen, Herman Biørn, Xanthidis, Marios, Ohrem, Sveinung Johan

arXiv.org Artificial IntelligenceDec-10-2025

Holonomic autonomous underwater vehicles (AUVs) have the hardware ability for agile maneuvering in both translational and rotational degrees of freedom (DOFs). However, due to challenges inherent to underwater vehicles, such as complex hydrostatics and hydrodynamics, parametric uncertainties, and frequent changes in dynamics due to payload changes, control is challenging. Performance typically relies on carefully tuned controllers targeting unique platform configurations, and a need for re-tuning for deployment under varying payloads and hydrodynamic conditions. As a consequence, agile maneuvering with simultaneous tracking of time-varying references in both translational and rotational DOFs is rarely utilized in practice. To the best of our knowledge, this paper presents the first general zero-shot sim2real deep reinforcement learning-based (DRL) velocity controller enabling path following and agile 6DOF maneuvering with a training duration of just 3 minutes. Sim2Swim, the proposed approach, inspired by state-of-the-art DRL-based position control, leverages domain randomization and massively parallelized training to converge to field-deployable control policies for AUVs of variable characteristics without post-processing or tuning. Sim2Swim is extensively validated in pool trials for a variety of configurations, showcasing robust control for highly agile motions.

large language model, machine learning, reinforcement learning, (20 more...)

arXiv.org Artificial Intelligence

2512.08656

Country: Europe (0.46)

Genre: Research Report (0.50)

Industry: Energy > Renewable > Wind (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.73)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Domain-randomized deep learning for neuroimage analysis

Hoffmann, Malte

arXiv.org Artificial IntelligenceDec-8-2025

Abstract--Deep learning has revolutionized neuroimage analysis by delivering unprecedented speed and accuracy. However, the narrow scope of many training datasets constrains model robustness and generalizability. This challenge is particularly acute in magnetic resonance imaging (MRI), where image appearance varies widely across pulse sequences and scanner hardware. A recent domain-randomization strategy addresses the generalization problem by training deep neural networks on synthetic images with randomized intensities and anatomical content. By generating diverse data from anatomical segmentation maps, the approach enables models to accurately process image types unseen during training, without retraining or fine-tuning. It has demonstrated effectiveness across modalities including MRI, computed tomography, positron emission tomography, and optical coherence tomography, as well as beyond neuroimaging in ultrasound, electron and fluorescence microscopy, and X-ray microtomography. This tutorial paper reviews the principles, implementation, and potential of the synthesis-driven training paradigm. It highlights key benefits, such as improved generalization and resistance to overfitting, while discussing trade-offs such as increased computational demands. Finally, the article explores practical considerations for adopting the technique, aiming to accelerate the development of generalizable tools that make deep learning more accessible to domain experts without extensive computational resources or machine learning knowledge. EUROIMAGING techniques, such as magnetic resonance imaging (MRI), have enabled the study of the human brain in vivo. Alongside advances in acquisition technology, research in neuroimage processing has led to software that automates systematic data analysis, minimizing human effort while improving accuracy and reproducibility [1]. In recent years, deep learning (DL) has been driving the development of a new class of algorithms with unprecedented speed and accuracy, and for a broad range of tasks, deep neural networks have largely replaced classical techniques. However, a key challenge for DL in neuroimaging is small and highly specific datasets. Many studies include only hundreds or even tens of subjects [2], due to factors such as the high cost of data acquisition, multiple modalities competing for scan time, the large size of multi-dimensional data like time-series acquisitions, the low prevalence of certain neurological disorders, and privacy concerns regarding data sharing [3]. Malte Hoffmann (mhoffmann@mgh.harvard.edu) is with the Athinoula A. Martinos Center for Biomedical Imaging and the Departments of Radiology at Harvard Medical School and Massachusetts General Hospital.

artificial intelligence, machine learning, noise, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/MSP.2025.3590806

2507.13458

Country: North America > United States > Massachusetts (0.24)

Genre:

Research Report (1.00)
Overview (0.93)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning Sim-to-Real Humanoid Locomotion in 15 Minutes

Seo, Younggyo, Sferrazza, Carmelo, Chen, Juyue, Shi, Guanya, Duan, Rocky, Abbeel, Pieter

arXiv.org Artificial IntelligenceDec-2-2025

Massively parallel simulation has reduced reinforcement learning (RL) training time for robots from days to minutes. However, achieving fast and reliable sim-to-real RL for humanoid control remains difficult due to the challenges introduced by factors such as high dimensionality and domain randomization. In this work, we introduce a simple and practical recipe based on off-policy RL algorithms, i.e., FastSAC and FastTD3, that enables rapid training of humanoid locomotion policies in just 15 minutes with a single RTX 4090 GPU. Our simple recipe stabilizes off-policy RL algorithms at massive scale with thousands of parallel environments through carefully tuned design choices and minimalist reward functions. We demonstrate rapid end-to-end learning of humanoid locomotion controllers on Unitree G1 and Booster T1 robots under strong domain randomization, e.g., randomized dynamics, rough terrain, and push perturbations, as well as fast training of whole-body human-motion tracking policies. We provide videos and open-source implementation at: https://younggyo.me/fastsac-humanoid.

arxiv preprint arxiv, machine learning, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

2512.01996

Genre: Research Report (0.50)

Industry: Leisure & Entertainment (0.46)

Technology: