
CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

Zheng, Xiaoji, Yang, Ziyuan, Chen, Yanhao, Peng, Yuhang, Tang, Yuanrong, Liu, Gengyuan, Chen, Bokui, Gong, Jiangtao

arXiv.org Artificial Intelligence

End-to-end autonomous driving models trained solely with imitation learning (IL) often suffer from poor generalization. In contrast, reinforcement learning (RL) promotes exploration through reward maximization but faces challenges such as sample inefficiency and unstable convergence. A natural solution is to combine IL and RL. Moving beyond the conventional two-stage paradigm (IL pretraining followed by RL fine-tuning), we propose CoIRL-AD, a competitive dual-policy framework that enables IL and RL agents to interact during training. CoIRL-AD introduces a competition-based mechanism that facilitates knowledge exchange while preventing gradient conflicts. Experiments on the nuScenes dataset show an 18% reduction in collision rate compared to baselines, along with stronger generalization and improved performance on long-tail scenarios. Code is available at: https://github.com/SEU-zxj/CoIRL-AD.
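The abstract describes a competition-based mechanism through which the IL and RL agents exchange knowledge during training. A heavily simplified, hypothetical sketch of such a mechanism follows: two policies each propose an action, a stand-in critic scores both, and the losing policy's proposal is nudged toward the winner's. All names, the scoring rule, and the update are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

# Hypothetical sketch of competition-based knowledge exchange between an
# IL policy and an RL policy. The critic and the distillation step are
# illustrative stand-ins only.

def compete(a_il, a_rl, critic):
    """Score both proposed actions; return the higher-scoring one."""
    s_il, s_rl = critic(a_il), critic(a_rl)
    winner = a_il if s_il >= s_rl else a_rl
    return winner, s_il, s_rl

def distill_toward(a_loser, a_winner, lr=0.5):
    """Soft update of the losing policy's action toward the winner's."""
    return a_loser + lr * (a_winner - a_loser)

critic = lambda a: -np.sum(a ** 2)  # toy score: prefer small control inputs
a_il = np.array([0.2, 0.1])         # action proposed by the IL policy
a_rl = np.array([0.8, -0.4])        # action proposed by the RL policy

winner, s_il, s_rl = compete(a_il, a_rl, critic)
a_rl_new = distill_toward(a_rl, winner)
# Here the IL proposal scores higher, so the RL action is pulled toward it.
```

Keeping the exchange at the action level, rather than mixing gradients from both objectives into one network, is one plausible way a dual-policy setup could avoid the gradient conflicts the abstract mentions.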


Review for NeurIPS paper: Latent World Models For Intrinsically Motivated Exploration

Neural Information Processing Systems

Summary and Contributions: The paper proposes a novel method to address the problem of exploration in RL. It is a known problem in RL that sparse rewards make random exploration _very_ inefficient. One approach to overcoming this limitation is intrinsic motivation: building an auxiliary reward signal that encourages an agent to seek novel or rare states, for example proportional to inverse visit counts or, as proposed in this paper, to some prediction error. Prediction error as a measure of novelty can be heavily affected by three sources of uncertainty: 1. novelty (epistemic) -- this is the signal we are typically after. The paper proposes a belief-state formulation that the authors claim is not too sensitive to stochasticity and is able to extrapolate the state dynamics, so that the prediction error can be a genuine measure of novelty.
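The intrinsic-motivation idea the review summarizes can be sketched in a few lines: encode observations into a latent space, predict the next latent with a forward model, and use the prediction error as an auxiliary reward. The encoder and dynamics below are toy stand-ins, not the paper's networks.

```python
import numpy as np

# Illustrative sketch: intrinsic reward as latent prediction error.
# `encode` and `forward_model` are toy stand-ins for learned networks.

def encode(obs):
    """Toy encoder: project an observation to a 2-D latent."""
    return np.tanh(obs[:2])

def forward_model(z, action):
    """Toy latent dynamics: a small linear step per action component."""
    return z + 0.1 * action

def intrinsic_reward(obs, action, next_obs):
    """Squared error of the latent forward prediction as a novelty signal."""
    z, z_next = encode(obs), encode(next_obs)
    z_pred = forward_model(z, action)
    return float(np.linalg.norm(z_pred - z_next) ** 2)

obs = np.array([0.5, -0.2, 1.0])
next_obs = np.array([0.6, -0.1, 1.0])
action = np.array([1.0, 1.0])
r_int = intrinsic_reward(obs, action, next_obs)
```

In transitions the model predicts well (familiar states) the reward shrinks toward zero, while poorly predicted (novel) transitions yield a larger bonus; the review's point is that this signal is only meaningful if the error is driven by epistemic uncertainty rather than environment stochasticity.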


Review for NeurIPS paper: Latent World Models For Intrinsically Motivated Exploration

Neural Information Processing Systems

All reviewers unanimously agree that this paper should be accepted to NeurIPS. The authors did a great job addressing almost all of the reviewers' concerns, leading to three reviewers increasing their scores after the author response. Reviewers particularly praised the readability of the paper, the fact that the method is clearly defined, and that the authors did a good job of visually demonstrating how it works. However, the reviewers also agree that CPC Action would be an important baseline to compare against, so I strongly encourage the authors to take the suggested improvements seriously and work towards an improved version of the paper. I am confident that the authors can make the requested changes and am recommending acceptance.


Latent World Models For Intrinsically Motivated Exploration

Neural Information Processing Systems

In this work we consider partially observable environments with sparse rewards. We present a self-supervised representation learning method for image-based observations, which arranges embeddings so that they respect the temporal distance between observations. This representation is empirically robust to stochasticity and suitable for novelty detection from the error of a predictive forward model. We consider both episodic and life-long uncertainties to guide exploration. We propose to estimate the missing information about the environment with the world model, which operates in the learned latent space.


Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving

Xiao, Lingyu, Liu, Jiang-Jiang, Yang, Sen, Li, Xiaofan, Ye, Xiaoqing, Yang, Wankou, Wang, Jingdong

arXiv.org Artificial Intelligence

The autoregressive world model exhibits robust generalization capabilities in vectorized scene understanding but encounters difficulties in deriving actions due to insufficient uncertainty modeling and self-delusion. In this paper, we explore the feasibility of deriving decisions from an autoregressive world model by addressing these challenges through the formulation of multiple probabilistic hypotheses. We propose LatentDriver, a framework that models the environment's next states and the ego vehicle's possible actions as a mixture distribution, from which a deterministic control signal is then derived. By incorporating mixture modeling, the stochastic nature of decision-making is captured. Additionally, the self-delusion problem is mitigated by providing intermediate actions sampled from a distribution to the world model. Experimental results on the recently released closed-loop benchmark Waymax demonstrate that LatentDriver surpasses state-of-the-art reinforcement learning and imitation learning methods, achieving expert-level performance. The code and models will be made available at https://github.com/Sephirex-X/LatentDriver.
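The step of collapsing a mixture distribution over candidate actions into one deterministic control signal can be illustrated with a minimal sketch. One simple rule, assumed here for illustration (the paper may use a different derivation), is to emit the mean of the highest-weight mixture component.

```python
import numpy as np

# Sketch: derive a deterministic control signal from a mixture over
# candidate actions. The "pick the highest-weight component's mean" rule
# is an illustrative assumption, not necessarily the paper's method.

def deterministic_action(weights, means):
    """Return the mean of the mixture component with the largest weight."""
    return means[int(np.argmax(weights))]

weights = np.array([0.2, 0.5, 0.3])   # mixture weights, summing to 1
means = np.array([[ 0.1, -0.3],       # (steering, acceleration) per mode
                  [ 0.0,  0.5],
                  [-0.2,  0.1]])

a = deterministic_action(weights, means)
# a is the mean of the second (highest-weight) component: [0.0, 0.5]
```

Keeping the full mixture during training preserves the multi-modality of driving decisions (e.g., yield vs. overtake), while the argmax collapse yields a single executable command at control time.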


Contrastive Variational Reinforcement Learning for Complex Observations

Ma, Xiao, Chen, Siwei, Hsu, David, Lee, Wee Sun

arXiv.org Machine Learning

Model-free reinforcement learning (MFRL) has achieved great success in game playing [1, 2], robot navigation [3, 4], and other domains. However, extending existing RL methods to real-world environments remains challenging, because they require long-horizon reasoning over low-dimensional useful features, e.g., the position of a robot, embedded in high-dimensional complex observations, e.g., visually rich images. Consider a four-legged mini-cheetah robot [5] navigating a campus. To determine the traversable path, the robot must extract the relevant geometric features that coexist with irrelevant, variable backgrounds, such as moving pedestrians, paintings on the wall, etc. Model-based RL (MBRL), in contrast to model-free methods, learns a world model through generative learning and greatly improves sample efficiency [6, 7, 8]. Recent MBRL methods learn compact latent world models from high-dimensional visual inputs with Variational Autoencoders (VAEs) [9] by optimizing the evidence lower bound (ELBO) of an observation sequence [10, 11]. However, learning a generative model under complex observations is challenging.
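The ELBO objective mentioned above can be written, for a single observation with a Gaussian posterior and a unit-variance Gaussian likelihood (constants dropped), as reconstruction log-likelihood minus a KL term. The sketch below computes it with numpy; the factorization and noise model are standard VAE assumptions, not details from this particular paper.

```python
import numpy as np

# Sketch of a single-observation ELBO under standard VAE assumptions:
# Gaussian posterior N(mu, exp(logvar)), standard-normal prior,
# unit-variance Gaussian likelihood (additive constants dropped).

def gaussian_kl(mu, logvar):
    """Closed-form KL( N(mu, diag(exp(logvar))) || N(0, I) )."""
    return 0.5 * np.sum(np.exp(logvar) + mu ** 2 - 1.0 - logvar)

def elbo(x, x_recon, mu, logvar):
    """Reconstruction log-likelihood minus the KL regularizer."""
    recon_ll = -0.5 * np.sum((x - x_recon) ** 2)
    return recon_ll - gaussian_kl(mu, logvar)

x = np.array([1.0, 0.0])          # observation
x_recon = np.array([0.9, 0.1])    # decoder output
mu = np.zeros(2)                  # posterior mean (at the prior here)
logvar = np.zeros(2)              # posterior log-variance

val = elbo(x, x_recon, mu, logvar)
# With the posterior equal to the prior, the KL term is 0 and the ELBO
# reduces to the (negative) squared reconstruction error: -0.01.
```

For a sequence model, the same two terms are summed over time steps with the prior supplied by the learned latent dynamics; the introduction's point is that the reconstruction term becomes problematic when observations contain large amounts of task-irrelevant detail.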