AITopics | vime

VIME: Variational Information Maximizing Exploration

Neural Information Processing Systems | Mar-17-2026, 10:33:29 GMT

Scalable and effective exploration remains a key challenge in reinforcement learning (RL). While there are methods with optimality guarantees in the setting of discrete state and action spaces, these methods cannot be applied in high-dimensional deep RL scenarios. As such, most contemporary RL relies on simple heuristics such as epsilon-greedy exploration or adding Gaussian noise to the controls. This paper introduces Variational Information Maximizing Exploration (VIME), an exploration strategy based on maximization of information gain about the agent's belief of environment dynamics. We propose a practical implementation, using variational inference in Bayesian neural networks which efficiently handles continuous state and action spaces. VIME modifies the MDP reward function, and can be applied with several different underlying RL algorithms. We demonstrate that VIME achieves significantly better performance compared to heuristic exploration methods across a variety of continuous control tasks and algorithms, including tasks with very sparse rewards.
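The reward modification described above can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes that the agent's belief over the dynamics-model parameters is a fully factorized Gaussian, that information gain is measured as the KL divergence between the belief after and before an update, and that the bonus is scaled by a hyperparameter (called `eta` here). The function names and the default value of `eta` are hypothetical.

```python
import math


def kl_diag_gaussians(mu_new, sigma_new, mu_old, sigma_old):
    """KL[N(mu_new, sigma_new^2) || N(mu_old, sigma_old^2)], summed over
    independent dimensions (closed form for factorized Gaussians)."""
    total = 0.0
    for mn, sn, mo, so in zip(mu_new, sigma_new, mu_old, sigma_old):
        total += (
            math.log(so / sn)
            + (sn ** 2 + (mn - mo) ** 2) / (2.0 * so ** 2)
            - 0.5
        )
    return total


def augmented_reward(reward, info_gain, eta=0.1):
    """Add an exploration bonus proportional to the information gain.
    `eta` is an assumed trade-off hyperparameter, not a value from the source."""
    return reward + eta * info_gain


# Belief over two model parameters before and after observing a transition.
gain = kl_diag_gaussians(
    mu_new=[0.2, -0.1], sigma_new=[0.8, 0.9],
    mu_old=[0.0, 0.0], sigma_old=[1.0, 1.0],
)
shaped = augmented_reward(reward=0.0, info_gain=gain)
```

Because the bonus only changes the reward signal, the shaped reward can be passed to whichever underlying RL algorithm is in use, which is consistent with the abstract's claim that the method applies to several of them.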

artificial intelligence, machine learning, reinforcement learning, (7 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)

7d97667a3e056acab9aaf653807b4a03-Paper.pdf

Neural Information Processing Systems | Feb-9-2026, 02:59:42 GMT

learning, pretext task, tabular data, (14 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (0.94)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

7d97667a3e056acab9aaf653807b4a03-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-9-2026, 02:59:31 GMT

dataset, pretext task, tabular data, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.80)

VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain

Neural Information Processing Systems | Dec-24-2025, 05:36:19 GMT

Self- and semi-supervised learning frameworks have made significant progress in training machine learning models with limited labeled data in image and language domains. These methods rely heavily on the unique structure of the domain datasets (such as spatial relationships in images or semantic relationships in language). They are not adaptable to general tabular data, which does not have the same explicit structure as image and language data. In this paper, we fill this gap by proposing novel self- and semi-supervised learning frameworks for tabular data, which we refer to collectively as VIME (Value Imputation and Mask Estimation). We create a novel pretext task of estimating mask vectors from corrupted tabular data, in addition to the reconstruction pretext task, for self-supervised learning. We also introduce a novel tabular data augmentation method for self- and semi-supervised learning frameworks. In experiments, we evaluate the proposed framework on multiple tabular datasets from various application domains, such as genomics and clinical data. VIME exceeds state-of-the-art performance in comparison to the existing baseline methods.
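The two pretext tasks named above (mask estimation and value reconstruction) both need a corrupted copy of each table plus the mask that produced it. The abstract does not spell out the corruption procedure, so the sketch below makes assumptions: the mask is drawn per cell from a Bernoulli distribution with probability `p_mask`, and a masked cell is replaced by a value taken from a shuffled copy of the same column. The function names and the parameter `p_mask` are hypothetical, and the encoder and the two prediction heads are omitted.

```python
import random


def make_mask(n_rows, n_cols, p_mask, rng):
    """Draw a binary mask; 1 marks a cell selected for corruption."""
    return [
        [1 if rng.random() < p_mask else 0 for _ in range(n_cols)]
        for _ in range(n_rows)
    ]


def corrupt(rows, mask, rng):
    """Replace masked cells with values from a shuffled copy of their column.

    Returns the corrupted table and the effective mask, which is 1 only
    where the corrupted value actually differs from the original.
    """
    n_rows, n_cols = len(rows), len(rows[0])
    shuffled_cols = []
    for j in range(n_cols):
        col = [row[j] for row in rows]
        rng.shuffle(col)  # keeps the column's empirical distribution
        shuffled_cols.append(col)
    corrupted = [
        [shuffled_cols[j][i] if mask[i][j] else rows[i][j] for j in range(n_cols)]
        for i in range(n_rows)
    ]
    effective = [
        [int(corrupted[i][j] != rows[i][j]) for j in range(n_cols)]
        for i in range(n_rows)
    ]
    return corrupted, effective


rng = random.Random(0)
table = [[1, 10], [2, 20], [3, 30], [4, 40]]
mask = make_mask(len(table), len(table[0]), p_mask=0.5, rng=rng)
corrupted, effective = corrupt(table, mask, rng)
```

Under these assumptions, a self-supervised model would take `corrupted` as input and be trained to predict `effective` (mask estimation) and `table` (reconstruction).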

self- and semi-supervised learning, semi-supervised learning, semi-supervised learning framework, (10 more...)

Industry: Health & Medicine (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

VIME: Variational Information Maximizing Exploration

Neural Information Processing Systems | Nov-21-2025, 15:13:05 GMT

Scalable and effective exploration remains a key challenge in reinforcement learning (RL). While there are methods with optimality guarantees in the setting of discrete state and action spaces, these methods cannot be applied in high-dimensional deep RL scenarios. As such, most contemporary RL relies on simple heuristics such as epsilon-greedy exploration or adding Gaussian noise to the controls. This paper introduces Variational Information Maximizing Exploration (VIME), an exploration strategy based on maximization of information gain about the agent's belief of environment dynamics. We propose a practical implementation, using variational inference in Bayesian neural networks which efficiently handles continuous state and action spaces. VIME modifies the MDP reward function, and can be applied with several different underlying RL algorithms. We demonstrate that VIME achieves significantly better performance compared to heuristic exploration methods across a variety of continuous control tasks and algorithms, including tasks with very sparse rewards.

name change, variational information maximizing exploration, vime, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)

VIME: Variational Information Maximizing Exploration

Rein Houthooft, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter Abbeel

Neural Information Processing Systems | Nov-21-2025, 08:52:05 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, exploration, machine learning, (14 more...)

Country:

Europe > Belgium > Flanders (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

7d97667a3e056acab9aaf653807b4a03-Supplemental.pdf

Neural Information Processing Systems | Oct-3-2025, 08:36:48 GMT

dataset, matrix, pretext task, (15 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)
North America > United States (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain

Neural Information Processing Systems | Oct-3-2025, 08:36:41 GMT

These methods heavily rely on the unique structure in the domain datasets (such as spatial relationships in images or semantic relationships in language).

learning, pretext task, tabular data, (15 more...)

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.94)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

7d97667a3e056acab9aaf653807b4a03-AuthorFeedback.pdf

Neural Information Processing Systems | Oct-3-2025, 08:36:30 GMT

dataset, pretext task, tabular data, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.80)

Review for NeurIPS paper: VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain

Neural Information Processing Systems | Jan-26-2025, 03:03:03 GMT

Weaknesses: My central concern for this paper is the misalignment between the motivation and methodology. As motivation, the authors argue that self-supervised CV and NLP "algorithms are not effective for tabular data." The proposed model, though, is effectively the binary masked language model whose variants pervade self-supervised NLP research (e.g. Granted, instead of masking words, the proposed models are masking tabular values, but this is performing a very similar pretext task. In fact, there is concurrent work that learns tabular representations using a BERT model [1].

self- and semi-supervised learning, tabular data, tabular domain, (7 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.47)
