Unsupervised Video Object Segmentation for Deep Reinforcement Learning
We present a new technique for deep reinforcement learning that automatically detects moving objects and uses the relevant information for action selection. The detection of moving objects is done in an unsupervised way by exploiting structure from motion. Instead of directly learning a policy from raw images, the agent first learns to detect and segment moving objects by exploiting flow information in video sequences. The learned representation is then used to focus the policy of the agent on the moving objects. Over time, the agent identifies which objects are critical for decision making and gradually builds a policy based on relevant moving objects.
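The pipeline the abstract describes — segment moving objects from optical flow, then mask the observation so the policy attends to them — can be sketched minimally as follows (the function names, the flow-magnitude threshold, and the masking scheme are illustrative assumptions, not details from the paper):

```python
import numpy as np

def moving_object_mask(flow, threshold=1.0):
    """Segment moving objects from a dense optical-flow field.

    flow: (H, W, 2) array of per-pixel (dx, dy) motion vectors.
    Returns a boolean (H, W) mask, True where the motion magnitude
    exceeds the threshold.
    """
    magnitude = np.linalg.norm(flow, axis=-1)
    return magnitude > threshold

def masked_observation(frame, mask):
    """Zero out the static background so the policy focuses on movers."""
    return frame * mask[..., None]

# Toy example: one small "object" translating in an otherwise static scene.
flow = np.zeros((4, 4, 2))
flow[1:3, 1:3] = [2.0, 0.0]   # a 2x2 patch moving right
mask = moving_object_mask(flow)
print(mask.sum())             # 4 moving pixels
```

A learned segmentation network would replace the fixed threshold in practice, but the masking step feeding the policy is the same idea.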
MOReL: Model-Based Offline Reinforcement Learning
In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based solely on a dataset of historical interactions with the environment. This serves as an extreme test of an agent's ability to make effective use of historical data, which is known to be critical for efficient RL. Prior work in offline RL has been confined almost exclusively to model-free approaches.
Review for NeurIPS paper: MOReL: Model-Based Offline Reinforcement Learning
Additional Feedback: Most recent offline RL algorithms rely on policy regularization, where the policy being optimized is prevented from deviating too far from the data-logging policy. MOReL, by contrast, does not depend directly on the data-logging policy but instead incorporates pessimism into a model-based approach, offering another promising direction for offline RL. However, it would be more natural to penalize more uncertain states more heavily. For example, one classical model-based RL algorithm (MBIE-EB) constructs an optimistic MDP that rewards uncertain regions with a bonus proportional to 1/sqrt(N(s,a)), where N(s,a) is the visitation count. Analogously, one could consider a pessimistic MDP that penalizes uncertain regions with a penalty proportional to 1/sqrt(N(s,a)). How is using alpha greater than zero for USAD justified? - It would be great to see how sensitive the performance of the algorithm is with respect to kappa in the reward penalty and the threshold in USAD.
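The reviewer's suggested count-based pessimistic penalty (MBIE-EB's bonus with the sign flipped) can be sketched as follows; the floor of one visit for unvisited pairs is an illustrative choice to keep the expression finite, not something from the review:

```python
import math
from collections import Counter

def pessimistic_reward(r, counts, s, a, kappa=1.0):
    """Subtract kappa / sqrt(N(s, a)) from the observed reward.

    Rarely visited (uncertain) state-action pairs are penalized
    more heavily; pairs never seen in the offline dataset get the
    maximal penalty (N floored at one visit for finiteness).
    """
    n = max(counts[(s, a)], 1)
    return r - kappa / math.sqrt(n)

counts = Counter({("s0", "a0"): 100, ("s0", "a1"): 1})
well_known = pessimistic_reward(1.0, counts, "s0", "a0")  # 1 - 1/10 = 0.9
uncertain = pessimistic_reward(1.0, counts, "s0", "a1")   # 1 - 1/1  = 0.0
print(well_known > uncertain)  # True
```

Count-based penalties like this assume a discrete (or discretized) state-action space; MOReL's USAD instead uses learned-model uncertainty, which extends to continuous spaces.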
Review for NeurIPS paper: MOReL: Model-Based Offline Reinforcement Learning
All three reviewers have a favourable opinion of this paper. There are some minor questions and comments, but they can be addressed without another round of reviewing. I therefore recommend acceptance of this work. I encourage the authors to incorporate the reviewers' comments and address their concerns as much as possible.
MOReL: Model-Based Offline Reinforcement Learning
The MOReL framework consists of two steps: (a) learning a pessimistic MDP from the offline dataset; (b) learning a near-optimal policy in this pessimistic MDP. The pessimistic MDP is designed so that, for any policy, performance in the real environment is approximately lower-bounded by performance in the pessimistic MDP.
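A minimal sketch of the two ingredients behind the pessimistic MDP — an unknown state-action detector (USAD) based on ensemble disagreement, and a transition rule that penalizes unknown pairs — under assumed interfaces (the actual MOReL implementation's dynamics-model class and absorbing HALT-state mechanics differ):

```python
import numpy as np

def usad(models, s, a, threshold):
    """Unknown state-action detector: flag (s, a) as unknown when the
    ensemble of learned dynamics models disagrees too much.

    `models` is a list of callables (s, a) -> predicted next state.
    """
    preds = np.stack([m(s, a) for m in models])
    disagreement = np.max(
        np.linalg.norm(preds[:, None] - preds[None, :], axis=-1)
    )
    return disagreement > threshold

def pessimistic_step(models, reward_fn, s, a, threshold, penalty):
    """One transition of the pessimistic MDP: known pairs follow the
    (mean) learned model; unknown pairs get a large negative reward,
    standing in for MOReL's absorbing HALT state."""
    if usad(models, s, a, threshold):
        return s, -penalty
    next_s = np.mean([m(s, a) for m in models], axis=0)
    return next_s, reward_fn(s, a)
```

Planning against `pessimistic_step` discourages the policy from wandering into regions the offline dataset does not cover, which is what yields the lower-bound property stated above.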