AITopics

e92e1b476bb5262d793fd40931e0ed53-AuthorFeedback.pdf

Neural Information Processing SystemsMar-21-2025, 13:31:36 GMT

artificial intelligence, book review, reviewer, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.51)

Add feedback

Mutual Information Regularized Offline Reinforcement Learning

Neural Information Processing SystemsMar-21-2025, 13:31:28 GMT

The major challenge of offline RL is the distribution shift that appears when outof-distribution actions are queried, which makes the policy improvement direction biased by extrapolation errors. Most existing methods address this problem by penalizing the policy or value for deviating from the behavior policy during policy improvement or evaluation. In this work, we propose a novel MISA framework to approach offline RL from the perspective of Mutual Information between States and Actions in the dataset by directly constraining the policy improvement direction.

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Add feedback

A Proof of Theorem 1 is guaranteed in Π

Neural Information Processing SystemsMar-21-2025, 13:31:20 GMT

For the first argument, we use induction. For the second part, we it is essentially a Coupon Collector's problem. Consider a general sampling problem: for any finite set N with |N | = N. For any n, whose sampling probability is p(c), with a probability at least 1 δ, it requires at most log(1/δ) for n to be sampled. We consider the combination lock problem [20].

artificial intelligence, molecule, probability, (15 more...)

Neural Information Processing Systems

Industry: Energy (0.33)

Technology: Information Technology > Artificial Intelligence (0.31)

Add feedback

e904831f48e729f9ad8355a894334700-Paper.pdf

Neural Information Processing SystemsMar-21-2025, 13:31:14 GMT

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
(2 more...)

Add feedback

e904831f48e729f9ad8355a894334700-AuthorFeedback.pdf

Neural Information Processing SystemsMar-21-2025, 13:31:03 GMT

artificial intelligence, conformer, gibbs score, (13 more...)

Neural Information Processing Systems

Industry: Energy (0.57)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.30)

Add feedback

Average-Reward Learning and Planning with Options Yi Wan, Richard S. Sutton

Neural Information Processing SystemsMar-21-2025, 13:30:59 GMT

We extend the options framework for temporal abstraction in reinforcement learning from discounted Markov decision processes (MDPs) to average-reward MDPs. Our contributions include general convergent off-policy inter-option learning algorithms, intra-option algorithms for learning values and models, as well as samplebased planning variants of our learning algorithms. Our algorithms and convergence proofs extend those recently developed by Wan, Naik, and Sutton. We also extend the notion of option-interrupting behavior from the discounted to the average-reward formulation. We show the efficacy of the proposed algorithms with experiments on a continuing version of the Four-Room domain.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: North America > Canada > Alberta (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Add feedback

PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation

Neural Information Processing SystemsMar-21-2025, 13:30:49 GMT

Language-guided robotic manipulation is a challenging task that requires an embodied agent to follow abstract user instructions to accomplish various complex manipulation tasks.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
(2 more...)

Add feedback

4bad7c27534efca029ca0d366c47c0e3-Paper-Conference.pdf

Neural Information Processing SystemsMar-21-2025, 13:30:38 GMT

artificial intelligence, detection, machine learning, (16 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers

Neural Information Processing SystemsMar-21-2025, 13:30:30 GMT

Recent token reduction methods for Vision Transformers (ViTs) incorporate token merging, which measures the similarities between token embeddings and combines the most similar pairs. However, their merging policies are directly dependent on intermediate features in ViTs, which prevents exploiting features tailored for merging and requires end-to-end training to improve token merging. This paper proposes Decoupled Token Embedding for Merging (DTEM) that enhances token merging through a decoupled embedding learned via a continuously relaxed token merging process. Our method introduces a lightweight embedding module decoupled from the ViT forward pass to extract dedicated features for token merging, addressing the restriction from using intermediate features. The continuously relaxed token merging, applied during training, enables us to learn the decoupled embeddings in a differentiable manner. Thanks to the decoupled structure, our method can be seamlessly integrated into existing ViT backbones and trained either modularly by learning only the decoupled embeddings or end-to-end by fine-tuning. We demonstrate the applicability of DTEM on various tasks, including classification, captioning, and segmentation, with consistent improvement in token merging. Especially in the ImageNet-1k classification, DTEM achieves a 37.2% reduction in FLOPs while maintaining a top-1 accuracy of 79.85% with DeiT-small. Code is available at https://github.com/movinghoon/dtem.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Genre: