AITopics | hindsight

745b7e084d5ca5afc07fb454ab2be522-Paper-Conference.pdf

Neural Information Processing Systems | Apr-28-2026, 15:10:27 GMT

algorithm, artificial intelligence, machine learning, (17 more...)

Country: North America > United States (0.28)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.71)

NeurIPS2022_camera

Neural Information Processing Systems | Apr-24-2026, 07:53:38 GMT

Offline goal-conditioned reinforcement learning (GCRL) promises general-purpose skill learning in the form of reaching diverse goals from purely offline datasets. We propose Goal-conditioned f-Advantage Regression (GoFAR), a novel regression-based offline GCRL algorithm derived from a state-occupancy matching perspective; the key intuition is that the goal-reaching task can be formulated as a state-occupancy matching problem between a dynamics-abiding imitator agent and an expert agent that directly teleports to the goal. In contrast to prior approaches, GoFAR does not require any hindsight relabeling and enjoys uninterleaved optimization for its value and policy networks. These distinct features give GoFAR much better offline performance and stability, as well as a statistical performance guarantee that is unattainable for prior methods. Furthermore, we demonstrate that GoFAR's training objectives can be re-purposed to learn an agent-independent goal-conditioned planner from purely offline source-domain data, which enables zero-shot transfer to new target domains.
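For context, the hindsight relabeling that the abstract says GoFAR avoids is commonly implemented by replacing each transition's goal with a state the agent actually reached later in the same trajectory. The sketch below illustrates that prior-work technique (the "future" strategy from hindsight experience replay), not GoFAR itself. The names `Transition`, `sparse_reward`, and `relabel_with_future_goals`, and the 0/1 reward, are assumptions made for illustration.

```python
import random
from typing import List, NamedTuple, Tuple

State = Tuple[float, ...]


class Transition(NamedTuple):
    """One step of a goal-conditioned trajectory (hypothetical layout)."""
    state: State
    action: int
    next_state: State
    goal: State
    reward: float


def sparse_reward(achieved: State, goal: State, tol: float = 1e-6) -> float:
    """Assumed reward: 1.0 if the achieved state matches the goal, else 0.0."""
    return 1.0 if all(abs(a - g) <= tol for a, g in zip(achieved, goal)) else 0.0


def relabel_with_future_goals(
    trajectory: List[Transition], rng: random.Random
) -> List[Transition]:
    """Return a copy of the trajectory with goals swapped for reached states."""
    relabeled = []
    for t, transition in enumerate(trajectory):
        # Pick a step at or after t and treat the state reached there as the goal.
        future = rng.randrange(t, len(trajectory))
        new_goal = trajectory[future].next_state
        relabeled.append(
            transition._replace(
                goal=new_goal,
                reward=sparse_reward(transition.next_state, new_goal),
            )
        )
    return relabeled


if __name__ == "__main__":
    # A 1-D trajectory that never reaches its original goal at 10.0.
    traj = [Transition((float(i),), 1, (float(i + 1),), (10.0,), 0.0) for i in range(3)]
    for step in relabel_with_future_goals(traj, random.Random(0)):
        print(step)
```

Relabeling turns failed trajectories into successful examples for the substituted goals, which is why many offline GCRL methods depend on it; the abstract's claim is that GoFAR's occupancy-matching objective removes that dependency.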

gofar, machine learning, reinforcement learning, (13 more...)

Country: North America > United States (0.67)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Learning Formal Mathematics From Intrinsic Motivation

Gabriel Poesia, David Broman, Nick Haber, Noah D. Goodman

Neural Information Processing Systems | Feb-12-2026, 16:11:47 GMT

How did humanity coax mathematics from the æther?

large language model, logic & formal reasoning, machine learning, (20 more...)

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Sweden (0.04)
Europe > Italy (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Leisure & Entertainment > Games (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Personalized Online Federated Learning with Multiple Kernels

Neural Information Processing Systems | Feb-12-2026, 04:50:19 GMT

Employing multiple kernels instead of a single pre-selected one can lead to more accurate function approximation, since multi-kernel learning (MKL) can learn a combination of kernels [24].
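As a rough illustration of what "a combination of kernels" means, the sketch below builds a convex combination of Gaussian (RBF) kernels with different bandwidths. It is not the paper's algorithm: in MKL the weights would be learned from data, whereas here they are fixed by hand. The helper names `rbf_kernel` and `combine_kernels` are assumptions.

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]
Kernel = Callable[[Vector, Vector], float]


def rbf_kernel(bandwidth: float) -> Kernel:
    """Gaussian kernel k(x, y) = exp(-||x - y||^2 / (2 * bandwidth^2))."""
    def kernel(x: Vector, y: Vector) -> float:
        squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
        return math.exp(-squared_distance / (2.0 * bandwidth ** 2))
    return kernel


def combine_kernels(kernels: Sequence[Kernel], weights: Sequence[float]) -> Kernel:
    """Return the convex combination sum_i w_i * k_i, normalising the weights."""
    if len(kernels) != len(weights):
        raise ValueError("need exactly one weight per kernel")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    normalised = [w / total for w in weights]

    def kernel(x: Vector, y: Vector) -> float:
        return sum(w * k(x, y) for w, k in zip(normalised, kernels))
    return kernel


if __name__ == "__main__":
    narrow, wide = rbf_kernel(0.5), rbf_kernel(2.0)
    mixed = combine_kernels([narrow, wide], [1.0, 3.0])
    x, y = (0.0, 0.0), (1.0, 1.0)
    print(narrow(x, y), wide(x, y), mixed(x, y))
```

A non-negative weighted sum of positive semi-definite kernels is itself a valid kernel, so the combined function can be used wherever a single kernel would be.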

artificial intelligence, kernel, machine learning, (16 more...)

Country:

North America > United States > Virginia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

b7da6669894867f04b8727876a69ffc0-Paper.pdf

Neural Information Processing Systems | Feb-10-2026, 21:01:50 GMT

algorithm, fairness, sequence, (17 more...)

Country:

South America > Chile (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

9381fc93ad66f9ec4b2eef71147a6665-Supplemental.pdf

Neural Information Processing Systems | Feb-9-2026, 09:16:30 GMT

architecture, information, trajectory, (16 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)

Industry:

Leisure & Entertainment > Games (0.68)
Leisure & Entertainment > Sports (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

9381fc93ad66f9ec4b2eef71147a6665-Paper.pdf

Neural Information Processing Systems | Feb-9-2026, 09:16:22 GMT

information, learning, value prediction, (16 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)

Industry: Leisure & Entertainment > Games (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

57e5cb96e22546001f1d6520ff11d9ba-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-8-2026, 12:24:52 GMT

algorithm, trajectory, transition, (10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.70)

CEIL: Generalized Contextual Imitation Learning

Neural Information Processing Systems | Dec-27-2025, 03:50:29 GMT

Inspired by the formulation of hindsight information matching, we derive CEIL by explicitly learning a hindsight embedding function together with a contextual policy using the hindsight embeddings. To achieve the expert matching objective for IL, we advocate for optimizing a contextual variable such that it biases the contextual policy towards mimicking expert behaviors. Beyond the typical learning from demonstrations (LfD) setting, CEIL is a generalist that can be effectively applied to multiple settings including: 1) learning from observations (LfO), 2) offline IL, 3) cross-domain IL (mismatched experts), and 4) one-shot IL settings.

ceil, generalized contextual imitation learning, name change, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.65)

A Bandit Learning Algorithm and Applications to Auction Design

Neural Information Processing Systems | Dec-24-2025, 06:48:26 GMT

We consider online bandit learning in which at every time step, an algorithm has to make a decision and then observe only its reward. The goal is to design efficient (polynomial-time) algorithms that achieve a total reward approximately close to that of the best fixed decision in hindsight. In this paper, we introduce a new notion of $(\lambda,\mu)$-concave functions and present a bandit learning algorithm that achieves a performance guarantee which is characterized as a function of the concavity parameters $\lambda$ and $\mu$. The algorithm is based on the mirror descent algorithm in which the update directions follow the gradient of the multilinear extensions of the reward functions. The regret bound induced by our algorithm is $\widetilde{O}(\sqrt{T})$ which is nearly optimal.
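To make "the best fixed decision in hindsight" concrete, the sketch below runs Exp3, a classical multi-armed bandit algorithm often described as mirror descent with an entropic regulariser, and measures its regret against the best single arm. This is a swapped-in textbook method, not the paper's algorithm, which handles $(\lambda,\mu)$-concave rewards via multilinear extensions. The function names, the reward table, and the exploration rate `gamma` are assumptions, and rewards are assumed to lie in [0, 1].

```python
import math
import random
from typing import Sequence


def best_fixed_reward(reward_table: Sequence[Sequence[float]]) -> float:
    """Total reward of the single best arm if it were played at every step."""
    n_arms = len(reward_table[0])
    return max(sum(row[arm] for row in reward_table) for arm in range(n_arms))


def exp3_total_reward(
    reward_table: Sequence[Sequence[float]], gamma: float, rng: random.Random
) -> float:
    """Run Exp3 over the table, observing only the chosen arm's reward."""
    if not 0.0 < gamma <= 1.0:
        raise ValueError("gamma must be in (0, 1]")
    n_arms = len(reward_table[0])
    log_weights = [0.0] * n_arms  # weights kept in log space for stability
    total = 0.0
    for row in reward_table:
        shift = max(log_weights)
        weights = [math.exp(w - shift) for w in log_weights]
        weight_sum = sum(weights)
        # Mix the exponential-weights distribution with uniform exploration.
        probs = [(1.0 - gamma) * w / weight_sum + gamma / n_arms for w in weights]
        arm = rng.choices(range(n_arms), weights=probs)[0]
        reward = row[arm]  # bandit feedback: other arms stay hidden
        total += reward
        # Importance-weighted estimate keeps the reward estimate unbiased.
        log_weights[arm] += gamma * (reward / probs[arm]) / n_arms
    return total


if __name__ == "__main__":
    table = [[0.0, 1.0]] * 200  # arm 1 always pays 1, arm 0 never pays
    earned = exp3_total_reward(table, gamma=0.1, rng=random.Random(0))
    print("regret:", best_fixed_reward(table) - earned)
```

Regret here is the gap between the hindsight benchmark and the reward actually collected; a bound of order $\sqrt{T}$ up to logarithmic factors, as stated in the abstract, means this gap grows sublinearly in the number of rounds.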

algorithm, bandit learning algorithm and application, maximization, (9 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.81)

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis
