AITopics | Education

Collaborating Authors

Education

Learning Differentiable Programs with Admissible Neural Heuristics Ameesh Shah

Neural Information Processing SystemsOct-2-2025, 15:39:00 GMT

This relaxed program is differentiable and can be trained end-to-end, and the resulting training loss is an approximately admissible heuristic that can guide the combinatorial search.

artificial intelligence, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
North America > Canada (0.94)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Natural Language (0.69)
(2 more...)

Add feedback

Explore no more: Improved high-probability regret bounds for non-stochastic bandits

Gergely Neu

Neural Information Processing SystemsOct-2-2025, 15:28:12 GMT

This work addresses the problem of regret minimization in non-stochastic multi-armed bandit problems, focusing on performance guarantees that hold with high probability. Such results are rather scarce in the literature since proving them requires a large deal of technical effort and significant modifications to the standard, more intuitive algorithms that come only with guarantees that hold on expectation. One of these modifications is forcing the learner to sample arms from the uniform distribution at least Ω( T) times over T rounds, which can adversely affect performance if many of the arms are suboptimal. While it is widely conjectured that this property is essential for proving high-probability regret bounds, we show in this paper that it is possible to achieve such strong results without this undesirable exploration component. Our result relies on a simple and intuitive loss-estimation strategy called Implicit eXploration (IX) that allows a remarkably clean analysis. To demonstrate the flexibility of our technique, we derive several improved high-probability bounds for various extensions of the standard multi-armed bandit framework. Finally, we conduct a simple experiment that illustrates the robustness of our implicit exploration technique.

artificial intelligence, data mining, machine learning, (21 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Education > Educational Setting (0.47)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

33a5435d4f945aa6154b31a73bab3b73-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 15:27:01 GMT

artificial intelligence, deep learning, machine learning, (15 more...)

Neural Information Processing Systems

Country: Europe (0.67)

Genre: Research Report > New Finding (0.93)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Model Selection for Contextual Bandits

Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo

Neural Information Processing SystemsOct-2-2025, 15:18:58 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, contextual bandit, machine learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Industry: Education > Educational Setting (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Graph-Based Semi-Supervised Learning with Non-ignorable Non-response

Fan Zhou, Tengfei Li, Haibo Zhou, Hongtu Zhu, Ye Jieping

Neural Information Processing SystemsOct-2-2025, 15:12:07 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, identifiability, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America (0.28)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Multi-Task Reinforcement Learning with Soft Modularization Ruihan Y ang

Neural Information Processing SystemsOct-2-2025, 15:09:21 GMT

Multi-task learning is a very challenging problem in reinforcement learning.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country: Asia (0.46)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition

Satoshi Tsutsui, Yanwei Fu, David Crandall

Neural Information Processing SystemsOct-2-2025, 15:03:44 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, classifier, machine learning, (12 more...)

Neural Information Processing Systems

Country: North America > United States (0.47)

Genre: Research Report (0.48)

Industry:

Information Technology (0.47)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints

Sebastian Tschiatschek, Ahana Ghosh, Luis Haug, Rati Devidze, Adish Singla

Neural Information Processing SystemsOct-2-2025, 14:26:44 GMT

Neural Information Processing Systems http://nips.cc/

learner, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Continual Deep Learning by Functional Regularisation of Memorable Past

Neural Information Processing SystemsOct-2-2025, 14:18:28 GMT

The ability to quickly adapt to changing environments is an important quality of intelligent systems. For such quick adaptation, it is important to be able to identify, memorise, and recall useful past experiences when acquiring new ones.

artificial intelligence, deep learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: