AITopics | Education

Partial monitoring is a general model for online learning with limited feedback: a learner chooses actions in a sequential manner while an opponent chooses outcomes. In every round, the learner suffers some loss and receives some feedback based on the action and the outcome.
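The interaction protocol described above can be sketched in a few lines. This is only an illustrative sketch, not code from the paper: the function name `play_partial_monitoring`, the matrices `LOSS` and `FEEDBACK`, and the fixed learner are all hypothetical. It assumes finite action and outcome sets, with the loss and the feedback symbol each looked up in a table indexed by (action, outcome).

```python
def play_partial_monitoring(loss, feedback, choose_action, outcomes):
    """Run the protocol for a fixed outcome sequence (assumed, for illustration).

    Each round the learner picks an action from its past observations only.
    It suffers loss[action][outcome] but observes just feedback[action][outcome].
    """
    total_loss = 0.0
    history = []  # (action, feedback symbol) pairs visible to the learner
    for outcome in outcomes:
        action = choose_action(history)
        total_loss += loss[action][outcome]  # suffered, never shown directly
        history.append((action, feedback[action][outcome]))
    return total_loss, history


# Hypothetical 2-action, 2-outcome game.
LOSS = [[0.0, 1.0], [1.0, 0.0]]
# Action 0 returns the same symbol for both outcomes (uninformative);
# action 1 returns a different symbol per outcome (fully informative).
FEEDBACK = [["a", "a"], ["x", "y"]]


def always_informative(history):
    """Toy learner that always plays the informative action 1."""
    return 1


total, seen = play_partial_monitoring(LOSS, FEEDBACK, always_informative, [0, 1, 1, 0])
```

In this toy game the learner can only tell the outcomes apart by playing action 1, which is the kind of trade-off between loss and information that the partial monitoring model captures.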

algorithm, opponent, outcome distribution, (13 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland (0.05)
North America > United States (0.04)
Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)

Industry:

Leisure & Entertainment > Games (0.48)
Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Communications > Social Media (0.68)
Information Technology > Game Theory (0.68)

The Blinded Bandit: Learning with Adaptive Feedback

Ofer Dekel, Elad Hazan, Tomer Koren

Neural Information Processing Systems, Oct-2-2025, 21:31:20 GMT

We study an online learning setting where the player is temporarily deprived of feedback each time it switches to a different action. Such a model of adaptive feedback naturally occurs in scenarios where the environment reacts to the player's actions and requires some time to recover and stabilize after the algorithm switches actions. This motivates a variant of the multi-armed bandit problem, which we call the blinded multi-armed bandit, in which no feedback is given to the algorithm whenever it switches arms. We develop efficient online learning algorithms for this problem and prove that they guarantee the same asymptotic regret as the optimal algorithms for the standard multi-armed bandit problem. This result stands in stark contrast to another recent result, which states that adding a switching cost to the standard multi-armed bandit makes it substantially harder to learn, and provides a direct comparison of how feedback and loss contribute to the difficulty of an online learning problem. We also extend our results to the general prediction framework of bandit linear optimization, again attaining near-optimal regret bounds.
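The feedback rule of the blinded bandit can be made concrete with a small sketch. This is not the authors' algorithm, only an illustration of the setting: the helper name `blinded_feedback` and the loss table are hypothetical, and the sketch assumes that the first round does not count as a switch. The player always suffers the loss of the arm it plays, but observes that loss only on rounds where it repeats the previous arm.

```python
def blinded_feedback(arms, losses):
    """Simulate the blinded bandit feedback rule for a given arm sequence.

    arms[t] is the arm played in round t and losses[t][arm] is its loss.
    Returns (total_loss, observations), where observations[t] is None on
    every round in which the player switched arms.
    Assumption: round 0 has no previous arm, so it is not treated as a switch.
    """
    total_loss = 0.0
    observations = []
    previous = None
    for t, arm in enumerate(arms):
        total_loss += losses[t][arm]  # loss is suffered on every round
        switched = previous is not None and arm != previous
        observations.append(None if switched else losses[t][arm])
        previous = arm
    return total_loss, observations


# Hypothetical 2-arm loss table over 5 rounds.
LOSSES = [[0.1, 0.9], [0.2, 0.8], [0.3, 0.7], [0.4, 0.6], [0.5, 0.5]]
total, observed = blinded_feedback([0, 0, 1, 1, 0], LOSSES)
```

Here the player switches in rounds 2 and 4, so those two entries of `observed` are `None` while the losses for those rounds still count toward `total`.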

algorithm, multi-armed bandit problem, sequence, (14 more...)

Genre: Research Report (0.35)

Industry: Education (0.96)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Regularized linear autoencoders recover the principal components, eventually

Neural Information Processing Systems, Oct-2-2025, 21:27:49 GMT

While there has been rapid progress in understanding the learning dynamics of neural networks, most such work focuses on the networks' ability to fit input-output relationships. However, many machine learning problems require learning representations with general utility.

artificial intelligence, machine learning, representation, (16 more...)

Country: North America > Canada (0.46)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Online Structured Meta-learning

Neural Information Processing Systems, Oct-2-2025, 20:54:03 GMT

In addition, new knowledge is further incorporated into the selected blocks.

artificial intelligence, knowledge block, machine learning, (13 more...)

Country: North America (0.28)

Industry:

Education (0.46)
Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

Ye Du

Neural Information Processing Systems, Oct-2-2025, 20:33:12 GMT

Self-training has shown great potential in semi-supervised learning.

artificial intelligence, machine learning, segmentation, (17 more...)

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Zhejiang Province > Hangzhou (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Learning New Tricks From Old Dogs: Multi-Source Transfer Learning From Pre-Trained Networks

Joshua Lee, Prasanna Sattigeri, Gregory Wornell

Neural Information Processing Systems, Oct-2-2025, 20:27:25 GMT

In the case of, e.g., object classification in images collected by

artificial intelligence, deep learning, machine learning, (18 more...)

Country:

North America > United States (0.28)
North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report (0.48)

Industry:

Information Technology (0.46)
Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

On Making Stochastic Classifiers Deterministic

Andrew Cotter, Maya Gupta, Harikrishna Narasimhan

Neural Information Processing Systems, Oct-2-2025, 20:21:37 GMT

Stochastic classifiers arise in a number of machine learning problems, and have become especially prominent of late, as they often result from constrained optimization problems, e.g. for fairness, churn, or custom losses.

classifier, machine learning, natural language, (19 more...)

Country: North America > United States (0.46)

Industry: Education (0.92)

Technology: