AITopics | Learning Management

Collaborating Authors

Learning Management

News Overviews Instructional Materials AI-Alerts Classics

Online Learning of Delayed Choices

Neural Information Processing SystemsMar-17-2026, 20:28:45 GMT

Choice models are essential for understanding decision-making processes in domains like online advertising, product recommendations, and assortment optimization. The Multinomial Logit (MNL) model is particularly versatile in selecting products or advertisements for display. However, challenges arise with unknown MNL parameters and delayed feedback, requiring sellers to learn customers' choice behavior and make dynamic decisions with biased knowledge due to delays. We address these challenges by developing an algorithm that handles delayed feedback, balancing exploration and exploitation using confidence bounds and optimism. We first consider a censored setting where a threshold for considering feedback is imposed by business requirements. Our algorithm demonstrates a $\tilde{O}(\sqrt{NT})$ regret, with a matching lower bound up to a logarithmic term. Furthermore, we extend our analysis to environments with non-thresholded delays, achieving a $\tilde{O}(\sqrt{NT})$ regret. To validate our approach, we conduct experiments that confirm the effectiveness of our algorithm.

artificial intelligence, machine learning, neural information processing system 37, (8 more...)

Neural Information Processing Systems

Industry:

Marketing (0.61)
Education > Educational Setting > Online (0.44)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.44)
Information Technology > Artificial Intelligence > Machine Learning (0.44)

Add feedback

Online Learning with Transductive Regret

Neural Information Processing SystemsMar-17-2026, 15:41:49 GMT

We study online learning with the general notion of transductive regret, that is regret with modification rules applying to expert sequences (as opposed to single experts) that are representable by weighted finite-state transducers. We show how transductive regret generalizes existing notions of regret, including: (1) external regret; (2) internal regret; (3) swap regret; and (4) conditional swap regret. We present a general and efficient online learning algorithm for minimizing transductive regret. We further extend that to design efficient algorithms for the time-selection and sleeping expert settings. A by-product of our study is an algorithm for swap regret, which, under mild assumptions, is more efficient than existing ones, and a substantially more efficient algorithm for time selection swap regret.

artificial intelligence, machine learning, neural information processing system 30, (7 more...)

Neural Information Processing Systems

Industry: Education > Educational Setting > Online (0.93)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.93)
Information Technology > Artificial Intelligence > Machine Learning (0.78)

Add feedback

Online Learning with an Unknown Fairness Metric

Stephen Gillen, Christopher Jung, Michael Kearns, Aaron Roth

Neural Information Processing SystemsMar-15-2026, 07:35:17 GMT

We consider the problem of online learning in the linear contextual bandits setting, but in which there are also strong individual fairness constraints governed by an unknown similarity metric. These constraints demand that we select similar actions or individuals with approximately equal probability [?], which may be at odds with optimizing reward, thus modeling settings where profit and social policy are in tension. We assume we learn about an unknown Mahalanobis similarity metric from only weak feedback that identifies fairness violations, but does not quantify their extent. This is intended to represent the interventions of a regulator who "knows unfairness when he sees it" but nevertheless cannot enunciate a quantitative fairness metric over individuals. Our main result is an algorithm in the adversarial context setting that has a number of fairness violations that depends only logarithmically on T, while obtaining an optimal O( T) regret bound to the best fair policy.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.68)

Industry: Education > Educational Setting > Online (0.61)

Technology:

Information Technology > Data Science (0.94)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

80b618ebcac7aa97a6dac2ba65cb7e36-Supplemental.pdf

Neural Information Processing SystemsFeb-19-2026, 03:53:23 GMT

batch, fairness, violation, (17 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)
(6 more...)

Genre: Research Report (0.66)

Industry: Education > Educational Setting > Online (0.72)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.71)
Information Technology > Data Science > Data Mining (0.68)

Add feedback

The Gain of Ordering in Online Learning

Neural Information Processing SystemsFeb-17-2026, 02:20:22 GMT

V ov95, CBL06] and online convex optimization [Haz16, Ora19] have been developed. Until the labels of all examples of X have been predicted: The learning algorithm picks a point x X and makes a prediction z R about its label.

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > California (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.46)

Add feedback

Universal Rates for Active Learning

Neural Information Processing SystemsFeb-16-2026, 10:19:07 GMT

In this work we study the problem of actively learning binary classifiers from a given concept class, i.e., learning by utilizing unlabeled data and submitting targeted queries about their labels to a domain expert. We evaluate the quality of our solutions by considering the learning curves they induce, i.e., the rate of

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report > Experimental Study (0.92)

Industry: Education > Educational Setting > Online (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.34)

Add feedback

77fa8253adfc8b33209639f3e9985741-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 23:23:46 GMT

adversary, algorithm, online, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Hong Kong (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.46)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.43)

Add feedback

Riemannian Projection-free Online Learning

Neural Information Processing SystemsFeb-15-2026, 14:25:32 GMT

In Euclidean space, OCO boasts a robust theoretical foundation and numerous real-world applications, such as online load balancing (Molinaro, 2017), optimal control (Li et al., 2019), revenue maximization (Lin et al., 2019), and portfolio management (Jézéquel et al., 2022).

artificial intelligence, exp 1, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry:

Education > Educational Setting > Online (0.41)
Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.41)

Add feedback

Universal Online Learning with Gradient Variations: A Multi-layer Online Ensemble Approach

Neural Information Processing SystemsFeb-14-2026, 19:58:09 GMT

In this paper, we propose an online convex optimization approach with two different levels of adaptivity. On a higher level, our approach is agnostic to the unknown types and curvatures of the online functions, while at a lower level, it can exploit the unknown niceness of the environments and attain problem-dependent guarantees.

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country: