AITopics | Learning Management

Collaborating Authors

Learning Management

News Overviews Instructional Materials AI-Alerts Classics

Learning Predictions for Algorithms with Predictions

Neural Information Processing SystemsApr-24-2026, 19:35:24 GMT

A burgeoning paradigm in algorithm design is the field of algorithms with predictions, in which algorithms can take advantage of a possibly-imperfect prediction of some aspect of the problem. While much work has focused on using predictions to improve competitive ratios, running times, or other performance measures, less effort has been devoted to the question of how to obtain the predictions themselves, especially in the critical online setting. We introduce a general design approach for algorithms that learn predictors: (1) identify a functional dependence of the performance measure on the prediction quality and (2) apply techniques from online learning to learn predictors, tune robustness-consistency trade-offs, and bound the sample complexity. We demonstrate the effectiveness of our approach by applying it to bipartite matching, ski-rental, page migration, and job scheduling. In several settings we improve upon multiple existing results while utilizing a much simpler analysis, while in the others we provide the first learning-theoretic guarantees.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.68)

Genre: Research Report (0.93)

Industry: Education > Educational Setting > Online (0.36)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.36)

Add feedback

Mistake Bounds for Binary Matrix Completion

Mark Herbster, Stephen Pasteris, Massimiliano Pontil

Neural Information Processing SystemsApr-22-2026, 07:04:29 GMT

We study the problem of completing a binary matrix in an online learning setting. On each trial we predict a matrix entry and then receive the true entry. We propose a Matrix Exponentiated Gradient algorithm [1] to solve this problem. We provide a mistake bound for the algorithm, which scales with the margin complexity [2, 3] of the underlying matrix. The bound suggests an interpretation where each row of the matrix is a prediction task over a finite set of objects, the columns. Using this we show that the algorithm makes a number of mistakes which is comparable up to a logarithmic factor to the number of mistakes made by the Kernel Perceptron with an optimal kernel in hindsight. We discuss applications of the algorithm to predicting as well as the best biclustering and to the problem of predicting the labeling of a graph without knowing the graph in advance.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe (0.68)
North America > United States (0.46)

Industry:

Government (0.47)
Education > Educational Setting > Online (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.36)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.35)

Add feedback

Online learning with noisy side observations

Kocák, Tomáš, Neu, Gergely, Valko, Michal

arXiv.org Machine LearningApr-16-2026

We propose a new partial-observability model for online learning problems where the learner, besides its own loss, also observes some noisy feedback about the other actions, depending on the underlying structure of the problem. We represent this structure by a weighted directed graph, where the edge weights are related to the quality of the feedback shared by the connected nodes. Our main contribution is an efficient algorithm that guarantees a regret of $\widetilde{O}(\sqrt{α^* T})$ after $T$ rounds, where $α^*$ is a novel graph property that we call the effective independence number. Our algorithm is completely parameter-free and does not require knowledge (or even estimation) of $α^*$. For the special case of binary edge weights, our setting reduces to the partial-observability models of Mannor and Shamir (2011) and Alon et al. (2013) and our algorithm recovers the near-optimal regret bounds.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Machine Learning

2604.1374

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)

Genre: Research Report (0.50)

Industry: Education > Educational Setting > Online (0.72)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.72)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

Gradient-Variation Regret Bounds for Unconstrained Online Learning

Zhao, Yuheng, Jacobsen, Andrew, Cesa-Bianchi, Nicolò, Zhao, Peng

arXiv.org Machine LearningApr-14-2026

We develop parameter-free algorithms for unconstrained online learning with regret guarantees that scale with the gradient variation $V_T(u) = \sum_{t=2}^T \|\nabla f_t(u)-\nabla f_{t-1}(u)\|^2$. For $L$-smooth convex loss, we provide fully-adaptive algorithms achieving regret of order $\widetilde{O}(\|u\|\sqrt{V_T(u)} + L\|u\|^2+G^4)$ without requiring prior knowledge of comparator norm $\|u\|$, Lipschitz constant $G$, or smoothness $L$. The update in each round can be computed efficiently via a closed-form expression. Our results extend to dynamic regret and find immediate implications to the stochastically-extended adversarial (SEA) model, which significantly improves upon the previous best-known result [Wang et al., 2025].

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

2604.11151

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Education > Educational Setting > Online (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.62)

Add feedback

Fully Unconstrained Online Learning

Neural Information Processing SystemsMar-18-2026, 07:19:42 GMT

Importantly, this matches the optimal bound $G\|w_\star\|\sqrt{T}$ available with such knowledge (up to logarithmic factors), unless either $\|w_\star\|$ or $G$ is so large that even $G\|w_\star\|\sqrt{T}$ is roughly linear in $T$. Thus, at a high level it matches the optimal bound in all cases in which one can achieve sublinear regret.

artificial intelligence, machine learning, unconstrained online learning ashok cutkosky, (7 more...)

Neural Information Processing Systems

Industry: Education > Educational Setting > Online (0.45)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.45)
Information Technology > Artificial Intelligence > Machine Learning (0.45)

Add feedback

Online Learning of Delayed Choices

Neural Information Processing SystemsMar-17-2026, 20:28:45 GMT

Choice models are essential for understanding decision-making processes in domains like online advertising, product recommendations, and assortment optimization. The Multinomial Logit (MNL) model is particularly versatile in selecting products or advertisements for display. However, challenges arise with unknown MNL parameters and delayed feedback, requiring sellers to learn customers' choice behavior and make dynamic decisions with biased knowledge due to delays. We address these challenges by developing an algorithm that handles delayed feedback, balancing exploration and exploitation using confidence bounds and optimism. We first consider a censored setting where a threshold for considering feedback is imposed by business requirements. Our algorithm demonstrates a $\tilde{O}(\sqrt{NT})$ regret, with a matching lower bound up to a logarithmic term. Furthermore, we extend our analysis to environments with non-thresholded delays, achieving a $\tilde{O}(\sqrt{NT})$ regret. To validate our approach, we conduct experiments that confirm the effectiveness of our algorithm.

artificial intelligence, machine learning, neural information processing system 37, (8 more...)

Neural Information Processing Systems

Industry:

Marketing (0.61)
Education > Educational Setting > Online (0.44)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.44)
Information Technology > Artificial Intelligence > Machine Learning (0.44)

Add feedback

Online Learning with Transductive Regret

Neural Information Processing SystemsMar-17-2026, 15:41:49 GMT

We study online learning with the general notion of transductive regret, that is regret with modification rules applying to expert sequences (as opposed to single experts) that are representable by weighted finite-state transducers. We show how transductive regret generalizes existing notions of regret, including: (1) external regret; (2) internal regret; (3) swap regret; and (4) conditional swap regret. We present a general and efficient online learning algorithm for minimizing transductive regret. We further extend that to design efficient algorithms for the time-selection and sleeping expert settings. A by-product of our study is an algorithm for swap regret, which, under mild assumptions, is more efficient than existing ones, and a substantially more efficient algorithm for time selection swap regret.

artificial intelligence, machine learning, neural information processing system 30, (7 more...)

Neural Information Processing Systems

Industry: Education > Educational Setting > Online (0.93)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.93)
Information Technology > Artificial Intelligence > Machine Learning (0.78)

Add feedback

Online Learning with an Unknown Fairness Metric

Stephen Gillen, Christopher Jung, Michael Kearns, Aaron Roth

Neural Information Processing SystemsMar-15-2026, 07:35:17 GMT

We consider the problem of online learning in the linear contextual bandits setting, but in which there are also strong individual fairness constraints governed by an unknown similarity metric. These constraints demand that we select similar actions or individuals with approximately equal probability [?], which may be at odds with optimizing reward, thus modeling settings where profit and social policy are in tension. We assume we learn about an unknown Mahalanobis similarity metric from only weak feedback that identifies fairness violations, but does not quantify their extent. This is intended to represent the interventions of a regulator who "knows unfairness when he sees it" but nevertheless cannot enunciate a quantitative fairness metric over individuals. Our main result is an algorithm in the adversarial context setting that has a number of fairness violations that depends only logarithmically on T, while obtaining an optimal O( T) regret bound to the best fair policy.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.68)

Industry: Education > Educational Setting > Online (0.61)

Technology:

Information Technology > Data Science (0.94)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

80b618ebcac7aa97a6dac2ba65cb7e36-Supplemental.pdf

Neural Information Processing SystemsFeb-19-2026, 03:53:23 GMT

batch, fairness, violation, (17 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)
(6 more...)

Genre: Research Report (0.66)

Industry: Education > Educational Setting > Online (0.72)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.71)
Information Technology > Data Science > Data Mining (0.68)

Add feedback

The Gain of Ordering in Online Learning

Neural Information Processing SystemsFeb-17-2026, 02:20:22 GMT

V ov95, CBL06] and online convex optimization [Haz16, Ora19] have been developed. Until the labels of all examples of X have been predicted: The learning algorithm picks a point x X and makes a prediction z R about its label.

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > California (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.46)

Add feedback