AITopics | Learning Management

We formalize sequential decision-making with information acquisition as the probing-augmented user-centric selection (PUCS) framework, where a learner first probes a subset of arms to obtain side information on resources and rewards, and then assigns $K$ plays to $M$ arms. PUCS covers applications such as ridesharing, wireless scheduling, and content recommendation, in which both resources and payoffs are initially unknown and probing is costly. For the offline setting with known distributions, we present a greedy probing algorithm with a constant-factor approximation guarantee $\zeta = (e-1)/(2e-1)$. For the online setting with unknown distributions, we introduce OLPA, a stochastic combinatorial bandit algorithm that achieves a regret bound $\mathcal{O}(\sqrt{T} + \ln^{2} T)$. We also prove a lower bound $\Omega(\sqrt{T})$, showing that the upper bound is tight up to logarithmic factors. Experiments on real-world data demonstrate the effectiveness of our solutions.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

2507.20112

Country: North America > United States (0.46)

Genre: Research Report (0.81)

Industry:

Transportation > Ground > Road (0.66)
Energy > Oil & Gas (0.46)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.40)

A Large-Scale Web Search Dataset for Federated Online Learning to Rank

Gregoriadis, Marcel, Kang, Jingwei, Pouwelse, Johan

arXiv.org Artificial Intelligence, Aug-19-2025

The centralized collection of search interaction logs for training ranking models raises significant privacy concerns. Federated Online Learning to Rank (FOLTR) offers a privacy-preserving alternative by enabling collaborative model training without sharing raw user data. However, benchmarks in FOLTR are largely based on random partitioning of classical learning-to-rank datasets, simulated user clicks, and the assumption of synchronous client participation. This oversimplifies real-world dynamics and undermines the realism of experimental results. We present AOL4FOLTR, a large-scale web search dataset with 2.6 million queries from 10,000 users. Our dataset addresses key limitations of existing benchmarks by including user identifiers, real click data, and query timestamps, enabling realistic user partitioning, behavior modeling, and asynchronous federated learning scenarios.

information retrieval, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3746252.3761651

2508.12353

Country: Europe > Netherlands (0.29)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (0.68)
Education > Educational Setting > Online (0.63)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Best-of-All-Worlds Bounds for Online Learning with Feedback Graphs

Neural Information Processing Systems, Aug-18-2025, 17:31:34 GMT

We study the online learning with feedback graphs framework introduced by Mannor and Shamir [24], in which the feedback received by the online learner is specified by a graph over the available actions.

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Industry: Education > Educational Setting > Online (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.62)

Luckiness in Multiscale Online Learning

Neural Information Processing Systems, Aug-17-2025, 07:36:41 GMT

We investigate the multiscale extension of the problem where the loss ranges of the experts are vastly different.

artificial intelligence, machine learning, uscada, (15 more...)

Neural Information Processing Systems

Industry: Education > Educational Setting > Online (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.51)

Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality

Neural Information Processing Systems, Aug-17-2025, 06:58:49 GMT

We show that, surprisingly, the notion of optimal finite-time regret is not a uniquely defined property in this context and that, in general, it is decoupled from the asymptotic rate. We discuss alternative choices and propose a notion of finite-time optimality that we argue is meaningful.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Education > Educational Setting > Online (0.51)

Technology: