AITopics | Education

Collaborating Authors

Education

63dc7ed1010d3c3b8269faf0ba7491d4-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 01:00:07 GMT

In this document, we provide details and supplementary materials that cannot fit into the main manuscript due to the page limit. The specific form ofcenter distribution isunknown, but we can still train a generatorG to approximate it. IfR(G,D,T)),wechooseλ=0, i.e., no restriction onR(G,D,T)), to obtain the minimal cost. IfR(G,D,T)) >, then a large λshould be applied as apenalization. According to the derivation of Eq. (3), we obtain arelaxed versionoftheintractableEq.(2),expressedasfollows: min Inknowledge distillation, student models arecrafted using unlabeled datasets, where only thesoft targets from teachers are utilized.

artificial intelligence, machine learning, stride, (18 more...)

Neural Information Processing Systems

Industry: Education (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.36)

Add feedback

63dc7ed1010d3c3b8269faf0ba7491d4-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 01:00:04 GMT

arxiv preprint arxiv, knowledge distillation, ood data, (11 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
North America > United States > Colorado > El Paso County > Colorado Springs (0.04)
North America > United States > California (0.04)
Asia > China > Zhejiang Province (0.04)

Genre: Research Report (0.68)

Industry: Education (0.69)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Loss Decoupling for Task-Agnostic Continual Learning Y an-Shuo Liang and Wu-Jun Li

Neural Information Processing SystemsFeb-9-2026, 00:59:36 GMT

Continual learning requires the model to learn multiple tasks in a sequential order.

artificial intelligence, continual learning, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Genre:

Research Report (0.46)
Instructional Material (0.30)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection Xiao Y u 1,2, Y uang Qi

Neural Information Processing SystemsFeb-9-2026, 00:44:05 GMT

Consequently, detecting whether a text is generated by LLMs has become increasingly important.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country: Asia > China > Anhui Province > Hefei (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.68)

Industry:

Information Technology (0.68)
Education (0.67)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

62e0973455fd26eb03e91d5741a4a3bb-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 00:43:44 GMT

moment-detr, query, video, (13 more...)

Neural Information Processing Systems

Country: North America > United States > North Carolina (0.04)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

1d0ed12c3fda52f2c241a0cebcf739a6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 00:08:57 GMT

agent, jaxnav, learnability, (14 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(4 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Performative Learning Theory

Rodemann, Julian, Fischer-Abaigar, Unai, Bailie, James, Muandet, Krikamol

arXiv.org Machine LearningFeb-9-2026

Performative predictions influence the very outcomes they aim to forecast. We study performative predictions that affect a sample (e.g., only existing users of an app) and/or the whole population (e.g., all potential app users). This raises the question of how well models generalize under performativity. For example, how well can we draw insights about new app users based on existing users when both of them react to the app's predictions? We address this question by embedding performative predictions into statistical learning theory. We prove generalization bounds under performative effects on the sample, on the population, and on both. A key intuition behind our proofs is that in the worst case, the population negates predictions, while the sample deceptively fulfills them. We cast such self-negating and self-fulfilling predictions as min-max and min-min risk functionals in Wasserstein space, respectively. Our analysis reveals a fundamental trade-off between performatively changing the world and learning from it: the more a model affects data, the less it can learn from it. Moreover, our analysis results in a surprising insight on how to improve generalization guarantees by retraining on performatively distorted samples. We illustrate our bounds in a case study on prediction-informed assignments of unemployed German residents to job trainings, drawing upon administrative labor market records from 1975 to 2017 in Germany.

artificial intelligence, machine learning, prediction, (16 more...)

arXiv.org Machine Learning

2602.04402

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(8 more...)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry:

Banking & Finance > Economy (0.48)
Education > Educational Setting (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Operationalizing Stein's Method for Online Linear Optimization: CLT-Based Optimal Tradeoffs

Zhang, Zhiyu, Ramdas, Aaditya

arXiv.org Machine LearningFeb-9-2026

Adversarial online linear optimization (OLO) is essentially about making performance tradeoffs with respect to the unknown difficulty of the adversary. In the setting of one-dimensional fixed-time OLO on a bounded domain, it has been observed since Cover (1966) that achievable tradeoffs are governed by probabilistic inequalities, and these descriptive results can be converted into algorithms via dynamic programming, which, however, is not computationally efficient. We address this limitation by showing that Stein's method, a classical framework underlying the proofs of probabilistic limit theorems, can be operationalized as computationally efficient OLO algorithms. The associated regret and total loss upper bounds are "additively sharp", meaning that they surpass the conventional big-O optimality and match normal-approximation-based lower bounds by additive lower order terms. Our construction is inspired by the remarkably clean proof of a Wasserstein martingale central limit theorem (CLT) due to Röllin (2018). Several concrete benefits can be obtained from this general technique. First, with the same computational complexity, the proposed algorithm improves upon the total loss upper bounds of online gradient descent (OGD) and multiplicative weight update (MWU). Second, our algorithm can realize a continuum of optimal two-point tradeoffs between the total loss and the maximum regret over comparators, improving upon prior works in parameter-free online learning. Third, by allowing the adversary to randomize on an unbounded support, we achieve sharp in-expectation performance guarantees for OLO with noisy feedback.

algorithm, artificial intelligence, machine learning, (19 more...)

arXiv.org Machine Learning

2602.06545

Country: