AITopics | Education

Collaborating Authors

Education

fb7451e43f9c1c35b774bcfad7a5714b-Paper-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 01:16:11 GMT

arxiv preprint arxiv, generalization, length generalization, (13 more...)

Neural Information Processing Systems

Country: North America > Canada > Ontario > Toronto (0.14)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Learning

Neural Information Processing SystemsFeb-13-2026, 01:16:03 GMT

For additional motivation, it is reasonable to consider Massart noise to be a more realistic model of real-life noise (even when benign) when compared to the RCN model, as it allows for some amount of non-uniformity. This made Definition 1 a possibly tractable way to relax the noise assumption, without running intotheaforementioned computational barriers foragnostic learning.

artificial intelligence, justification, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.46)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

CanLanguageModels LearntoSkipSteps?

Neural Information Processing SystemsFeb-13-2026, 01:14:09 GMT

Yet they are still far from true intelligence, which opens up intriguing opportunities to explore the parallels of humans and modelbehaviors.

machine learning, natural language, urlhttp, (19 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)
Africa > Rwanda > Kigali > Kigali (0.04)
(2 more...)

Genre: Research Report > New Finding (0.93)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.68)

Add feedback

fadec8f2e65f181d777507d1df69b92f-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 00:36:01 GMT

experiment, fedntd, knowledge, (15 more...)

Neural Information Processing Systems

Industry: Education (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

fadec8f2e65f181d777507d1df69b92f-Paper-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 00:35:57 GMT

arxiv preprint arxiv, federated learning, learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
North America > Canada > Ontario > Toronto (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Education (1.00)
Information Technology > Security & Privacy (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

TransferableBoltzmannGenerators

Neural Information Processing SystemsFeb-13-2026, 00:35:46 GMT

The generation of equilibrium samples of molecular systems has been a longstanding problem in statistical physics.

artificial intelligence, dipeptide, machine learning, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.67)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.48)
Education (0.46)
Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

TADAM: Task dependent adaptive metric for improved few-shot learning

Boris Oreshkin, Pau Rodríguez López, Alexandre Lacoste

Neural Information Processing SystemsFeb-13-2026, 00:35:19 GMT

Neural Information Processing Systems http://nips.cc/

classification task, feature extractor, few-shot learning, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Spain (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Decomposition of Spillover Effects Under Misspecification:Pseudo-true Estimands and a Local--Global Extension

Park, Yechan, Yang, Xiaodong

arXiv.org Machine LearningFeb-13-2026

Applied work with interference typically models outcomes as functions of own treatment and a low-dimensional exposure mapping of others' treatments, even when that mapping may be misspecified. This raises a basic question: what policy object are exposure-based estimands implicitly targeting, and how should we interpret their direct and spillover components relative to the underlying policy question? We take as primitive the marginal policy effect, defined as the effect of a small change in the treatment probability under the actual experimental design, and show that any researcher-chosen exposure mapping induces a unique pseudo-true outcome model. This model is the best approximation to the underlying potential outcomes that depends only on the user-chosen exposure. Utilizing that representation, the marginal policy effect admits a canonical decomposition into exposure-based direct and spillover effects, and each component provides its optimal approximation to the corresponding oracle objects that would be available if interference were fully known. We then focus on a setting that nests important empirical and theoretical applications in which both local network spillovers and global spillovers, such as market equilibrium, operate. There, the marginal policy effect further decomposes asymptotically into direct, local, and global channels. An important implication is that many existing methods are more robust than previously understood once we reinterpret their targets as channel-specific components of this pseudo-true policy estimand. Simulations and a semi-synthetic experiment calibrated to a large cash-transfer experiment show that these components can be recovered in realistic experimental designs.

artificial intelligence, exposure mapping, machine learning, (16 more...)

arXiv.org Machine Learning

2602.12023

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
South America > Chile (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre:

Research Report > Strength High (1.00)
Research Report > Experimental Study (1.00)

Industry:

Banking & Finance (1.00)
Health & Medicine > Therapeutic Area (0.68)
Health & Medicine > Epidemiology (0.46)
Education > Educational Setting (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.67)

Add feedback

High-Probability Minimax Adaptive Estimation in Besov Spaces via Online-to-Batch

Liautaud, Paul, Gaillard, Pierre, Wintenberger, Olivier

arXiv.org Machine LearningFeb-13-2026

We study nonparametric regression over Besov spaces from noisy observations under sub-exponential noise, aiming to achieve minimax-optimal guarantees on the integrated squared error that hold with high probability and adapt to the unknown noise level. To this end, we propose a wavelet-based online learning algorithm that dynamically adjusts to the observed gradient noise by adaptively clipping it at an appropriate level, eliminating the need to tune parameters such as the noise variance or gradient bounds. As a by-product of our analysis, we derive high-probability adaptive regret bounds that scale with the $\ell_1$-norm of the competitor. Finally, in the batch statistical setting, we obtain adaptive and minimax-optimal estimation rates for Besov spaces via a refined online-to-batch conversion. This approach carefully exploits the structure of the squared loss in combination with self-normalized concentration inequalities.

artificial intelligence, machine learning, sup 1nullt, (19 more...)

arXiv.org Machine Learning

2602.11747

Country:

Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.64)

Industry: Education > Educational Setting > Online (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.34)

Add feedback

Provable Offline Reinforcement Learning for Structured Cyclic MDPs

Lee, Kyungbok, Sarteau, Angelica Cristello, Kosorok, Michael R.

arXiv.org Machine LearningFeb-13-2026

We introduce a novel cyclic Markov decision process (MDP) framework for multi-step decision problems with heterogeneous stage-specific dynamics, transitions, and discount factors across the cycle. In this setting, offline learning is challenging: optimizing a policy at any stage shifts the state distributions of subsequent stages, propagating mismatch across the cycle. To address this, we propose a modular structural framework that decomposes the cyclic process into stage-wise sub-problems. While generally applicable, we instantiate this principle as CycleFQI, an extension of fitted Q-iteration enabling theoretical analysis and interpretation. It uses a vector of stage-specific Q-functions, tailored to each stage, to capture within-stage sequences and transitions between stages. This modular design enables partial control, allowing some stages to be optimized while others follow predefined policies. We establish finite-sample suboptimality error bounds and derive global convergence rates under Besov regularity, demonstrating that CycleFQI mitigates the curse of dimensionality compared to monolithic baselines. Additionally, we propose a sieve-based method for asymptotic inference of optimal policy values under a margin condition. Experiments on simulated and real-world Type 1 Diabetes data sets demonstrate CycleFQI's effectiveness.

artificial intelligence, machine learning, provable offline reinforcement learning, (13 more...)

arXiv.org Machine Learning

2602.11679

Country: