AITopics | Education

Attack-Resistant Uniform Fairness for Linear and Smooth Contextual Bandits

arXiv.org Machine Learning, Feb-5-2026

Modern systems, such as digital platforms and service systems, increasingly rely on contextual bandits for online decision-making; however, their deployment can inadvertently create unfair exposure among arms, undermining long-term platform sustainability and supplier trust. This paper studies the contextual bandit problem under a uniform $(1-δ)$-fairness constraint, and addresses its unique vulnerabilities to strategic manipulation. The fairness constraint ensures that preferential treatment is strictly justified by an arm's actual reward across all contexts and time horizons, using uniformity to prevent statistical loopholes. We develop novel algorithms that achieve (nearly) minimax-optimal regret for both linear and smooth reward functions, while maintaining strong $(1-\tilde{O}(1/T))$-fairness guarantees, and further characterize the theoretically inherent yet asymptotically marginal "price of fairness". However, we reveal that such merit-based fairness becomes uniquely susceptible to signal manipulation. We show that an adversary with a minimal $\tilde{O}(1)$ budget can not only degrade overall performance as in traditional attacks, but also selectively induce insidious fairness-specific failures while leaving conspicuous regret measures largely unaffected. To counter this, we design robust variants incorporating corruption-adaptive exploration and error-compensated thresholding. Our approach yields the first minimax-optimal regret bounds under $C$-budgeted attack while preserving $(1-\tilde{O}(1/T))$-fairness. Numerical experiments and a real-world case demonstrate that our algorithms sustain both fairness and efficiency.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Machine Learning

2602.04125

Country:

Asia > Singapore (0.40)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Information Technology (0.46)
Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.67)

Theory of Optimal Learning Rate Schedules and Scaling Laws for a Random Feature Model

Bordelon, Blake, Mori, Francesco

arXiv.org Machine Learning, Feb-5-2026

Setting the learning rate for a deep learning model is a critical part of successful training, yet choosing this hyperparameter is often done empirically with trial and error. In this work, we explore a solvable model of optimal learning rate schedules for a powerlaw random feature model trained with stochastic gradient descent (SGD). We consider the optimal schedule $η_T^\star(t)$ where $t$ is the current iterate and $T$ is the total training horizon. This schedule is computed both numerically and analytically (when possible) using optimal control methods. Our analysis reveals two regimes which we term the easy phase and hard phase. In the easy phase the optimal schedule is a polynomial decay $η_T^\star(t) \simeq T^{-ξ} (1-t/T)^δ$ where $ξ$ and $δ$ depend on the properties of the features and task. In the hard phase, the optimal schedule resembles warmup-stable-decay with constant (in $T$) initial learning rate and annealing performed over a vanishing (in $T$) fraction of training steps. We investigate joint optimization of learning rate and batch size, identifying a degenerate optimality condition. Our model also predicts the compute-optimal scaling laws (where model size and training steps are chosen optimally) in both easy and hard regimes. Going beyond SGD, we consider optimal schedules for the momentum $β(t)$, where speedups in the hard phase are possible. We compare our optimal schedule to various benchmarks in our task including (1) optimal constant learning rates $η_T(t) \sim T^{-ξ}$ (2) optimal power laws $η_T(t) \sim T^{-ξ} t^{-χ}$, finding that our schedule achieves better rates than either of these. Our theory suggests that learning rate transfer across training horizon depends on the structure of the model and task. We explore these ideas in simple experimental pretraining setups.
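The three schedule families named in the abstract can be compared side by side. The following is a minimal sketch, not the authors' code: it sets all omitted prefactors to 1, and the function names and the values chosen for $ξ$, $δ$, and $χ$ are illustrative assumptions rather than values from the paper.

```python
def polynomial_decay(t, T, xi, delta):
    """Easy-phase form from the abstract: T^(-xi) * (1 - t/T)^delta (prefactor assumed 1)."""
    return T ** (-xi) * (1.0 - t / T) ** delta


def constant_schedule(t, T, xi):
    """Benchmark (1): a constant learning rate that scales as T^(-xi)."""
    return T ** (-xi)


def power_law_schedule(t, T, xi, chi):
    """Benchmark (2): T^(-xi) * t^(-chi); t must be >= 1 to avoid dividing by zero."""
    return T ** (-xi) * t ** (-chi)


if __name__ == "__main__":
    T = 1000
    xi, delta, chi = 0.5, 1.0, 0.25  # hypothetical exponents, for illustration only
    for t in (1, T // 2, T - 1):
        print(
            t,
            polynomial_decay(t, T, xi, delta),
            constant_schedule(t, T, xi),
            power_law_schedule(t, T, xi, chi),
        )
```

With any positive `delta`, the polynomial schedule starts at `T ** (-xi)` and anneals to zero exactly at `t = T`, whereas the power-law benchmark decays fastest at the start of training and never reaches zero.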

artificial intelligence, machine learning, optimal learning rate schedule, (11 more...)

arXiv.org Machine Learning

2602.04774

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.50)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

Why Are Some Women Training for Pregnancy Like It's a Marathon?

WIRED, Feb-4-2026, 11:00:00 GMT

A growing legion of "zero trimester" influencers are convincing followers that healthy pregnancies are a choice--and that raw milk, watching sunsets, and pricey specialized courses can help. Three years ago, Esther Rohr and her husband decided to start thinking about pregnancy. The 26-year-old Oregon-based wedding photographer made small but intentional lifestyle changes--going to bed earlier, drinking more water and less alcohol, dialing in her fitness, loading up on protein, and taking supplements like beef organ capsules and Vitamin D3. They started charging their phones in the kitchen for better sleep and unplugging their Wi-Fi at night, because her research suggested it might affect cellular health. Concerned about their exposure to reproductive toxins, Rohr began the slow, painstaking task of swapping out all their synthetic workout clothes, nonstick pans, and scented personal care products that might contain phthalates or other endocrine-disrupting chemicals. She bought an air purifier and hopes to eventually replace their LED bulbs with incandescents, because she worries they might be affecting her circadian rhythm.

artificial intelligence, pregnancy, social media, (17 more...)

WIRED

Country: North America > United States > Oregon (0.24)

Genre:

Research Report (0.68)
Instructional Material > Course Syllabus & Notes (0.34)

Industry:

Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)
Health & Medicine > Therapeutic Area > Endocrinology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.71)

Preference-based Conditional Treatment Effects and Policy Learning

Parnas, Dovid, Even, Mathieu, Josse, Julie, Shalit, Uri

arXiv.org Machine Learning, Feb-4-2026

We introduce a new preference-based framework for conditional treatment effect estimation and policy learning, built on the Conditional Preference-based Treatment Effect (CPTE). CPTE requires only that outcomes be ranked under a preference rule, unlocking flexible modeling of heterogeneous effects with multivariate, ordinal, or preference-driven outcomes. This unifies applications such as conditional probability of necessity and sufficiency, conditional Win Ratio, and Generalized Pairwise Comparisons. Despite the intrinsic non-identifiability of comparison-based estimands, CPTE provides interpretable targets and delivers new identifiability conditions for previously unidentifiable estimands. We present estimation strategies via matching, quantile, and distributional regression, and further design efficient influence-function estimators to correct plug-in bias and maximize policy value. Synthetic and semi-synthetic experiments demonstrate clear performance gains and practical impact.

artificial intelligence, machine learning, potential outcome, (16 more...)

arXiv.org Machine Learning

2602.03823

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Tennessee (0.04)
Europe > France > Occitanie > Hérault > Montpellier (0.04)
(2 more...)

Genre:

Research Report > Strength High (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (1.00)
Education (1.00)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Learning Better Certified Models from Empirically-Robust Teachers

De Palma, Alessandro

arXiv.org Machine Learning, Feb-4-2026

Adversarial training attains strong empirical robustness to specific adversarial attacks by training on concrete adversarial perturbations, but it produces neural networks that are not amenable to strong robustness certificates through neural network verification. On the other hand, earlier certified training schemes directly train on bounds from network relaxations to obtain models that are certifiably robust, but display sub-par standard performance. Recent work has shown that state-of-the-art trade-offs between certified robustness and standard performance can be obtained through a family of losses combining adversarial outputs and neural network bounds. Nevertheless, differently from empirical robustness, verifiability still comes at a significant cost in standard performance. In this work, we propose to leverage empirically-robust teachers to improve the performance of certifiably-robust models through knowledge distillation. Using a versatile feature-space distillation objective, we show that distillation from adversarially-trained teachers consistently improves on the state-of-the-art in certified training for ReLU networks across a series of robust computer vision benchmarks.

artificial intelligence, machine learning, robustness, (16 more...)

arXiv.org Machine Learning

2602.02626

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Genre: Research Report (0.63)

Industry:

Education (0.68)
Information Technology (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Universal One-third Time Scaling in Learning Peaked Distributions

Liu, Yizhou, Liu, Ziming, Pehlevan, Cengiz, Gore, Jeff

arXiv.org Machine Learning, Feb-4-2026

Training large language models (LLMs) is computationally expensive, partly because the loss exhibits slow power-law convergence whose origin remains debatable. Through systematic analysis of toy models and empirical evaluation of LLMs, we show that this behavior can arise intrinsically from the use of softmax and cross-entropy. When learning peaked probability distributions, e.g., next-token distributions, these components yield power-law vanishing losses and gradients, creating a fundamental optimization bottleneck. This ultimately leads to power-law time scaling of the loss with a universal exponent of $1/3$. Our results provide a mechanistic explanation for observed neural scaling and suggest new directions for improving LLM training efficiency.
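The vanishing loss and gradient mentioned in the abstract can be seen in a toy calculation. This is a minimal sketch under stated assumptions, not the paper's experiment: the target is a one-hot (fully peaked) distribution over four classes, the helper names are hypothetical, and the code only shows that the cross-entropy loss and its gradient with respect to the logits shrink together as the correct logit pulls ahead. It does not reproduce the $1/3$ exponent.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy_and_grad(logits, target):
    """Cross-entropy against a one-hot target, plus its gradient w.r.t. the logits."""
    probs = softmax(logits)
    loss = -math.log(probs[target])
    # For softmax + cross-entropy the gradient is simply (probs - one_hot).
    grad = [p - (1.0 if i == target else 0.0) for i, p in enumerate(probs)]
    return loss, grad


def grad_norm(grad):
    """Euclidean norm of the gradient vector."""
    return math.sqrt(sum(g * g for g in grad))


if __name__ == "__main__":
    # Widen the gap between the correct logit and the rest (illustrative values).
    for gap in (1.0, 4.0, 8.0):
        loss, grad = cross_entropy_and_grad([gap, 0.0, 0.0, 0.0], target=0)
        print(f"gap={gap:.1f}  loss={loss:.6f}  |grad|={grad_norm(grad):.6f}")
```

As the prediction sharpens toward the target, the training signal weakens at the same rate as the loss, which is the kind of bottleneck the abstract describes.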

large language model, machine learning, natural language, (13 more...)

arXiv.org Machine Learning

2602.03685

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

An 'Intimacy Crisis' Is Driving the Dating Divide

WIRED, Feb-3-2026, 21:46:53 GMT

In his book, sex and relationships researcher Justin Garcia says people have miscalculated their need for human intimacy, which is the real issue at the root of the loneliness epidemic. In the US, nearly half of adults are single. A quarter of men suffer from loneliness. Rates of depression are on the rise. And one in four Gen Z adults--the so-called kinkiest generation, according to one study--have never had partnered sex. In an age of endless connection, where hooking up happens with the ease of a swipe and nontraditional relationship structures like polyamory are celebrated, why are people seemingly so disconnected and alone?

artificial intelligence, sexual literacy, social media, (16 more...)

WIRED

Country:

North America > United States > Minnesota (0.14)
North America > United States > California (0.14)

Genre: Research Report (0.94)

Industry:

Law (0.94)
Education (0.70)
Leisure & Entertainment > Sports (0.69)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

First-grade teacher flips American flag upside down in San Diego classroom, sparks investigation

FOX News, Feb-3-2026, 14:56:57 GMT

A San Diego elementary school teacher faces scrutiny after video shows her turning an American flag upside down inside her first-grade classroom.

artificial intelligence, lifestyle real estate tech science, social media, (9 more...)

FOX News

Country: North America > United States > California > San Diego County > San Diego (0.62)

Industry:

Leisure & Entertainment > Sports (1.00)
Health & Medicine > Therapeutic Area (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Education > Educational Setting > K-12 Education > Primary School (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.96)

JONATHAN TURLEY: When elites cheer the mob, history warns that revolutions devour their own

FOX News, Feb-3-2026, 14:00:05 GMT

The American Revolution created a lasting democracy while the French Revolution became blood-soaked tyranny. But today's armchair revolutionaries echo similar calls.

artificial intelligence, revolution, social media, (12 more...)

FOX News

Country: North America > United States > Minnesota (0.15)

Industry:

Media > News (1.00)
Leisure & Entertainment > Sports (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence (0.95)
Information Technology > Communications > Social Media (0.72)

Education advocates praise Texas A&M decision to wind down Women's and Gender Studies certificate

FOX News, Feb-3-2026, 13:00:31 GMT

Texas A&M eliminates Women's and Gender Studies certificate program after reviewing 5,400 course syllabi, canceling six courses representing 0.11% of total offerings.

artificial intelligence, social media, university, (8 more...)

FOX News

Country: North America > United States > Texas (0.91)

Genre: Instructional Material > Course Syllabus & Notes (0.71)

Industry:

Leisure & Entertainment > Sports (1.00)
Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence (0.96)
Information Technology > Communications > Social Media (0.73)
