Goto

Collaborating Authors

 Asia


Offline-Online Reinforcement Learning for Linear Mixture MDPs

arXiv.org Machine Learning

We study offline-online reinforcement learning in linear mixture Markov decision processes (MDPs) under environment shift. In the offline phase, data are collected by an unknown behavior policy and may come from a mismatched environment, while in the online phase the learner interacts with the target environment. We propose an algorithm that adaptively leverages offline data. When the offline data are informative, either due to sufficient coverage or small environment shift, the algorithm provably improves over purely online learning. When the offline data are uninformative, it safely ignores them and matches the online-only performance. We establish regret upper bounds that explicitly characterize when offline data are beneficial, together with nearly matching lower bounds. Numerical experiments further corroborate our theoretical findings.


An Optimal Sauer Lemma Over $k$-ary Alphabets

arXiv.org Machine Learning

The Sauer-Shelah-Perles Lemma is a cornerstone of combinatorics and learning theory, bounding the size of a binary hypothesis class in terms of its Vapnik-Chervonenkis (VC) dimension. For classes of functions over a $k$-ary alphabet, namely the multiclass setting, the Natarajan dimension has long served as an analogue of VC dimension, yet the corresponding Sauer-type bounds are suboptimal for alphabet sizes $k>2$. In this work, we establish a sharp Sauer inequality for multiclass and list prediction. Our bound is expressed in terms of the Daniely--Shalev-Shwartz (DS) dimension, and more generally with its extension, the list-DS dimension -- the combinatorial parameters that characterize multiclass and list PAC learnability. Our bound is tight for every alphabet size $k$, list size $\ell$, and dimension value, replacing the exponential dependence on $\ell$ in the Natarajan-based bound by the optimal polynomial dependence, and improving the dependence on $k$ as well. Our proof uses the polynomial method. In contrast to the classical VC case, where several direct combinatorial proofs are known, we are not aware of any purely combinatorial proof in the DS setting. This motivates several directions for future research, which are discussed in the paper. As consequences, we obtain improved sample complexity upper bounds for list PAC learning and for uniform convergence of list predictors, sharpening the recent results of Charikar et al.~(STOC~2023), Hanneke et al.~(COLT~2024), and Brukhim et al.~(NeurIPS~2024).


Fine-tuning Factor Augmented Neural Lasso for Heterogeneous Environments

arXiv.org Machine Learning

Fine-tuning is a widely used strategy for adapting pre-trained models to new tasks, yet its methodology and theoretical properties in high-dimensional nonparametric settings with variable selection have not yet been developed. This paper introduces the fine-tuning factor augmented neural Lasso (FAN-Lasso), a transfer learning framework for high-dimensional nonparametric regression with variable selection that simultaneously handles covariate and posterior shifts. We use a low-rank factor structure to manage high-dimensional dependent covariates and propose a novel residual fine-tuning decomposition in which the target function is expressed as a transformation of a frozen source function and other variables to achieve transfer learning and nonparametric variable selection. This augmented feature from the source predictor allows for the transfer of knowledge to the target domain and reduces model complexity there. We derive minimax-optimal excess risk bounds for the fine-tuning FAN-Lasso, characterizing the precise conditions, in terms of relative sample sizes and function complexities, under which fine-tuning yields statistical acceleration over single-task learning. The proposed framework also provides a theoretical perspective on parameter-efficient fine-tuning methods. Extensive numerical experiments across diverse covariate- and posterior-shift scenarios demonstrate that the fine-tuning FAN-Lasso consistently outperforms standard baselines and achieves near-oracle performance even under severe target sample size constraints, empirically validating the derived rates.


Information-Geometric Decomposition of Generalization Error in Unsupervised Learning

arXiv.org Machine Learning

We decompose the Kullback--Leibler generalization error (GE) -- the expected KL divergence from the data distribution to the trained model -- of unsupervised learning into three non-negative components: model error, data bias, and variance. The decomposition is exact for any e-flat model class and follows from two identities of information geometry: the generalized Pythagorean theorem and a dual e-mixture variance identity. As an analytically tractable demonstration, we apply the framework to $ε$-PCA, a regularized principal component analysis in which the empirical covariance is truncated at rank $N_K$ and discarded directions are pinned at a fixed noise floor $ε$. Although rank-constrained $ε$-PCA is not itself e-flat, it admits a technical reformulation with the same total GE on isotropic Gaussian data, under which each component of the decomposition takes closed form. The optimal rank emerges as the cutoff $λ_{\mathrm{cut}}^{*} = ε$ -- the model retains exactly those empirical eigenvalues exceeding the noise floor -- with the cutoff reflecting a marginal-rate balance between model-error gain and data-bias cost. A boundary comparison further yields a three-regime phase diagram -- retain-all, interior, and collapse -- separated by the lower Marchenko--Pastur edge and an analytically computable collapse threshold $ε_{*}(α)$, where $α$ is the dimension-to-sample-size ratio. All claims are verified numerically.


Adaptive Budget Allocation in LLM-Augmented Surveys

arXiv.org Machine Learning

Large language models (LLMs) can generate survey responses at low cost, but their reliability varies substantially across questions and is unknown before data collection. Deploying LLMs in surveys still requires costly human responses for verification and correction. How should a limited human-labeling budget be allocated across questions in real time? We propose an adaptive allocation algorithm that learns which questions are hardest for the LLM while simultaneously collecting human responses. Each human label serves a dual role: it improves the estimate for that question and reveals how well the LLM predicts human responses on it. The algorithm directs more budget to questions where the LLM is least reliable, without requiring any prior knowledge of question-level LLM accuracy. We prove that the allocation gap relative to the best possible allocation vanishes as the budget grows, and validate the approach on both synthetic data and a real survey dataset with 68 questions and over 2000 respondents. On real survey data, the standard practice of allocating human labels uniformly across questions wastes 10--12% of the budget relative to the optimal; our algorithm reduces this waste to 2--6%, and the advantage grows as questions become more heterogeneous in LLM prediction quality. The algorithm achieves the same estimation quality as traditional uniform sampling with fewer human samples, requires no pilot study, and is backed by formal performance guarantees validated on real survey data. More broadly, the framework applies whenever scarce human oversight must be allocated across tasks where LLM reliability is unknown.


New spider named for Pink Floyd devours bugs 6x its size

Popular Science

Maybe the tiny hunter should've been named after Metallica? More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Breakthroughs, discoveries, and DIY tips sent six days a week. We can call this newly discovered spider another brick--or web--in the wall. Scientists in Colombia named the new species in honor of English rock band Pink Floyd and the arachnid's preferred habitat--walls.


Meteorologists predict a fairly chill 2026 Atlantic hurricane season

Popular Science

More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Hurricane Edouard as seen from the International Space Station in 2014. Breakthroughs, discoveries, and DIY tips sent six days a week. More signs indicate the 2026 Atlantic hurricane season could ultimately be a welcome reprieve from more recent devastating storms . As La Niña transitions into a stronger El Niño climate pattern later this summer, the United States may experience a below-average number of hurricanes.


Candy now tastes different. It's not just you.

Popular Science

From recipe changes to aging taste buds, here's why those peanut butter cups don't hit like they used to. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. There's a reason you might remember Hershey's chocolate differently. Breakthroughs, discoveries, and DIY tips sent six days a week. Brad Reese, grandson of Reese's Peanut Butter Cups inventor H.B. Reese, caused a stir this year with his claims that The Hershey Company had changed his grandfather's recipes beyond recognition .


Co-AI-chella! AI influencers cash in on the California music festival - as experts predict the staggering amount the people behind them could be making

Daily Mail - Science & tech

Vance grounded at White House as Iran peace talks in turmoil and Trump declares: 'I expect to be bombing' New'Hollywood dose' pill: A-listers hooked on'youth elixir' that dermatologists say is anti-ageing, shrinks pores, smooths wrinkles... and even banishes rosacea Days after we got engaged, the love of my life told me he'd killed a man and buried him in a bog. I reported him to police... but then I made this irreversible mistake Ark of the Covenant's final resting place pinpointed by archaeologists as fresh search begins Ritzy Bay Area town torn apart after teacher's daughter, 16, crashed car while speeding and killed four friends... then posted a TikTok video that poured fuel on the flames Jordon Hudson extends her control over Bill Belichick's empire with secret move that is set to leave his family and friends furious Life-threatening cantaloupe recall in four states upgraded to FDA's highest risk level... 'reasonable probability of death' AMANDA PLATELL: Why Sarah Ferguson - with the ghost of Princess Diana at her side - is ready to sensationally blow up the Royal Family. She knows ALL their secrets... Trump confronts Xi as US forces seize Chinese ship carrying mysterious'gift' to Iran Sydney Sweeney's role is cut from The Devil Wears Prada 2 Humiliating moment runner celebrates winning marathon... only to be pipped at the line by rival in brutal finish Patriots coach Mike Vrabel reveals'difficult conversations' with his wife as he speaks out for the first time since Dianna Russini photo scandal Nancy Mace fires back after accused sexual extorter Cory Mills tries to expel her from Congress: 'Bring it on' How to lose weight when perimenopause sabotages your metabolism: I'm a trainer but when I hit 46, I piled on the pounds overnight. The new'posh' drug that's easier to order than Uber Eats - and why all my middle-class friends have ditched booze and cocaine for it: JANA HOCKING Grieving mother says she went to LA school every day to complain daughter was being bullied... then tragedy struck when the lead tormentor, 12, hurled metal water bottle at victim's head Autistic woman, 24, worked hard to build independent life for herself... now she's PARALYZED thanks to selfishness of stranger READ MORE: Conjoined twin'influencers' are revealed to be AI AI influencers are cashing in on Coachella, with some set to make tens of thousands from their content. Among the celebrities and content creators, you may have noticed a surge in festival content from digital influencers.


That's one way to avoid boring meetings! Mark Zuckerberg is building an AI CLONE to replace him, report claims

Daily Mail - Science & tech

Ritzy Bay Area town torn apart after teacher's daughter, 16, was behind wheel when four friends died in high-speed crash... then she posted a TikTok video that poured fuel on the flames Two CIA officers killed in Mexico when their car skidded off ravine and exploded after meeting about bust of'largest ever drug lab' Insiders claim failed AI rollout could be to blame for Tim Cook's departure from Apple - as one says'the AI era requires a different kind of leadership' Trump confronts Xi as US forces seize Chinese ship carrying mysterious'gift' to Iran New'Hollywood dose' pill: A-listers hooked on'youth elixir' that dermatologists say is anti-ageing, shrinks pores, smooths wrinkles... and even banishes rosacea Days after we got engaged, the love of my life told me he'd killed a man and buried him in a bog. I reported him to police... but then I made this irreversible mistake Life-threatening cantaloupe recall in four states upgraded to FDA's highest risk level... 'reasonable probability of death' Fury as murderer marries pen pal behind bars... as teenage victim's mom says: 'I'm serving a life sentence without my son' Kate and William join Charles and Camilla in celebrating British centenarians at Buckingham Palace as Royal Family marks the late Queen's 100th birthday US troops board second tanker as Trump accuses Iran of violating ceasefire'numerous times' - Live updates AMANDA PLATELL: Why Sarah Ferguson - with the ghost of Princess Diana at her side - is ready to sensationally blow up the Royal Family. She knows ALL their secrets... Team USA Olympics star Noah Lyles slammed for'horrible' reaction to his wife's wedding dress reveal How to lose weight when perimenopause sabotages your metabolism: I'm a trainer but when I hit 46, I piled on the pounds overnight. I was losing hair so fast a bald spot the size of an orange appeared. I owe my life to a $1 at-home treatment that REVERSED the damage in a month.