Goto

Collaborating Authors

 Asia


Boosted Stochastic Frank-Wolfe for Constrained Nonconvex Optimization

arXiv.org Machine Learning

The boosted Frank-Wolfe algorithm accelerates the classical Frank-Wolfe algorithm by better aligning the update direction with the negative gradient. Its analysis, however, has been limited to deterministic convex problems, with step sizes that require either line search or knowledge of the Lipschitz constant of the gradient. We develop a novel step size strategy that does not depend on the Lipschitz constant of the gradient, which allows us to extend the boosted Frank-Wolfe algorithm to the stochastic setting. We prove that boosting with this step size strategy can be combined with many modern gradient estimators, including SAGA, L-SVRG, SAG, Heavy Ball momentum, and zeroth-order estimators, among others, while retaining the worst-case convergence rates of ordinary stochastic Frank-Wolfe. Our analysis also yields the first convergence rates for boosted Frank-Wolfe on nonconvex and quasar-convex objectives, results which are new even for deterministic problems. Experiments on sparse logistic regression and quantum process tomography show that stochastic boosted Frank-Wolfe achieves faster convergence per gradient oracle call (and on wall-clock) compared to the non-boosted baseline.


Mean-Shift PCA by Knockoff Mean

arXiv.org Machine Learning

Removing noise is difficult, but adding noise is easy. In this work, we show how to eliminate mean-shift noisy components from PCA by deliberately introducing knockoff mean-shift perturbation. Standard PCA is highly sensitive to shifts in the sample mean: a small fraction of samples from a shifted distribution can cause large deviations in the leading principal components. In high-dimensional regimes, existing Robust PCA approaches cannot handle the mean-shift contamination structure inherent in the mixture model. Using tools from Random Matrix Theory, we prove that the mean-shift spikes are spectrally separable from the stable eigenvalues of the original covariance. Furthermore, the original eigenspace remains asymptotically invariant to the contamination, independent of the mixture weight. Exploiting this spectral stability, we propose a simple, two-stage PCA algorithm by adding knockoff mean that identifies and removes the mean-shift component using only standard PCA operations.


Courtroom Analogy: New Perspective on Uncertainty-Aware Classification

arXiv.org Machine Learning

Single-pass uncertainty quantification (UQ) methods for classification represent uncertainty by predicting a tractable distribution over the class probability vector. While existing approaches primarily focus on enhancing the expressiveness of this distribution, they often provide limited insight into how predictive uncertainty is structured and aggregated, resulting in weak interpretability. We introduce the courtroom analogy, which conceptualizes uncertainty-aware classification as a structured debate among class-specific advocates. Each advocate forms a probabilistic opinion, and a final verdict is reached by aggregating these opinions using input-dependent plausibility weights. In this framework, each advocate's opinion is modeled as a Dirichlet distribution whose concentration parameter is decomposed into shared evidence and class-specific advocacy. This yields a structured mixture of Dirichlet distributions with semantically interpretable parameters. To instantiate this formulation, we propose Mixture of Dirichlet EXperts (MoDEX), a single-pass neural architecture that predicts the courtroom parameters, enabling efficient and expressive UQ while explicitly modeling uncertainty aggregation. We demonstrate that MoDEX enjoys strong theoretical properties and achieves state-of-the-art UQ performance across diverse benchmarks, yielding interpretable uncertainty estimates with meaningful semantics.


Stein-Encoder: A White-Box Supervised Encoder via Stein Identities in Multi-Modal Studies

arXiv.org Machine Learning

In multi-modal biomedical research, integrating high-dimensional genomic data with clinical baselines is essential for precision medicine. However, standard deep neural network approaches often entangle these modalities, obscuring the specific predictive impact of genetic features and leading to possibly suboptimal predictive performance. Motivated by the landmark METABRIC cohort primary breast tumors study, we propose the Stein-Encoder, a white-box supervised framework designed to isolate the genetic signal driving clinical outcomes conditional on nuisance covariates. By leveraging Stein's method and residualization techniques, our approach constructs an interpretable single index that summarizes relevant biological heterogeneity while flexibly incorporating clinical factors and can be used to improve downstream prediction. We establish theoretical guarantees for identification, consistency and efficiency improvement. Applied to the METABRIC cohort, the Stein-Encoder outperforms unsupervised benchmarks in predictive accuracy. Crucially, it achieves structural disentanglement by revealing response-specific biological mechanisms: we find that tumor size is driven primarily by mitotic networks, whereas prognostic indices rely on a distinct proliferation-versus-immune axis. This work contributes a unified, computationally efficient framework that bridges statistical rigor with the representational power of neural networks, enabling interpretable, task-specific and efficient compression of multi-modal health data for a wide range of precision medicine applications, beyond biomarker discovery.


On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits

arXiv.org Machine Learning

We study a stochastic multi-armed bandit problem where an agent is granted a free exploration budget before regret accumulates, a setting not captured by the classic regret minimization or pure exploration paradigms. The goal is to design an adaptive policy that strategically explores the bandit instance in the initial free exploration phase and minimizes the cumulative regret in the subsequent phase. We formalize this regret minimization with free exploration problem and identify an interesting regime where the free exploration budget scales logarithmically with the time horizon. To quantify the amount of regret saved with high probability as a result of the availability of the free exploration phase, we introduce a novel set of policies known as $(α,β)$-probably saving policies. We propose a two-phase, probably saving algorithm, UFE-KLUCB-H, which consists of a principled free exploration policy, UFE, and a history-aware regret minimization policy KLUCB-H. Instance-dependent upper bounds on UFE-KLUCB-H are derived, showing that UFE-KLUCB-H accumulates strictly less regret than policies that do not have access to a free exploration phase. Complementarily, we derive instance-dependent lower bounds based on novel multi-instance perturbation arguments tailored to the free-exploration setting, demonstrating the near-optimality of UFE-KLUCB-H for two-valued bandits. Our upper and lower bounds reveal sharp phase transitions in the accumulated regret depending on the amount of available free exploration. Simulations are conducted to demonstrate that forced exploration and adaptivity in the algorithm lead to greater regret savings.


Deployment-complete benchmarking

arXiv.org Machine Learning

Benchmarks increasingly guide deployment, procurement and scientific screening, yet a score supports only the response it records, not necessarily the deployment action. We introduce deployment-complete benchmarking, which tests whether benchmark evidence determines a deployment action. A benchmark is complete for a claim exactly when the action is constant on each evidence fiber; mixed fibers expose missing deployment information, and completion curves quantify the evidence required to resolve ambiguity. In controlled response spaces, benchmark-channel conformal coverage of 94.98% transferred poorly to an unmeasured deployment channel (10.07%), whereas response-rank intervals achieved 94.91% coverage; even zero benchmark error certified only 45.4% of candidates at the largest residual size. Public audits revealed incompleteness, including 97.9% mixed Tox21 fibers and zero median certifiable fraction in main Matbench and JARVIS audits. In held-out replays, certify-then-acquire reduced false decisions from 1.19% to 0.027% in Tox21 and from 20.3% to 0.128% in JARVIS, while changing model choice and identifying deployment-relevant probes. Deployment-ready benchmarks should report evidence, supported actions, ambiguity and completion cost rather than scores alone.


Lincoln Riley claims USC was 'snaps away' from the playoff, says he's a better coach now than when at Oklahoma

FOX News

Notre Dame's Josh Yago delivers Memorial Day salute during anthem before lacrosse championship game Dak Prescott reunites with ex-fiancée Sarah Jane Ramos to celebrate daughter's first birthday Celtics guard Jaylen Brown challenges ESPN's Stephen A Smith to a debate at Harvard or MIT Wyndham Clark adds to his funky resume, TPC Craig Ranch slander and LIV Golf's pitch to new investors Unearthed fan video shows who Kyle Busch really was, NASCAR's darkest hour & Bubba Wallace's'Rowdy' story California mom speaks with compassion but brutal honesty about presence of trans athlete in daughter's sport Curt Cignetti jokes he had to'coach the hell out' of undefeated Hoosiers to be Indy 500 pace car driver A screenshot has WNBA fans asking: did a player endorse a threat toward Caitlin Clark? MLB reporter Tricia Whitaker hit with line drive during Orioles' game Defense expert argues Iran has never been'so isolated' Joey Jones calls out Dem candidate Platner for'hiding behind the Purple Hearts' of fellow vets Trump doesn't want Iran to become his Afghanistan: Mike Sarraille Any Iran deal will be judged by'how much it cost' to secure, ex-CIA station chief says Dr Rebecca Grant: Iran has'no place to go,' will have to sign a deal Pope Leo XIV calls for AI to be'disarmed' in critical warning about emerging tech Kyle Busch's family reveals NASCAR champion died from severe pneumonia that led to sepsis NEW details emerge on suspected White House gunman's prior arrests OutKick-Sports Lincoln Riley claims USC was'snaps away' from the playoff, says he's a better coach now than when at Oklahoma Lincoln Riley joins Colin Cowherd to discuss USC's schedule and their number one recruiting class, and the development of QB Jayden Maiava. Lincoln Riley's tenure as head coach of the USC Trojans hasn't been smooth sailing. When he took over ahead of the 2022 season, expectations were high that a coach with his track record would bring the Trojans back to their heyday. While with the Oklahoma Sooners, he went 55-10 and 33-7 in conference, coached in four New Year's Six games, and won 12 games three consecutive seasons.


Netanyahu says Israel will intensify strikes against Hezbollah

BBC News

The Israeli military says it has begun a wave of strikes across Lebanon following an announcement by Prime Minister Benjamin Netanyahu that his country will intensify its attacks on Hezbollah. The Israel Defense Forces (IDF) said it had launched strikes against Hezbollah sites in the Bekaa Valley in the east of Lebanon and additional areas across the country. It followed a video statement on Monday evening in which Netanyahu said Israel was at war with Hezbollah and that he had given the military instructions to deal them a crushing blow. Earlier this month Lebanon and Israel agreed to extend a 45-day ceasefire, though some fighting has continued. There will be fears in Beirut that these latest Israeli attacks will widen to include Lebanon's capital city.


More than 1.5m foreign pilgrims begin Hajj despite Iran war fears

BBC News

More than 1.5m foreign pilgrims begin Hajj despite Iran war fears Muslims have begun the annual Hajj pilgrimage in Saudi Arabia against the backdrop of a region deeply shaken by the Iran war. Saudi authorities said last week that some 1.51 million pilgrims had arrived from outside the kingdom. That is 11,000 more than last year, despite concerns in the region about a resumption of the three-month-old conflict between the US, Israel and Iran. Before a fragile ceasefire took effect last month, Iran launched waves of missile and drone attacks on Saudi Arabia and its Gulf neighbours in retaliation for US and Israeli air strikes. Two civilians living in the central city of al-Kharj were killed in an Iranian attack on 8 March, along with a US service member stationed at the nearby Prince Sultan Air Base.


Why scammers target veterans and how to fight back

FOX News

This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by LSEG . You have a credit freeze; it still isn't enough Turning 65? Month-by-month plan to protect yourself China's AI growth is about'economic and political leverage,' Rep Hinson says Expert warns'red-green-green alliance' helping China gain AI edge AI's impact on jobs, economy debated as youth express growing fears Jury dismisses Elon Musk's lawsuit against OpenAI and Sam Altman China does not'innovate,' they'replicate': Former DHS spokeswoman Trump to press Xi to'open up' China as tech CEOs join key summit CIA calls COVID whistleblower hearing'political theater' in new statement How scammers use military records, VA data and benefit details to target veterans. Kurt'The CyberGuy' Knutsson breaks down the'perfect storm' for scammers to prey on Americans and how to protect yourself.