

Estimating heterogeneous treatment effects with survival outcomes via a deep survival learner

Sun, Yuming, Kang, Jian, Li, Yi

arXiv.org Machine Learning

Estimating heterogeneous treatment effects in survival settings is complicated by right censoring as well as the time-varying nature of the estimand. While the conditional average treatment effect (CATE) provides a natural target, most existing approaches focus on a single prespecified time point and do not account for the temporal trajectory, leading to instability in estimation. We propose a deep survival learner (DSL) for estimating heterogeneous treatment effects with right-censored outcomes. The method is based on a doubly robust pseudo-outcome whose conditional expectation identifies time-specific CATEs under standard assumptions. This construction remains unbiased if either the outcome model or the treatment assignment model is correctly specified, when properly accounting for censoring. To estimate CATEs over a clinically relevant time spectrum, DSL employs a multi-output deep neural network with shared representations, enabling joint estimation of treatment effect trajectories. From a theoretical perspective, we derive error bounds for both pointwise and joint estimation over time. We show that joint estimation can leverage temporal structure to control estimation error without incurring much additional approximation cost under smoothness conditions, leading to improved stability relative to separate estimation. Cross-fitting is incorporated to reduce overfitting and mitigate bias arising from flexible nuisance estimation. Simulation studies demonstrate favorable finite-sample performance, particularly under nuisance model misspecification. Applied to the Boston Lung Cancer Study, DSL reveals heterogeneity in the effects of perioperative chemotherapy across patient characteristics and over time.
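The double robustness property described above can be illustrated in a much-simplified uncensored setting. The sketch below is not the authors' DSL: the data-generating process, the plug-in nuisance values, and all names are hypothetical, and it shows only the standard AIPW-style pseudo-outcome whose mean recovers the treatment effect when either nuisance model is correct.

```python
import numpy as np

# Synthetic data with a constant true treatment effect of 1 (hypothetical setup).
rng = np.random.default_rng(0)
n = 20000
X = rng.normal(size=n)
e_true = 1.0 / (1.0 + np.exp(-X))          # true propensity score
A = rng.binomial(1, e_true)                 # treatment assignment
mu0, mu1 = X, X + 1.0                       # true outcome regressions
Y = np.where(A == 1, mu1, mu0) + rng.normal(scale=0.5, size=n)

def aipw_pseudo_outcome(Y, A, e_hat, mu0_hat, mu1_hat):
    """Doubly robust (AIPW) pseudo-outcome: its conditional mean given X
    equals the CATE if either e_hat or (mu0_hat, mu1_hat) is correct."""
    mu_a = np.where(A == 1, mu1_hat, mu0_hat)
    w = A / e_hat - (1 - A) / (1 - e_hat)
    return w * (Y - mu_a) + mu1_hat - mu0_hat

# Correct propensity, misspecified outcome model: still unbiased.
psi_a = aipw_pseudo_outcome(Y, A, e_true, np.zeros(n), np.zeros(n))
# Misspecified propensity, correct outcome model: also unbiased.
psi_b = aipw_pseudo_outcome(Y, A, np.full(n, 0.5), mu0, mu1)
print(round(psi_a.mean(), 2), round(psi_b.mean(), 2))  # both near 1.0
```

In the paper's setting the pseudo-outcome additionally corrects for censoring and is indexed by time; regressing such pseudo-outcomes on covariates with the multi-output network then yields the CATE trajectory.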


MuonEq: Balancing Before Orthogonalization with Lightweight Equilibration

Chang, Da, Shi, Qiankun, Zhang, Lvgang, Li, Yu, Zhang, Ruijie, Lu, Yao, Liu, Yongxiang, Yuan, Ganzhao

arXiv.org Machine Learning

Orthogonalized-update optimizers such as Muon improve training of matrix-valued parameters, but existing extensions mostly act either after orthogonalization by rescaling updates or before it with heavier whitening-based preconditioners. We introduce {\method}, a lightweight family of pre-orthogonalization equilibration schemes for Muon in three forms: two-sided row/column normalization (RC), row normalization (R), and column normalization (C). These variants rebalance the momentum matrix before finite-step Newton--Schulz using row/column squared-norm statistics and only $\mathcal{O}(m+n)$ auxiliary state. We show that finite-step orthogonalization is governed by input spectral properties, especially stable rank and condition number, and that row/column normalization is a zeroth-order whitening surrogate that removes marginal scale mismatch. For the hidden matrix weights targeted by {\method}, the row-normalized variant R is the natural default and preserves the $\widetilde{\mathcal{O}}(T^{-1/4})$ stationarity guarantee of Muon-type methods. In LLaMA2 pretraining on C4, the default R variant consistently outperforms Muon on 130M and 350M models, yielding faster convergence and lower validation perplexity.
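The pre-orthogonalization idea can be sketched in a few lines of NumPy. This is not the authors' implementation: the cubic Newton-Schulz variant, step count, and matrix shapes below are assumptions chosen for illustration; it shows only how row equilibration removes row-scale mismatch before a finite-step orthogonalization.

```python
import numpy as np

def row_equilibrate(M, eps=1e-8):
    """'R' variant: rescale each row of the momentum matrix to unit norm,
    a zeroth-order whitening surrogate that removes row-scale mismatch."""
    return M / (np.linalg.norm(M, axis=1, keepdims=True) + eps)

def newton_schulz_orthogonalize(M, steps=20):
    """Finite-step (cubic) Newton-Schulz iteration pushing the singular
    values of M toward 1; it converges faster when M is well-equilibrated."""
    X = M / (np.linalg.norm(M) + 1e-8)   # spectral norm <= Frobenius norm
    for _ in range(steps):
        X = 1.5 * X - 0.5 * X @ X.T @ X
    return X

rng = np.random.default_rng(0)
# A momentum matrix with badly mismatched row scales (hypothetical example).
M = rng.normal(size=(8, 16)) * rng.lognormal(sigma=2.0, size=(8, 1))
O = newton_schulz_orthogonalize(row_equilibrate(M))
print(np.allclose(O @ O.T, np.eye(8), atol=1e-2))  # True: rows orthonormal
```

Note that the equilibration state is only the per-row (or per-column) norms, which is the $\mathcal{O}(m+n)$ auxiliary memory mentioned in the abstract.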


Alleviate Anchor-Shift: Explore Blind Spots with Cross-View Reconstruction for Incomplete Multi-View Clustering

Neural Information Processing Systems

Despite efficiency improvements, existing methods overlook the misguidance in anchor learning induced by partially missing samples; i.e., the absence of samples causes a shift in the learned anchors, which in turn leads to sub-optimal clustering performance.


NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction - Supplementary Material - A Derivation for Computing Opacity $\alpha_i$

Neural Information Processing Systems

Next consider the case where $[t_i, t_{i+1}]$ lies in a range $[t_l, t_r]$ over which the camera ray is exiting the surface, i.e., the signed distance function is increasing on $p(t)$ over $[t_l, t_r]$. Then we have $-(\nabla f(p(t)) \cdot \mathbf{v}) < 0$ in $[t_i, t_{i+1}]$, and thus, according to Eqn. 1, $\rho(t) = 0$. Therefore, by Eqn. 12 of the paper, we have $\alpha_i = 1 - \exp\big(-\int_{t_i}^{t_{i+1}} \rho(t)\,\mathrm{d}t\big) = 0$. Recall that our S-density field $\phi_s(f(\mathbf{x}))$ is defined using the logistic density function $\phi_s(x) = s e^{-sx}/(1+e^{-sx})^2$, which is the derivative of the sigmoid function $\Phi_s(x) = (1+e^{-sx})^{-1}$, i.e., $\phi_s(x) = \Phi'_s(x)$. As a first-order approximation of the signed distance function $f$, suppose that locally the surface is tangentially approximated by a sufficiently small planar patch with its outward unit normal vector denoted as $\mathbf{n}$. Now suppose $p(t^*)$ is a point on the surface $S$, that is, $f(p(t^*)) = 0$. Next we will examine the value of $\frac{\mathrm{d}w}{\mathrm{d}t}(t)$ at $t = t^*$. The signed distance function $f$ is modeled by an MLP that consists of 8 hidden layers with a hidden size of 256.
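The identity $\phi_s(x) = \Phi'_s(x)$ invoked in this derivation can be checked directly by differentiating the sigmoid:

$$
\Phi_s(x) = \frac{1}{1+e^{-sx}}
\quad\Longrightarrow\quad
\Phi'_s(x) = \frac{s\,e^{-sx}}{\left(1+e^{-sx}\right)^{2}} = \phi_s(x).
$$

Since $\phi_s$ attains its maximum $s/4$ at $x = 0$, the induced S-density $\phi_s(f(\mathbf{x}))$ concentrates near the zero level set of $f$, i.e., near the surface.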



7274ed909a312d4d869cc328ad1c5f04-Supplemental-Conference.pdf

Neural Information Processing Systems

Machine-learned models are increasingly entering a wider range of domains in our lives, driving a constantly growing number of important systems. Large-scale systems can be trained in highly parallel and distributed environments, which introduce a large amount of randomness into model training.




2 Framework and assumptions

2.1 Stochastic optimization under time drift

Throughout Sections 2-4, we consider the sequence of stochastic optimization problems min

Neural Information Processing Systems

Our results concisely explain the interplay between the learning rate, the noise variance in the gradient oracle, and the strength of the time drift. The high-probability results merely assume that the gradient noise and time drift have light tails. Moreover, none of the results require the objectives to have bounded domains.


Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime

Neural Information Processing Systems

Despite the extensive application of multi-pass SGD in practice, only a few theoretical techniques have been developed to study its generalization.