Goto

Collaborating Authors

 Asia


Deep learning-powered biochip to detect genetic markers

AIHub

A team of scientists from Nanyang Technological University Singapore has developed a new biochip that, when paired with computer vision, can detect quickly and accurately extremely small amounts of microRNAs, which are tiny genetic markers linked to diseases such as heart disease. Published in the scientific journal, the new biosensing platform combines a specially designed nanophotonic chip with AI-automated image analysis. With a tiny drop of blood loaded into the chip, it can rapidly detect multiple microRNA biomarkers. With its integrated AI imaging function, thousands of microRNA signals can be imaged and analysed in a single snapshot. Compared with the current gold standard of detecting microRNA - PCR (polymerase chain reaction) detects tiny amounts of genetic material by copying them many times - the new device can cut detection time from hours to 20 minutes. MicroRNAs are short RNA molecules that help regulate genes that work in the body.


SoftBank profit jumps, emboldens Son to bet more on OpenAI

The Japan Times

SoftBank Group has reported a surge in quarterly profit due to valuation gains on its OpenAI investment, boosting confidence at the Japanese company to bet even more on the ChatGPT-maker. The gains on OpenAI outweighed lackluster investment gains elsewhere in the Tokyo-based technology group's portfolio while war in the Middle East roiled markets. That points to growing reliance on the U.S. startup, which faces rising competition from Anthropic and Google and is reportedly trailing its highest internal targets. SoftBank earned a net income of ยฅ1.83 trillion ($11.6 billion) in its fiscal fourth quarter, compared with the average analyst estimate of ยฅ295.2 billion. The profit could be attributed entirely to its booking $25 billion in valuation gains on OpenAI in the quarter, according to Bloomberg Intelligence analyst Kirk Boodry. In a time of both misinformation and too much information, quality journalism is more crucial than ever.


Trump and Xi to meet in Beijing: The key issues shaping the China summit

Al Jazeera

United States President Donald Trump has departed for Beijing ahead of a high-stakes summit with Chinese President Xi Jinping, after weeks of unsuccessful US efforts to persuade China to help bring Iran back to negotiations and ease tensions around the Strait of Hormuz. The leaders of the world's two largest economies are due to meet on Thursday and Friday during Trump's first visit to China since 2017, with talks expected to focus on trade, Taiwan, artificial intelligence and the war involving Iran. Why does the Trump-Xi summit matter? The Trump-Xi summit is a high-level meeting between Trump and Xi Jinping taking place in Beijing as the world's two largest economies face growing tensions over trade, technology, Taiwan and the Iran war. The summit is particularly significant because Trump will be the first US leader to visit China in nearly a decade, while the talks also come at a time of heightened geopolitical and economic uncertainty.


US-China head-to-head: Explained in 11 maps and charts

Al Jazeera

US President Donald Trump will meet Chinese President Xi Jinping in Beijing on May 14 and 15, following weeks of delays due to the US-Israel war on Iran. The talks are expected to focus on trade relations and mark the first time a US president has visited China in nearly a decade. In recent decades, the US and China have emerged as the world's dominant superpowers, frequently seen as locked in a contest for who sits atop the world order. A quarter of a century ago, by contrast, the US dwarfed China in most major indicators, but today, Beijing is regarded as the factory of the world and is outpacing its Western counterpart in many regards. Who is the world's top trading power?


Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks

arXiv.org Machine Learning

Batch normalization (BN) is central to modern deep networks, but its effect on the realized function during training remains less understood than its optimization benefits. We study training-time BN in continuous piecewise-affine (CPA) networks through the geometry of switching hyperplanes and the induced affine-region partition. Conditioned on a mini-batch, we show that BN defines for each neuron a reference hyperplane through the batch centroid, and that breakpoint-switching hyperplanes are parallel translates whose offsets are expressed in batch-standardized coordinates and are independent of the raw bias. This yields an exact criterion for when a switching hyperplane intersects a local $\ell_\infty$ window and motivates a local region-density functional based on exact affine-region counts. Under explicit sufficient conditions, we show that BN increases expected local partition refinement in ReLU and more general piecewise-affine networks, and that this mechanism transfers locally through depth inside parent affine regions where the upstream representation map is an affine embedding. These results provide a function-level geometric account of training-time BN as a batch-conditional recentering mechanism near the data.


Exact Stiefel Optimization for Probabilistic PLS: Closed-Form Updates, Error Bounds, and Calibrated Uncertainty

arXiv.org Machine Learning

Probabilistic partial least squares (PPLS) is a central likelihood-based model for two-view learning when one needs both interpretable latent factors and calibrated uncertainty. Building on the identifiable parameterization of Bouhaddani et al.\ (2018), existing fitting pipelines still face two practical bottlenecks: noise--signal coupling under joint EM/ECM updates and nontrivial handling of orthogonality constraints. Following the fixed-noise scalar-likelihood line of Hu et al.\ (2025), we develop an end-to-end framework that combines noise pre-estimation, constrained likelihood optimization, and prediction calibration in one pipeline. Relative to Hu et al.\ (2025), we replace full-spectrum noise averaging with noise-subspace estimation and replace interior-point penalty handling with exact Stiefel-manifold optimization. The noise-subspace estimator attains a signal-strength-independent leading finite-sample rate and matches a minimax lower bound, while the full-spectrum estimator is shown to be inconsistent under the same model. We further extend the framework to sub-Gaussian settings via optional Gaussianization and provide closed-form standard errors through a block-structured Fisher analysis. Across synthetic high-noise settings and two multi-omics benchmarks (TCGA-BRCA and PBMC CITE-seq), the method achieves near-nominal coverage without post-hoc recalibration, reaches Ridge-level point accuracy on TCGA-BRCA at rank $r=3$, matches or exceeds PO2PLS on cross-view prediction while providing native calibrated uncertainty, and improves stability of parameter recovery.


Learning U-Statistics with Active Inference

arXiv.org Machine Learning

$U$-statistics play a central role in statistical inference. In many modern applications, however, acquiring the labels required for $U$-statistics is costly. Motivated by recent advances in active inference, we develop an active inference framework for $U$-statistics that selectively queries informative labels to improve estimation efficiency under a fixed labeling budget, while preserving valid statistical inference. Our approach is built on the augmented inverse probability weighting $U$-statistic, which is designed to incorporate the sampling rule and machine learning predictions. We characterize the optimal sampling rule that minimizes its variance and design practical sampling strategies. We further extend the framework to $U$-statistic-based empirical risk minimization. Experiments on real datasets demonstrate substantial gains in estimation efficiency over baseline methods, while maintaining target coverage.


Approximation Theory of Laplacian-Based Neural Operators for Reaction-Diffusion System

arXiv.org Machine Learning

Neural operators provide a framework for learning solution operators of partial differential equations (PDEs), enabling efficient surrogate modeling for complex systems. While universal approximation results are now well understood, approximation analysis specific to nonlinear reaction-diffusion systems remains limited. In this paper, we study neural operators applied to the solution mapping from initial conditions to time-dependent solutions of a generalized Gierer-Meinhardt reaction-diffusion system, a prototypical model of nonlinear pattern formation. Our main results establish explicit approximation error bounds in terms of network depth, width, and spectral rank by exploiting the Laplacian spectral representation of the Green's function underlying the PDE. We show that the required parameter complexity grows at most polynomially with respect to the target accuracy, demonstrating that Laplacian eigenfunction-based neural operator architectures alleviate the curse of parametric complexity encountered in generic operator learning. Numerical experiments on the Gierer-Meinhardt system support the theoretical findings.


Information-Theoretic Generalization Bounds for Sequential Decision Making

arXiv.org Machine Learning

Information-theoretic generalization bounds based on the supersample construction are a central tool for algorithm-dependent generalization analysis in the batch i.i.d.~setting. However, existing supersample conditional mutual information (CMI) bounds do not directly apply to sequential decision-making problems such as online learning, streaming active learning, and bandits, where data are revealed adaptively and the learner evolves along a causal trajectory. To address this limitation, we develop a sequential supersample framework that separates the learner filtration from a proof-side enlargement used for ghost-coordinate comparisons. Under a row-wise exchangeability assumption, the sequential generalization gap is controlled by sequential CMI, a sum of roundwise selector--loss information terms. We also establish a Bernstein-type refinement that yields faster rates under suitable variance conditions. The selector-SCMI proof strategy applies to online learning, streaming active learning with importance weighting, and stochastic multi-armed bandits.


Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation

arXiv.org Machine Learning

We introduce Pion, a spectrum-preserving optimizer for large language model (LLM) training based on orthogonal equivalence transformation. Unlike additive optimizers such as Adam and Muon, Pion updates each weight matrix through left and right orthogonal transformations, preserving its singular values throughout training. This yields an optimization mechanism that modulates the geometry of weight matrices while keeping their spectral norm fixed. We derive the Pion update rule, systematically examine its design choices, and analyze its convergence behavior along with several key properties. Empirical results show that Pion offers a stable and competitive alternative to standard optimizers for both LLM pretraining and finetuning.