Promising Solution
Nvidia's N1X could be the jolt Windows laptops need -- with one big catch
PCWorld reports that Nvidia's rumored N1X chip could revolutionize Windows laptops with a 20-core CPU, Blackwell GPU, and impressive AI performance potentially rivaling Qualcomm's Snapdragon X2 Elite. The N1X represents Nvidia's entry into laptop processors, promising better battery life and AI capabilities as laptop costs soar and consumers seek affordable alternatives. However, gaming performance may suffer due to x86 emulation challenges that plague all Arm-based processors, limiting the chip's appeal for gamers. Nvidia is evidently not content to be the world's most valuable company, as the AI and GPU giant now appears primed to dive headfirst into the choppy waters of the laptop processor market. Whether that will help or hurt its fortunes remains to be seen, as the Internet has been aflame this month with rumors that Nvidia will unveil a new "N1X" chip this week at Computex alongside a weaker N1 chip - and the word is both will be SoC (system-on-chip) silicon aimed at Windows laptops. That could be a big deal for anyone who wants to buy a laptop in the next few years, because everything I've heard about the N1X suggests it's optimized for AI performance, battery life, and perhaps even gaming. If Nvidia's efforts to partner with companies like MediaTek and Intel has produced a capable CPU married to a svelte Nvidia GPU on a single chip, utilizing Nvidia's expertise in building high-performance systems for AI and enterprise use, that's potentially a game-changer for the laptop market - and a big challenge to AMD, Apple, and Qualcomm's flagship laptop chips.
CB-SLICE: Concept-Based Interpretable Error Slice Discovery
Konforti, Yael, Zarlenga, Mateo Espinosa, Almahmoud, Elaf, Jamnik, Mateja
Despite strong average-case performance, deep learning models often exhibit systematic errors on specific population groups, known as error slices. Identifying these groups and the root causes of their failures is critical for model debugging and bias mitigation. However, existing error Slice Discovery Methods (SDMs) typically generate explanations disconnected from the model's inference process, thus only approximating the underlying error source and may be inaccurate. We address this limitation by leveraging Concept Bottleneck Models (CBMs), whose predictions are directly dependent on human-understandable semantic concepts. Since downstream task failures in CBMs commonly arise from concept mispredictions, concept representations provide a strong candidate for error slice identification, offering fine-grained explanations directly linked to the error source. Building on this insight, we introduce CB-SLICE, a concept-based SDM that groups samples with shared concept prediction failures and identifies the keyword concepts most responsible for each slice's failure mode. Across multiple benchmarks, we show that CB-SLICE outperforms state-of-the-art methods in uncovering well-known biases while providing richer and more faithful explanations of model errors.
Amazon Thinks the Future of Data Centers Depends on a Technical Problem It Just Solved
The tech giant says a breakthrough in data-center networking has dramatically accelerated the flow of information through its massive cloud infrastructure. Amazon says it recently achieved a major breakthrough in networking design--and has been quietly deploying the new technology in its data centers since late last year. The company claims it has significantly increased data speeds while reducing energy use, potentially giving the tech giant an edge as companies race to build ever-faster systems in the cloud. The new technology hinges on a "quasi-random" design that combines elements of traditional, structured data networks with the performance advantages of more random architectures. Researchers have explored random networks for decades, but the technology has never been successfully scaled.
Constrained Bayesian Experimental Design via Online Planning
Guo, Yujia, Huang, Daolang, Zhang, Xinyu, Katt, Sammie, Kaski, Samuel, Bharti, Ayush
Bayesian experimental design (BED) is a principled framework for data-efficient design of sequential experiments. However, existing BED methods are unable to adapt to dynamic constraints inherent in real-world tasks due to budget limitations, varying costs, or physical constraints that restrict how designs evolve over time. In this paper, we introduce a novel approach to BED that enables constrained optimization of experimental designs by combining offline pre-training of an amortized policy and a posterior network with online multi-step lookahead planning using scenario trees. We empirically demonstrate that our method yields substantially more informative design sequences than existing methods across a range of constrained BED tasks, while incurring only a modest additional computational overhead.
Smooth Piecewise Cutting for Neural Operator to Handle Discontinuities and Sharp Transitions
Dang, Ha, Schmidt, Sebastian, Hesser, Juergen
Neural operators have achieved strong performance in learning solution operators of partial differential equations (PDEs), but their inherently continuous representations struggle to capture discontinuities and sharp transitions. Existing approaches typically approximate such features within continuous function spaces, often requiring increased model capacity and high-resolution data. In this work, we propose Cut-DeepONet, a two-stage training framework that explicitly models discontinuities while reducing learning complexity. Our approach reformulates the problem via a lifting strategy, partitioning the domain into smooth subregions while representing discontinuities as boundaries in a higher-dimensional space. This separation aligns the operator learning task with the inductive bias of neural networks and avoids directly approximating discontinuities. An additional network predicts input-dependent discontinuity locations for unseen inputs, which are then used to guide the neural operator in generating smooth components within each region. Experiments on benchmark PDEs show that Cut-DeepONet outperforms state-of-the-art methods, even when trained on low-resolution datasets. The method excels on problems with discontinuities and sharp transitions, while using fewer trainable parameters. Our results highlight the benefits of changing the representation of operator learning rather than increasing model complexity.
Self-Distillation is Optimal Among Spectral Shrinkage Estimators in Spiked Covariance Models
Lecoiu, Radu, Mukherjee, Debarghya, Sur, Pragya
Self-distillation has emerged as a promising technique for improving model performance in modern machine learning systems. We develop the statistical foundations of self-distillation in spiked covariance models, by introducing and analyzing a broad class of estimators, namely spectral shrinkage estimators. We establish that for spiked covariance matrices with $s$ spikes, $s$-step self-distillation achieves optimal performance among spectral shrinkage estimators, outperforming well-known estimators in statistics and machine learning. Moreover, we show that $s$ steps are necessary for optimality: any $(s-k)$-step distilled estimator is strictly suboptimal for $1 \leq k \leq s$. For the special subclass of isotropic covariances, we show that optimally tuned Ridge regression performs best among spectral shrinkage estimators. We also study a federated approach where multiple data centers share spectral shrinkage estimators and a common server seeks to aggregate them to achieve optimal performance. In this case, we find that the best local rule again takes the form of self-distillation, though it differs from the optimal rule when data are hosted centrally on a single server. Together, our results elucidate why self-distillation improves predictive performance and provide a broader statistical framework connecting it with classical shrinkage-based methods.
K-Models: a Flexible and Interpretable Method for Ordinal Clustering with Application to Antigen-Antibody Interaction Profiles
Patanè, Giulia, Menafoglio, Alessandra, Krauth, Alexander, Fechner, Peter, Dede', Luca, Colosimo, Bianca Maria, Nicolussi, Federica
Existing clustering methods for functional data often prioritize partitioning accuracy over interpretability, making it challenging to extract meaningful insights when the data-generating process follows a specific underlying structure and an ordinal relationship among clusters is suspected. This work introduces K-Models, a novel framework that integrates ordinal constraints and estimates key underlying elements of the random process generating the observed functional profiles, improving both interpretability and structure identification. The proposed method is evaluated through simulations and real-world applications. In particular, it is tested on Region of Interest (ROI) curves, which represent reaction profiles from a reflectometric sensor monitoring biomolecular interactions, such as antigen-antibody binding. These curves represent changes in reflected light intensity over time at multiple measurement spots with immobilized antigens during analyte exposure, capturing the binding dynamics of the system. The goal is to identify intrinsic signal patterns solely from the observed dynamics, making this dataset an ideal benchmark for assessing the added interpretability of the proposed approach. By incorporating structural assumptions into the clustering process, K-Models enhances interpretability while maintaining performance comparable to state-of-the-art techniques, providing a valuable tool for analyzing functional data with an underlying ordinal structure.
Path-Based Gradient Boosting for Graph-Level Prediction
Meggio, Claudio, Pensar, Johan, De Bin, Riccardo
We propose PathBoost, a gradient tree boosting method for graph-level classification and regression that learns discriminative path-based features directly from the input graph structure. Building on a previous work, which was tailored to a specific chemistry application, PathBoost introduces three key extensions: (i) adaptation to binary classification through gradient boosting with a logistic loss, (ii) incorporation of multiple node and edge attributes into the path feature space via a prefix-based decomposition, and (iii) automatic anchor node selection based on categorical attribute diversity, eliminating the need for the user to specify the starting point of the considered path features. We compared PathBoost to graph neural networks and graph kernel approaches on several benchmark datasets, obtaining better results in half of them, and comparable results in the rest. PathBoost shows better performances on graphs with larger average node counts. Overall, the results demonstrate that path-based boosting methods can be competitive with more complex black-box approaches.
GravityGraphSAGE: Link Prediction in Directed Attributed Graphs
Porcedda, Riccardo, Chiaromonte, Francesca, Lillo, Fabrizio, Vandin, Andrea
Link prediction (inferring missing or future connections between nodes in a graph) is a fundamental problem in network science with widespread applications in, e.g., biological systems, recommender systems, finance and cybersecurity. The ability to accurately predict links has significant real-world applications, such as detecting fraudulent financial transactions or identifying drug-target interactions in biomedicine. Despite a rich literature, link prediction is still challenging, especially for graphs enriched with information on edges (direction) and nodes (attributes). In fact, research on link prediction, especially the one based on Graph Deep Learning (GDL), has mostly focused on undirected graphs, without fully leveraging node attributes. Here, we fill this gap by proposing Gravity-GraphSAGE (GG-SAGE), a modified version of GraphSAGE, a GDL model for node embeddings, composed of a gravity-inspired decoder. This implementation is the first example in the literature of a GraphSAGE backbone adopted for directed link prediction. Using the benchmark datasets Cora, Citeseer, PubMed and 16 real-world graphs from the online Netzschleuder repository, we show that our proposed model outperforms state-of-the-art GDL link prediction techniques. Using further experimental evidence, we relate the quality of the output of our model with various characteristics of the graph, suggesting that our framework scales well when applied to data of increasing complexity.
Asus' tiny touchscreen monitor is a solution in search of a problem
Despite ROG gaming branding, the device offers limited utility with 1920 720 resolution, 75Hz refresh rate, and requires external video sources. At €240, it's significantly overpriced compared to similar portable monitors available on Amazon for around $100 with better versatility. I've been using a triple monitor setup for almost 20 years. I also have an iPad on my desk to show little widgets, time zones, weather, notifications, yadda yadda. There are a of screens in front of me in my desktop setup, is my point. And yet, I still don't think I can use the Asus ROG Strix XG129C . If the name doesn't make it clear, it's a small 12.3-inch ultrawide touchscreen display that goes under a normal monitor. This gadget is very specifically a, not a tablet. It needs a source for its video via either USB-C or HDMI.