Goto

Collaborating Authors

 Sardinia







Physics-informed Blind Reconstruction of Dense Fields from Sparse Measurements using Neural Networks with a Differentiable Simulator

Aloni, Ofek, Fishbain, Barak

arXiv.org Machine Learning

Generating dense physical fields from sparse measurements is a fundamental question in sampling, signal processing, and many other applications. State-of-the-art methods either use spatial statistics or rely on examples of dense fields in the training phase, which often are not available, and thus rely on synthetic data. Here, we present a reconstruction method that generates dense fields from sparse measurements, without assuming availability of the spatial statistics, nor of examples of the dense fields. This is made possible through the introduction of an automatically differentiable numerical simulator into the training phase of the method. The method is shown to have superior results over statistical and neural network based methods on a set of three standard problems from fluid mechanics.


From Shortcut to Induction Head: How Data Diversity Shapes Algorithm Selection in Transformers

Kawata, Ryotaro, Song, Yujin, Bietti, Alberto, Nishikawa, Naoki, Suzuki, Taiji, Vaiter, Samuel, Wu, Denny

arXiv.org Machine Learning

Transformers can implement both generalizable algorithms (e.g., induction heads) and simple positional shortcuts (e.g., memorizing fixed output positions). In this work, we study how the choice of pretraining data distribution steers a shallow transformer toward one behavior or the other. Focusing on a minimal trigger-output prediction task -- copying the token immediately following a special trigger upon its second occurrence -- we present a rigorous analysis of gradient-based training of a single-layer transformer. In both the infinite and finite sample regimes, we prove a transition in the learned mechanism: if input sequences exhibit sufficient diversity, measured by a low ``max-sum'' ratio of trigger-to-trigger distances, the trained model implements an induction head and generalizes to unseen contexts; by contrast, when this ratio is large, the model resorts to a positional shortcut and fails to generalize out-of-distribution (OOD). We also reveal a trade-off between the pretraining context length and OOD generalization, and derive the optimal pretraining distribution that minimizes computational cost per sample. Finally, we validate our theoretical predictions with controlled synthetic experiments, demonstrating that broadening context distributions robustly induces induction heads and enables OOD generalization. Our results shed light on the algorithmic biases of pretrained transformers and offer conceptual guidelines for data-driven control of their learned behaviors.


Consensus dimension reduction via multi-view learning

An, Bingxue, Tang, Tiffany M.

arXiv.org Machine Learning

Dimension reduction methods are a fundamental class of techniques in data analysis, which aim to find a lower-dimensional representation of higher-dimensional data while preserving as much of the original information as possible. These methods are extensively used in practice, including in exploratory data analyses to visualize data--arguably, one of the first and most vital steps in any data analysis (Ray et al., 2021). Notably, in genomics, dimension reduction methods are ubiquitously applied to visualize high-dimensional single-cell RNA sequencing data in two dimensions (Becht et al., 2019). Beyond visualization, dimension reduction methods are also frequently employed to mitigate the curse of dimensionality (Bellman, 1957), engineer new features to improve downstream tasks like prediction (e.g., Massy, 1965), and enable scientific discovery in unsupervised learning settings (Chang et al., 2025). For example, many researchers have used dimension reduction in conjunction with clustering to discover new cell types and cell states (Wu et al., 2021), new cancer subtypes (Northcott et al., 2017), and other substantively-meaningful structure in a variety of domains (Bergen et al., 2019; Traven et al., 2017). Given the widespread use and need for dimension reduction methods, numerous dimension reduction techniques have been developed. Popular techniques include but are not limited to principal component analysis (PCA) (Pearson, 1901; Hotelling, 1933), multidimensional scaling (MDS) (Torgerson, 1952; Kruskal, 1964a), Isomap (Tenenbaum et al., 2000), locally linear embedding (LLE) (Roweis and Saul, 2000), t-distributed stochastic neighbor embedding (t-SNE) (van der 1


Rise of the digital threesome: British couples are turning to AI to spice up their sex lives, study reveals

Daily Mail - Science & tech

Ghislaine Maxwell's ultimate humiliation: Epstein's sex trafficker girlfriend poses in outrageous outfits and exposes herself in dozens of photos released from the billionaire paedophile's files I was falsely accused of being the Brown University shooter... Silent Trump flees growing storm over Epstein'cover-up' as he jets off for holidays without ANY comment I've spotted something else in this photo of Karoline Leavitt's'injection sites'... every woman you know will say the same thing about it: KENNEDY Why Conan O'Brien'stopped party guests calling 911' on Nick Reiner: Insiders reveal disturbing new details of final hours before Rob and Michele murders After 27 years as a TV anchor I was suddenly pulled off screens. My boss's explanation was a brutal lesson in loyalty Emily in Paris cast left'aghast' and'walking on eggshells' as off-camera drama becomes overwhelming... and whispers swirl about a CURSE Doctors said my hip pain was just tendinitis from sitting all day at work. The real cause may kill me... they had left it far too late Chilling mystery of'toddler's foot' in Pedophile Island snap as the worst of THOUSANDS of new Epstein file photos exposed I was dead for 105 minutes and learned exactly how you get into heaven... then Jesus spoke six words into my mind and sent me back Jake Paul's jaw is broken in Anthony Joshua battering: YouTuber-turned-boxer rushes to hospital Kennedy niece vows to attack Trump's name with a PICKAX amid awkward gaffe in center's new signage Andrew's fury at anyone who doesn't bow and scrape. The expletive-ridden bust-up... and final ignominy revealed The devastating story of'feral child' Genie Wiley whose father tied her up and locked her in a room until the age of 13 - and the scientific tug of war which broke out upon her discovery Trump launches massive airstrikes in Syria as'vengeance' for killing three Americans Reiner family bombshell as insiders reveal who is paying for Nick's celebrity lawyer... their secret motive... and who will REALLY inherit $200m fortune'Donald Trump has given white South Africans hope': Refugee fleeing the country for new life in US details horrific torture being inflicted in farm attacks as president is hailed for'fighting evil' Terrifying maps break down exactly who is at risk of new'super flu' exploding across America... as doctors reveal symptoms to really worry about Brave Epstein victims speak out over'lack of transparency' in Epstein files drop READ MORE: Doctor reveals sex trends including'freak matching' Forget couples counselling or a saucy mini-break - when it comes to spicing up their sex lives, British couples are turning to AI . A new report by Lovehoney has detailed the rise of the'digital threesome' in the UK.


Optimizing Drivers' Discount Order Acceptance Strategies: A Policy-Improved Deep Deterministic Policy Gradient Framework

Dai, Hanwen, Gao, Chang, He, Fang, Ji, Congyuan, Yang, Yanni

arXiv.org Artificial Intelligence

The rapid expansion of platform integration has emerged as an effective solution to mitigate market fragmentation by consolidating multiple ride-hailing platforms into a single application. To address heterogeneous passenger preferences, third-party integrators provide Discount Express service delivered by express drivers at lower trip fares. For the individual platform, encouraging broader participation of drivers in Discount Express services has the potential to expand the accessible demand pool and improve matching efficiency, but often at the cost of reduced profit margins. This study aims to dynamically manage drivers' acceptance of Discount Express from the perspective of an individual platform. The lack of historical data under the new business model necessitates online learning. However, early-stage exploration through trial and error can be costly in practice, highlighting the need for reliable early-stage performance in real-world deployment. To address these challenges, this study formulates the decision regarding the proportion of drivers accepting discount orders as a continuous control task. In response to the high stochasticity and the opaque matching mechanisms employed by third-party integrator, we propose an innovative policy-improved deep deterministic policy gradient (pi-DDPG) framework. The proposed framework incorporates a refiner module to boost policy performance during the early training phase. A customized simulator based on a real-world dataset is developed to validate the effectiveness of the proposed pi-DDPG. Numerical experiments demonstrate that pi-DDPG achieves superior learning efficiency and significantly reduces early-stage training losses, enhancing its applicability to practical ride-hailing scenarios.