momenta
291597a100aadd814d197af4f4bab3a7-AuthorFeedback.pdf
We thank the reviewers for their efforts in reviewing and their feedback, considering the paper "well written", "[a R2 commented that our "methods mixing geometric deformation models and DL may interest the NeurIPS community." Parts of the writing that were difficult to follow. Do time-dependent velocity fields matter? Y es, from a theoretical point of view (see 1 above). Practically, when very large deformations (e.g., lung motions or large shape changes) are to be modeled.
Machine-Learning Accelerated Calculations of Reduced Density Matrices
Azam, Awwab A., Zhao, Lexu, Yu, Jiabin
$n$-particle reduced density matrices ($n$-RDMs) play a central role in understanding correlated phases of matter. Yet the calculation of $n$-RDMs is often computationally inefficient for strongly-correlated states, particularly when the system sizes are large. In this work, we propose to use neural network (NN) architectures to accelerate the calculation of, and even predict, the $n$-RDMs for large-size systems. The underlying intuition is that $n$-RDMs are often smooth functions over the Brillouin zone (BZ) (certainly true for gapped states) and are thus interpolable, allowing NNs trained on small-size $n$-RDMs to predict large-size ones. Building on this intuition, we devise two NNs: (i) a self-attention NN that maps random RDMs to physical ones, and (ii) a Sinusoidal Representation Network (SIREN) that directly maps momentum-space coordinates to RDM values. We test the NNs in three 2D models: the pair-pair correlation functions of the Richardson model of superconductivity, the translationally-invariant 1-RDM in a four-band model with short-range repulsion, and the translation-breaking 1-RDM in the half-filled Hubbard model. We find that a SIREN trained on a $6\times 6$ momentum mesh can predict the $18\times 18$ pair-pair correlation function with a relative accuracy of $0.839$. The NNs trained on $6\times 6 \sim 8\times 8$ meshes can provide high-quality initial guesses for $50\times 50$ translation-invariant Hartree-Fock (HF) and $30\times 30$ fully translation-breaking-allowed HF, reducing the number of iterations required for convergence by up to $91.63\%$ and $92.78\%$, respectively, compared to random initializations. Our results illustrate the potential of using NN-based methods for interpolable $n$-RDMs, which might open a new avenue for future research on strongly correlated phases.
- North America > United States > Florida > Alachua County > Gainesville (0.28)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (3 more...)
MT-DAO: Multi-Timescale Distributed Adaptive Optimizers with Local Updates
Iacob, Alex, Jovanovic, Andrej, Safaryan, Mher, Kurmanji, Meghdad, Sani, Lorenzo, Horváth, Samuel, Shen, William F., Qiu, Xinchi, Lane, Nicholas D.
Training large models with distributed data parallelism (DDP) requires frequent communication of gradients across workers, which can saturate bandwidth. Infrequent communication strategies (e.g., Local SGD) reduce this overhead but, when applied to adaptive optimizers, often suffer a performance gap relative to fully synchronous DDP. We trace this gap to a time-scale mismatch: the optimizer's fast-moving momentum, tuned for frequent updates, decays too quickly to smooth gradients over long intervals, leading to noise-dominated optimization. To address this, we propose MT-DAO, a family of optimizers that employs multiple slow- and fast-moving first momenta or the gradient to track update dynamics across different time scales, for which we provide the first convergence guarantees. Empirically, for language-model pre-training, this eliminates the performance gap with DDP, outperforming infrequent-communication baselines in perplexity and reducing iso-token wall-clock time by 6-27% on Ethernet interconnects. At the 720M scale, MT-DAO reaches a target perplexity in 24% fewer steps and 35% less time than the single-momentum DDP baseline. MT-DAO enables effective cross-datacenter training and training over wide geographic areas.
- North America > United States > Virginia (0.04)
- Asia > Middle East > Jordan (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Austria (0.04)
Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning
von Hippel, Matt, Wilhelm, Matthias
Perturbative Quantum Field Theory has proven to be a vastly successful theoretical framework for calculating precision predictions, with applications ranging from collider physics to gravitational-wave physics. A crucial step in the calculation of precision predictions is the reduction of the occurring Feynman integrals to a much smaller set of so-called master integrals, using integration-by-parts (IBP) identities [1-3]. This IBP reduction is a major bottleneck in precision calculations, requiring hundred thousands of CPU hours in current applications [4] and obstructing other applications altogether. IBP identities relate Feynman integrals with different integer exponents of the propagators as well as irreducible scalar products (ISP) in the numerator. They can easily be derived for general values of the exponents, see e.g.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Denmark > Capital Region > Copenhagen (0.04)
- Europe > Denmark > Southern Denmark (0.04)
- (2 more...)
FAIR Universe HiggsML Uncertainty Challenge Competition
Bhimji, Wahid, Calafiura, Paolo, Chakkappai, Ragansu, Chang, Po-Wen, Chou, Yuan-Tang, Diefenbacher, Sascha, Dudley, Jordan, Farrell, Steven, Ghosh, Aishik, Guyon, Isabelle, Harris, Chris, Hsu, Shih-Chieh, Khoda, Elham E, Lyscar, Rémy, Michon, Alexandre, Nachman, Benjamin, Nugent, Peter, Reymond, Mathis, Rousseau, David, Sluijter, Benjamin, Thorne, Benjamin, Ullah, Ihsan, Zhang, Yulei
The FAIR Universe -- HiggsML Uncertainty Challenge focuses on measuring the physics properties of elementary particles with imperfect simulators due to differences in modelling systematic errors. Additionally, the challenge is leveraging a large-compute-scale AI platform for sharing datasets, training models, and hosting machine learning competitions. Our challenge brings together the physics and machine learning communities to advance our understanding and methodologies in handling systematic (epistemic) uncertainties within AI techniques.
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- North America > United States > California > Orange County > Irvine (0.04)
- (4 more...)
\nu-Flows: Conditional Neutrino Regression
Leigh, Matthew, Raine, John Andrew, Zoch, Knut, Golling, Tobias
We present $\nu$-Flows, a novel method for restricting the likelihood space of neutrino kinematics in high energy collider experiments using conditional normalizing flows and deep invertible neural networks. This method allows the recovery of the full neutrino momentum which is usually left as a free parameter and permits one to sample neutrino values under a learned conditional likelihood given event observations. We demonstrate the success of $\nu$-Flows in a case study by applying it to simulated semileptonic $t\bar{t}$ events and show that it can lead to more accurate momentum reconstruction, particularly of the longitudinal coordinate. We also show that this has direct benefits in a downstream task of jet association, leading to an improvement of up to a factor of 1.41 compared to conventional methods.
- North America > Canada (0.28)
- Europe > Switzerland (0.28)
- North America > United States (0.14)
- (2 more...)
$m^\ast$ of two-dimensional electron gas: a neural canonical transformation study
Xie, Hao, Zhang, Linfeng, Wang, Lei
The quasiparticle effective mass $m^\ast$ of interacting electrons is a fundamental quantity in the Fermi liquid theory. However, the precise value of the effective mass of uniform electron gas is still elusive after decades of research. The newly developed neural canonical transformation approach [Xie et al., J. Mach. Learn. 1, (2022)] offers a principled way to extract the effective mass of electron gas by directly calculating the thermal entropy at low temperature. The approach models a variational many-electron density matrix using two generative neural networks: an autoregressive model for momentum occupation and a normalizing flow for electron coordinates. Our calculation reveals a suppression of effective mass in the two-dimensional spin-polarized electron gas, which is more pronounced than previous reports in the low-density strong-coupling region. This prediction calls for verification in two-dimensional electron gas experiments.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > China > Beijing > Beijing (0.04)
- Asia > Japan (0.04)
- Africa > Comoros > Grande Comore > Moroni (0.04)
Unravelling physics beyond the standard model with classical and quantum anomaly detection
Schuhmacher, Julian, Boggia, Laura, Belis, Vasilis, Puljak, Ema, Grossi, Michele, Pierini, Maurizio, Vallecorsa, Sofia, Tacchino, Francesco, Barkoutsos, Panagiotis, Tavernelli, Ivano
Much hope for finding new physics phenomena at microscopic scale relies on the observations obtained from High Energy Physics experiments, like the ones performed at the Large Hadron Collider (LHC). However, current experiments do not indicate clear signs of new physics that could guide the development of additional Beyond Standard Model (BSM) theories. Identifying signatures of new physics out of the enormous amount of data produced at the LHC falls into the class of anomaly detection and constitutes one of the greatest computational challenges. In this article, we propose a novel strategy to perform anomaly detection in a supervised learning setting, based on the artificial creation of anomalies through a random process. For the resulting supervised learning problem, we successfully apply classical and quantum Support Vector Classifiers (CSVC and QSVC respectively) to identify the artificial anomalies among the SM events. Even more promising, we find that employing an SVC trained to identify the artificial anomalies, it is possible to identify realistic BSM events with high accuracy. In parallel, we also explore the potential of quantum algorithms for improving the classification accuracy and provide plausible conditions for the best exploitation of this novel computational paradigm.
- Europe > Switzerland > Zürich > Zürich (0.14)
- Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.05)
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- (3 more...)
- Research Report (1.00)
- Workflow (0.94)
SAIC Mobility Robotaxi valued at $1B after $148M Series B – TechCrunch
SAIC Mobility Robotaxi, an arm of state-owned Chinese automaker SAIC that aims to launch a commercial robotaxi service, raised $148 million (RMB 1 billion). The funds will be used to scale its robotaxi service in China, which it will operate in partnership with autonomous vehicle company Momenta. SAIC Group led the Series B round that also saw participation from Momenta, Gaoheng Management Consulting and other institutions. The funding brought SAIC Mobility's total valuation to more than $1 billion, according to the company. The company's robotaxis are powered using Momenta's "Flywheel L4" technology, which is designed to use deep learning rather than a rules-based, machine learning approach.
- Transportation > Ground > Road (1.00)
- Information Technology > Robotics & Automation (1.00)
- Automobiles & Trucks (1.00)