AITopics | momenta

momenta

Machine-Learning Accelerated Calculations of Reduced Density Matrices

Azam, Awwab A., Zhao, Lexu, Yu, Jiabin

arXiv.org Artificial Intelligence | Nov-11-2025

$n$-particle reduced density matrices ($n$-RDMs) play a central role in understanding correlated phases of matter. Yet the calculation of $n$-RDMs is often computationally inefficient for strongly-correlated states, particularly when the system sizes are large. In this work, we propose to use neural network (NN) architectures to accelerate the calculation of, and even predict, the $n$-RDMs for large-size systems. The underlying intuition is that $n$-RDMs are often smooth functions over the Brillouin zone (BZ) (certainly true for gapped states) and are thus interpolable, allowing NNs trained on small-size $n$-RDMs to predict large-size ones. Building on this intuition, we devise two NNs: (i) a self-attention NN that maps random RDMs to physical ones, and (ii) a Sinusoidal Representation Network (SIREN) that directly maps momentum-space coordinates to RDM values. We test the NNs in three 2D models: the pair-pair correlation functions of the Richardson model of superconductivity, the translationally-invariant 1-RDM in a four-band model with short-range repulsion, and the translation-breaking 1-RDM in the half-filled Hubbard model. We find that a SIREN trained on a $6\times 6$ momentum mesh can predict the $18\times 18$ pair-pair correlation function with a relative accuracy of $0.839$. The NNs trained on $6\times 6 \sim 8\times 8$ meshes can provide high-quality initial guesses for $50\times 50$ translation-invariant Hartree-Fock (HF) and $30\times 30$ fully translation-breaking-allowed HF, reducing the number of iterations required for convergence by up to $91.63\%$ and $92.78\%$, respectively, compared to random initializations. Our results illustrate the potential of using NN-based methods for interpolable $n$-RDMs, which might open a new avenue for future research on strongly correlated phases.
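The idea of a SIREN that maps momentum-space coordinates to RDM values can be illustrated with a minimal sketch. This is not the authors' implementation: the layer sizes, the frequency scale `w0 = 30`, the rescaling of the Brillouin zone to $[-1, 1)$, and the helper names `init_siren`, `siren_forward`, and `momentum_mesh` are all assumptions made for illustration, and the network below is untrained. The point it shows is only that a coordinate network is a continuous function of $(k_x, k_y)$, so the same weights can be queried on a $6\times 6$ mesh or an $18\times 18$ mesh.

```python
import math
import random


def init_siren(sizes, w0=30.0, seed=0):
    """Random weights for a small SIREN (hypothetical sizes, commonly used init)."""
    rng = random.Random(seed)
    layers = []
    for i, (n_in, n_out) in enumerate(zip(sizes[:-1], sizes[1:])):
        # First layer: U(-1/n_in, 1/n_in); later layers: U(-sqrt(6/n_in)/w0, +...).
        bound = 1.0 / n_in if i == 0 else math.sqrt(6.0 / n_in) / w0
        weights = [[rng.uniform(-bound, bound) for _ in range(n_in)]
                   for _ in range(n_out)]
        biases = [rng.uniform(-bound, bound) for _ in range(n_out)]
        layers.append((weights, biases))
    return layers


def siren_forward(layers, x, w0=30.0):
    """Apply sin(w0 * (W h + b)) on hidden layers and a linear map on the last."""
    h = list(x)
    for idx, (weights, biases) in enumerate(layers):
        z = [sum(w * v for w, v in zip(row, h)) + b
             for row, b in zip(weights, biases)]
        is_last = idx == len(layers) - 1
        h = z if is_last else [math.sin(w0 * v) for v in z]
    return h


def momentum_mesh(n):
    """n x n mesh of (kx, ky), with the Brillouin zone rescaled to [-1, 1)."""
    pts = [-1.0 + 2.0 * i / n for i in range(n)]
    return [(kx, ky) for kx in pts for ky in pts]


# Untrained network: 2 momentum coordinates in, 1 (real) RDM value out.
layers = init_siren([2, 16, 16, 1])
coarse = [siren_forward(layers, k)[0] for k in momentum_mesh(6)]
fine = [siren_forward(layers, k)[0] for k in momentum_mesh(18)]
print(len(coarse), len(fine))
```

In a real setting the weights would be fitted to RDM values on the coarse mesh before the fine mesh is evaluated; the sketch omits training entirely.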

artificial intelligence, machine learning, siren, (17 more...)

2511.07367

Country: North America > United States > Florida > Alachua County > Gainesville (0.28)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

MT-DAO: Multi-Timescale Distributed Adaptive Optimizers with Local Updates

Iacob, Alex, Jovanovic, Andrej, Safaryan, Mher, Kurmanji, Meghdad, Sani, Lorenzo, Horváth, Samuel, Shen, William F., Qiu, Xinchi, Lane, Nicholas D.

arXiv.org Artificial Intelligence | Oct-8-2025

Training large models with distributed data parallelism (DDP) requires frequent communication of gradients across workers, which can saturate bandwidth. Infrequent communication strategies (e.g., Local SGD) reduce this overhead but, when applied to adaptive optimizers, often suffer a performance gap relative to fully synchronous DDP. We trace this gap to a time-scale mismatch: the optimizer's fast-moving momentum, tuned for frequent updates, decays too quickly to smooth gradients over long intervals, leading to noise-dominated optimization. To address this, we propose MT-DAO, a family of optimizers that employs multiple slow- and fast-moving first momenta or the gradient to track update dynamics across different time scales, for which we provide the first convergence guarantees. Empirically, for language-model pre-training, this eliminates the performance gap with DDP, outperforming infrequent-communication baselines in perplexity and reducing iso-token wall-clock time by 6-27% on Ethernet interconnects. At the 720M scale, MT-DAO reaches a target perplexity in 24% fewer steps and 35% less time than the single-momentum DDP baseline. MT-DAO enables effective cross-datacenter training and training over wide geographic areas.
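The time-scale mismatch described in this abstract can be made concrete with a toy sketch. It is not the MT-DAO algorithm: the decay rates `0.9` and `0.99`, the synthetic noisy gradient, and the equal-weight mixing coefficient `mix` are assumptions chosen only to show how a fast exponential moving average forgets gradients within a few steps while a slow one averages over a much longer window.

```python
import math
import random


def half_life(beta):
    """Number of steps after which a gradient's weight in the EMA has halved."""
    return math.log(0.5) / math.log(beta)


def ema_trace(grads, beta):
    """Momentum as an exponential moving average: m <- beta*m + (1-beta)*g."""
    m, trace = 0.0, []
    for g in grads:
        m = beta * m + (1.0 - beta) * g
        trace.append(m)
    return trace


def variance(xs):
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs) / len(xs)


# Synthetic scalar gradients: true value 1.0 plus unit-variance noise (assumption).
rng = random.Random(0)
grads = [1.0 + rng.gauss(0.0, 1.0) for _ in range(5000)]

fast = ema_trace(grads, beta=0.9)    # short memory
slow = ema_trace(grads, beta=0.99)   # long memory

# Hypothetical combination of the two time scales into one update direction.
mix = 0.5
combined = [mix * f + (1.0 - mix) * s for f, s in zip(fast, slow)]

print("half-life fast:", round(half_life(0.9), 1), "steps")
print("half-life slow:", round(half_life(0.99), 1), "steps")
print("noise in fast momentum:", variance(fast[-1000:]))
print("noise in slow momentum:", variance(slow[-1000:]))
```

With a half-life of roughly 7 steps, the fast momentum cannot smooth gradients across a communication interval that is much longer than that, which is the noise-dominated regime the abstract refers to.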

large language model, machine learning, natural language, (18 more...)

2510.05361

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

291597a100aadd814d197af4f4bab3a7-AuthorFeedback.pdf

Neural Information Processing Systems | Oct-2-2025, 10:19:05 GMT

artificial intelligence, machine learning, registration, (18 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning

von Hippel, Matt, Wilhelm, Matthias

arXiv.org Artificial Intelligence | Feb-7-2025

Perturbative Quantum Field Theory has proven to be a vastly successful theoretical framework for calculating precision predictions, with applications ranging from collider physics to gravitational-wave physics. A crucial step in the calculation of precision predictions is the reduction of the occurring Feynman integrals to a much smaller set of so-called master integrals, using integration-by-parts (IBP) identities [1-3]. This IBP reduction is a major bottleneck in precision calculations, requiring hundreds of thousands of CPU hours in current applications [4] and obstructing other applications altogether. IBP identities relate Feynman integrals with different integer exponents of the propagators as well as irreducible scalar products (ISP) in the numerator. They can easily be derived for general values of the exponents, see e.g.

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

2502.05121

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Denmark > Capital Region > Copenhagen (0.04)
Europe > Denmark > Southern Denmark (0.04)
(2 more...)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

FAIR Universe HiggsML Uncertainty Challenge Competition

Bhimji, Wahid, Calafiura, Paolo, Chakkappai, Ragansu, Chang, Po-Wen, Chou, Yuan-Tang, Diefenbacher, Sascha, Dudley, Jordan, Farrell, Steven, Ghosh, Aishik, Guyon, Isabelle, Harris, Chris, Hsu, Shih-Chieh, Khoda, Elham E, Lyscar, Rémy, Michon, Alexandre, Nachman, Benjamin, Nugent, Peter, Reymond, Mathis, Rousseau, David, Sluijter, Benjamin, Thorne, Benjamin, Ullah, Ihsan, Zhang, Yulei

arXiv.org Artificial Intelligence | Dec-18-2024

The FAIR Universe -- HiggsML Uncertainty Challenge focuses on measuring the physics properties of elementary particles with imperfect simulators due to differences in modelling systematic errors. Additionally, the challenge is leveraging a large-compute-scale AI platform for sharing datasets, training models, and hosting machine learning competitions. Our challenge brings together the physics and machine learning communities to advance our understanding and methodologies in handling systematic (epistemic) uncertainties within AI techniques.

artificial intelligence, machine learning, particle, (17 more...)

2410.02867

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
North America > United States > California > Orange County > Irvine (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

$\nu$-Flows: Conditional Neutrino Regression

Leigh, Matthew, Raine, John Andrew, Zoch, Knut, Golling, Tobias

arXiv.org Artificial Intelligence | Jun-22-2023

We present $\nu$-Flows, a novel method for restricting the likelihood space of neutrino kinematics in high energy collider experiments using conditional normalizing flows and deep invertible neural networks. This method allows the recovery of the full neutrino momentum which is usually left as a free parameter and permits one to sample neutrino values under a learned conditional likelihood given event observations. We demonstrate the success of $\nu$-Flows in a case study by applying it to simulated semileptonic $t\bar{t}$ events and show that it can lead to more accurate momentum reconstruction, particularly of the longitudinal coordinate. We also show that this has direct benefits in a downstream task of jet association, leading to an improvement of up to a factor of 1.41 compared to conventional methods.

artificial intelligence, collaboration, machine learning, (18 more...)

doi: 10.21468/SciPostPhys.14.6.159

2207.00664

Country:

North America > Canada (0.28)
Europe > Switzerland (0.28)
North America > United States (0.14)
(2 more...)

Genre: Research Report > Promising Solution (0.48)

Industry: Energy > Oil & Gas (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

$m^\ast$ of two-dimensional electron gas: a neural canonical transformation study

Xie, Hao, Zhang, Linfeng, Wang, Lei

arXiv.org Artificial Intelligence | Jun-15-2023

The quasiparticle effective mass $m^\ast$ of interacting electrons is a fundamental quantity in the Fermi liquid theory. However, the precise value of the effective mass of uniform electron gas is still elusive after decades of research. The newly developed neural canonical transformation approach [Xie et al., J. Mach. Learn. 1, (2022)] offers a principled way to extract the effective mass of electron gas by directly calculating the thermal entropy at low temperature. The approach models a variational many-electron density matrix using two generative neural networks: an autoregressive model for momentum occupation and a normalizing flow for electron coordinates. Our calculation reveals a suppression of effective mass in the two-dimensional spin-polarized electron gas, which is more pronounced than previous reports in the low-density strong-coupling region. This prediction calls for verification in two-dimensional electron gas experiments.

artificial intelligence, effective mass, machine learning, (17 more...)

doi: 10.21468/SciPostPhys.14.6.154

2201.03156

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Beijing > Beijing (0.04)
Asia > Japan (0.04)
Africa > Comoros > Grande Comore > Moroni (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Unravelling physics beyond the standard model with classical and quantum anomaly detection

Schuhmacher, Julian, Boggia, Laura, Belis, Vasilis, Puljak, Ema, Grossi, Michele, Pierini, Maurizio, Vallecorsa, Sofia, Tacchino, Francesco, Barkoutsos, Panagiotis, Tavernelli, Ivano

arXiv.org Artificial Intelligence | Jan-27-2023

Much hope for finding new physics phenomena at the microscopic scale relies on observations from High Energy Physics experiments, such as those performed at the Large Hadron Collider (LHC). However, current experiments do not indicate clear signs of new physics that could guide the development of additional Beyond Standard Model (BSM) theories. Identifying signatures of new physics in the enormous amount of data produced at the LHC falls into the class of anomaly detection and constitutes one of the greatest computational challenges. In this article, we propose a novel strategy to perform anomaly detection in a supervised learning setting, based on the artificial creation of anomalies through a random process. For the resulting supervised learning problem, we successfully apply classical and quantum Support Vector Classifiers (CSVC and QSVC respectively) to identify the artificial anomalies among the SM events. Even more promising, we find that an SVC trained to identify the artificial anomalies can identify realistic BSM events with high accuracy. In parallel, we also explore the potential of quantum algorithms for improving the classification accuracy and provide plausible conditions for the best exploitation of this novel computational paradigm.

artificial intelligence, data mining, machine learning, (18 more...)

doi: 10.1088/2632-2153/ad07f7

2301.10787

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.05)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
(3 more...)

Genre:

Research Report (1.00)
Workflow (0.94)

Industry: Education (0.34)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

SAIC Mobility Robotaxi valued at $1B after $148M Series B – TechCrunch

#artificialintelligence | Aug-16-2022, 21:50:45 GMT

SAIC Mobility Robotaxi, an arm of state-owned Chinese automaker SAIC that aims to launch a commercial robotaxi service, raised $148 million (RMB 1 billion). The funds will be used to scale its robotaxi service in China, which it will operate in partnership with autonomous vehicle company Momenta. SAIC Group led the Series B round that also saw participation from Momenta, Gaoheng Management Consulting and other institutions. The funding brought SAIC Mobility's total valuation to more than $1 billion, according to the company. The company's robotaxis are powered using Momenta's "Flywheel L4" technology, which is designed to use deep learning rather than a rules-based, machine learning approach.

momenta, robotaxi service, saic mobility robotaxi, (9 more...)

Country:

Asia > China > Shanghai > Shanghai (0.09)
Asia > China > Hubei Province > Wuhan (0.06)
Asia > China > Chongqing Province > Chongqing (0.06)
Asia > China > Beijing > Beijing (0.06)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)
Automobiles & Trucks (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Particle-based Fast Jet Simulation at the LHC with Variational Autoencoders

Touranakou, Mary, Chernyavskaya, Nadezda, Duarte, Javier, Gunopulos, Dimitrios, Kansal, Raghav, Orzari, Breno, Pierini, Maurizio, Tomei, Thiago, Vlimant, Jean-Roch

arXiv.org Artificial Intelligence | Mar-1-2022

We study how to use Deep Variational Autoencoders for fast simulation of particle jets at the LHC. We represent jets as lists of constituents, characterized by their momenta. Starting from a simulation of the jet before detector effects, we train a Deep Variational Autoencoder to return the corresponding list of constituents after detection. Doing so, we bypass both the time-consuming detector simulation and the collision reconstruction steps of a traditional processing chain, significantly speeding up the event generation workflow. Through model optimization and hyperparameter tuning, we achieve state-of-the-art precision on the jet four-momentum, while providing an accurate description of the constituents' momenta and an inference time comparable to that of a rule-based fast simulation.
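The relation between a jet's constituents and the jet four-momentum mentioned in this abstract can be sketched in a few lines. This is standard collider kinematics rather than anything specific to the paper: the constituents are assumed massless and given as $(p_T, \eta, \phi)$ triples, the example values are invented, and the function names are hypothetical.

```python
import math


def four_momentum(pt, eta, phi):
    """(E, px, py, pz) of a massless constituent from (pT, eta, phi)."""
    return (pt * math.cosh(eta),
            pt * math.cos(phi),
            pt * math.sin(phi),
            pt * math.sinh(eta))


def jet_four_momentum(constituents):
    """Jet four-momentum as the component-wise sum over its constituents."""
    energy = px = py = pz = 0.0
    for pt, eta, phi in constituents:
        e, x, y, z = four_momentum(pt, eta, phi)
        energy += e
        px += x
        py += y
        pz += z
    return (energy, px, py, pz)


def invariant_mass(p):
    """sqrt(E^2 - |p|^2), clamped at zero to absorb rounding error."""
    energy, px, py, pz = p
    m2 = energy * energy - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))


def transverse_momentum(p):
    return math.hypot(p[1], p[2])


# Invented example: three collimated constituents, pT in GeV.
jet = [(40.0, 0.10, 0.50), (25.0, 0.15, 0.45), (10.0, 0.05, 0.60)]
p_jet = jet_four_momentum(jet)
print("jet pT:", transverse_momentum(p_jet))
print("jet mass:", invariant_mass(p_jet))
```

A generative model that outputs constituent lists can be scored on derived quantities such as these, which is one way to read "precision on the jet four-momentum".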

artificial intelligence, machine learning, particle, (17 more...)

doi: 10.1088/2632-2153/ac7c56

2203.0052

Country:

South America > Brazil > São Paulo (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Switzerland (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Government > Regional Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
