Energy
UniFault: A Fault Diagnosis Foundation Model from Bearing Data
Eldele, Emadeldeen, Ragab, Mohamed, Qing, Xu, Edward, null, Chen, Zhenghua, Wu, Min, Li, Xiaoli, Lee, Jay
Machine fault diagnosis (FD) is a critical task for predictive maintenance, enabling early fault detection and preventing unexpected failures. Despite its importance, existing FD models are operation-specific with limited generalization across diverse datasets. Foundation models (FM) have demonstrated remarkable potential in both visual and language domains, achieving impressive generalization capabilities even with minimal data through few-shot or zero-shot learning. However, translating these advances to FD presents unique hurdles. Unlike the large-scale, cohesive datasets available for images and text, FD datasets are typically smaller and more heterogeneous, with significant variations in sampling frequencies and the number of channels across different systems and applications. This heterogeneity complicates the design of a universal architecture capable of effectively processing such diverse data while maintaining robust feature extraction and learning capabilities. In this paper, we introduce UniFault, a foundation model for fault diagnosis that systematically addresses these issues. Specifically, the model incorporates a comprehensive data harmonization pipeline featuring two key innovations. First, a unification scheme transforms multivariate inputs into standardized univariate sequences. Second, a novel cross-domain temporal fusion strategy mitigates distribution shifts and enriches sample diversity and count, improving the model generalization across varying conditions. UniFault is pretrained on over 6.9 million samples spanning diverse FD datasets, enabling superior few-shot performance. Extensive experiments on real-world FD datasets demonstrate that UniFault achieves state-of-the-art performance, setting a new benchmark for fault diagnosis models and paving the way for more scalable and robust predictive maintenance solutions.
Dissecting FLOPs along input dimensions for GreenAI cost estimations
Asperti, Andrea, Evangelista, Davide, Marzolla, Moreno
The term GreenAI refers to a novel approach to Deep Learning, that is more aware of the ecological impact and the computational efficiency of its methods. The promoters of GreenAI suggested the use of Floating Point Operations (FLOPs) as a measure of the computational cost of Neural Networks; however, that measure does not correlate well with the energy consumption of hardware equipped with massively parallel processing units like GPUs or TPUs. In this article, we propose a simple refinement of the formula used to compute floating point operations for convolutional layers, called α -FLOPs, explaining and correcting the traditional discrepancy with respect to different layers, and closer to reality. The notion of α -FLOPs relies on the crucial insight that, in case of inputs with multiple dimensions, there is no reason to believe that the speedup offered by parallelism will be uniform along all different axes.
Scaling Multi-Agent Environment Co-Design with Diffusion Models
Li, Hao Xiang, Amir, Michael, Prorok, Amanda
The agent-environment co-design paradigm jointly optimises agent policies and environment configurations in search of improved system performance. With application domains ranging from warehouse logistics to windfarm management, co-design promises to fundamentally change how we deploy multi-agent systems. However, current co-design methods struggle to scale. They collapse under high-dimensional environment design spaces and suffer from sample inefficiency when addressing moving targets inherent to joint optimisation. We address these challenges by developing Diffusion Co-Design (DiCoDe), a scalable and sample-efficient co-design framework pushing co-design towards practically relevant settings. DiCoDe incorporates two core innovations. First, we introduce Projected Universal Guidance (PUG), a sampling technique that enables DiCoDe to explore a distribution of reward-maximising environments while satisfying hard constraints such as spatial separation between obstacles. Second, we devise a critic distillation mechanism to share knowledge from the reinforcement learning critic, ensuring that the guided diffusion model adapts to evolving agent policies using a dense and up-to-date learning signal. Together, these improvements lead to superior environment-policy pairs when validated on challenging multi-agent environment co-design benchmarks including warehouse automation, multi-agent pathfinding and wind farm optimisation. Our method consistently exceeds the state-of-the-art, achieving, for example, 39% higher rewards in the warehouse setting with 66% fewer simulation samples. This sets a new standard in agent-environment co-design, and is a stepping stone towards reaping the rewards of co-design in real world domains. The performance of agents is fundamentally tied to the environments they inhabit.
Zero-shot data citation function classification using transformer-based large language models (LLMs)
Byers, Neil, Zaidi, Ali, Skye, Valerie, Beecroft, Chris, Fagnan, Kjiersten
Efforts have increased in recent years to identify associations between specific datasets and the scientific literature that incorporates them. Knowing that a given publication cites a given dataset, the next logical step is to explore how or why that data was used. Advances in recent years with pretrained, transformer-based large language models (LLMs) offer potential means for scaling the description of data use cases in the published literature. This avoids expensive manual labeling and the development of training datasets for classical machine-learning (ML) systems. In this work we apply an open-source LLM, Llama 3.1-405B, to generate structured data use case labels for publications known to incorporate specific genomic datasets. We also introduce a novel evaluation framework for determining the efficacy of our methods. Our results demonstrate that the stock model can achieve an F1 score of .674 on a zero-shot data citation classification task with no previously defined categories. While promising, our results are qualified by barriers related to data availability, prompt overfitting, computational infrastructure, and the expense required to conduct responsible performance evaluation.
Predicting Weekly Fishing Concentration Zones through Deep Learning Integration of Heterogeneous Environmental Spatial Datasets
Rele, Chaitanya, Rathod, Aditya, Natu, Kaustubh, Kulkarni, Saurabh, Koli, Ajay, Makdey, Swapnali
The North Indian Ocean, including the Arabian Sea and the Bay of Bengal, represents a vital source of livelihood for coastal communities, yet fishermen often face uncertainty in locating productive fishing grounds. To address this challenge, we present an AI-assisted framework for predicting Potential Fishing Zones (PFZs) using oceanographic parameters such as sea surface temperature and chlorophyll concentration. The approach is designed to enhance the accuracy of PFZ identification and provide region-specific insights for sustainable fishing practices. Preliminary results indicate that the framework can support fishermen by reducing search time, lowering fuel consumption, and promoting efficient resource utilization.
In Situ Training of Implicit Neural Compressors for Scientific Simulations via Sketch-Based Regularization
Simpson, Cooper, Becker, Stephen, Doostan, Alireza
Focusing on implicit neural representations, we present a novel in situ training protocol that employs limited memory buffers of full and sketched data samples, where the sketched data are leveraged to prevent catastrophic forgetting. The theoretical motivation for our use of sketching as a regularizer is presented via a simple Johnson-Lindenstrauss-informed result. While our methods may be of wider interest in the field of continual learning, we specifically target in situ neural compression using implicit neural representation-based hypernetworks. We evaluate our method on a variety of complex simulation data in two and three dimensions, over long time horizons, and across unstructured grids and non-Cartesian geometries. On these tasks, we show strong reconstruction performance at high compression rates. Most importantly, we demonstrate that sketching enables the presented in situ scheme to approximately match the performance of the equivalent offline method.
Flow matching for reaction pathway generation
Tuo, Ping, Chen, Jiale, Li, Ju
Elucidating reaction mechanisms hinges on efficiently generating transition states (TSs), products, and complete reaction networks. Recent generative models, such as diffusion models for TS sampling and sequence-based architectures for product generation, offer faster alternatives to quantum-chemistry searches. But diffusion models remain constrained by their stochastic differential equation (SDE) dynamics, which suffer from inefficiency and limited controllability. We show that flow matching, a deterministic ordinary differential (ODE) formulation, can replace SDE-based diffusion for molecular and reaction generation. We introduce MolGEN, a conditional flow-matching framework that learns an optimal transport path to transport Gaussian priors to target chemical distributions. On benchmarks used by TSDiff and OA-ReactDiff, MolGEN surpasses TS geometry accuracy and barrier-height prediction while reducing sampling to sub-second inference. MolGEN also supports open-ended product generation with competitive top-k accuracy and avoids mass/electron-balance violations common to sequence models. In a realistic test on the $γ$-ketohydroperoxide decomposition network, MolGEN yields higher fractions of valid and intended TSs with markedly fewer quantum-chemistry evaluations than string-based baselines. These results demonstrate that deterministic flow matching provides a unified, accurate, and computationally efficient foundation for molecular generative modeling, signaling that flow matching is the future for molecular generation across chemistry.
Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes
Dearing, Matthew T., Tao, Yiheng, Wu, Xingfu, Lan, Zhiling, Taylor, Valerie
While large language models (LLMs) are increasingly used for generating parallel scientific codes, most efforts emphasize functional correctness, often overlooking performance, especially energy efficiency. We propose LASSI-EE, an automated LLM-based refactoring framework that generates energy-efficient parallel codes through a multi-stage, iterative approach integrating runtime power profiling, energy-aware prompting, self-correcting feedback loops, and an LLM-as-a-Judge agent for automated screening of code solutions. We introduce energy-reduction@k, a novel metric that quantifies expected energy reduction when generating k code candidates and selecting the most energy-efficient, enabling systematic evaluation of multi-attempt generation strategies. Evaluating 20 HeCBench applications and two miniApps on NVIDIA A100 and AMD MI100 GPUs, a single run (k=1) with LASSI-EE delivers refactored parallel codes with an average 29% expected energy reduction at an 81% pass rate, representing a 2.8x improvement over vanilla LLM prompting. Multiple runs (k=3) achieve an average 48% expected energy reduction at a 97% pass rate. These results are consistent across devices, demonstrating LASSI-EE's effectiveness across diverse hardware architectures.
The Download: the solar geoengineering race, and future gazing with the The Simpsons
Last week, an American-Israeli company that claims it's developed proprietary technology to cool the planet announced it had raised $60 million, by far the largest known venture capital round to date for a solar geoengineering startup. The company, Stardust, says the funding will enable it to develop a system that could be deployed by the start of the next decade, according to Heatmap, which broke the story. As scientists who have worked on the science of solar geoengineering for decades, we have grown increasingly concerned about emerging efforts to start and fund private companies to deploy technologies that could alter the climate of the planet. We also strongly dispute some of the technical claims that certain companies have made about their offerings. This story is part of Heat Exchange, MIT Technology Review's guest opinion series offering expert commentary on legal, political and regulatory issues related to climate change and clean energy. Can "The Simpsons" really predict the future?
The great climate paradox: Drop in air pollution has INCREASED global warming by making clouds less reflective, scientists warn
New York's new mayor Zohran Mamdani tells Trump'I have four words for you' in blistering victory speech quoting his socialist hero, bragging about'toppling a dynasty' and promising a'new dawn' Driver screaming'Allahu Akbar' ploughs in to pedestrians'trying to hit everyone he encountered' on French holiday island leaving ten injured This Leftist election landslide was caused by the same vile disease that's triggered a GOP civil war. Amazon signals it's finally fed up with Whole Foods' sluggish sales - and is making sweeping, controversial changes Why Mamdani's socialist revolution in New York has sparked a civil war for Democrats... and Trump is secretly loving it Simone Biles details all the plastic surgery she's had after her boob job this summer Inside Kate and William's forever home: Princess is kitting out Forest Lodge in her preferred'classic contemporary style' to create a'lovely but absolutely inoffensive' look REVEALED: Fattest states in America ranked... including region where three-quarters of residents are obese Now he's dead, here's the full story of what happened that day... and the ghastly aftermath no one knows about Shocking moment Mexico's president is groped by man who grabs her breasts and tries to kiss her Miss Universe contestant called'dumb' in humiliating dressing-down by official hits back with powerful speech as furious organisers condemn her treatment and he issues grovelling apology Hollywood A-listers may be blacklisted for'antisemitism' under Paramount's new anti-woke leadership Nepo baby turns heads at Glamour Women Of The Year Awards in a glitzy gold sequin feathered gown - but can YOU guess who her A-list mother is? New footage reveals the moments before football manager collapsed and died mid-match, leaving his players in disbelief, as it emerges he'complained about fish he had eaten' hours before Texas teen'tears masterpiece from wall at the Met in unhinged meltdown' before being handed in by his MOTHER Scientists have been faced with a huge dilemma, as research reveals that reducing air pollution has increased global warming . While smog kills millions of people every year, it also whitens clouds - making them more reflective. So by slashing air pollution, we're inadvertently diminishing the brightness of clouds, which are key regulators of global temperature.