Energy
CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization
Machine learning solutions are rapidly adopted to enable a variety of key use cases, from conversational AI assistants to scientific discovery. As the adoption of machine learning models becomes increasingly prevalent, the associated lifecycle carbon footprint is expected to increase, including both from training and inference and from AI hardware manufacturing. We introduce CATransformers, the first carbon-aware co-optimization framework for Transformer-based models and hardware accelerators. By integrating both operational and embodied carbon into early-stage design space exploration, CATransformers enables sustainability-driven model architecture and hardware accelerator co-design that reveals fundamentally different trade-offs than latency-or energy-centric approaches. Evaluated across a range of Transformer models, CATransformers consistently demonstrates the potential to reduce total carbon emissions --by up to 30\% -- while maintaining accuracy and latency. We further highlight its extensibility through a focused case study on multi-modal models. Our results emphasize the need for holistic optimization methods that prioritize carbon efficiency without compromising model capability and execution time performance.
Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies
Reinforcement learning (RL) systems have countless applications, from energy-grid management to protein design. However, such real-world scenarios are often extremely difficult, combinatorial in nature, and require complex coordination between multiple agents. This level of complexity can cause even state-of-the-art RL systems, trained until convergence, to hit a performance ceiling which they are unable to break out of with zero-shot inference. Meanwhile, many digital or simulation-based applications allow for an inference phase that utilises a specific time and compute budget to explore multiple attempts before outputting a final solution. In this work, we show that such an inference phase employed at execution time, and the choice of a corresponding inference strategy, are key to breaking the performance ceiling observed in complex multi-agent RL problems. Our main result is striking: we can obtain up to a 126% and, on average, a 45% improvement over the previous state-of-the-art across 17 tasks, using only a couple seconds of extra wall-clock time during execution. We also demonstrate promising compute scaling properties, supported by over 60k experiments, making it the largest study on inference strategies for complex RL to date. We make all of our experimental data and code available.
OpenAI says China-based actors stoking opposition to AI data centres
China-based actors are likely behind the use of ChatGPT for "covert influence operations" aimed at stoking opposition to data centres in the United States, OpenAI has said. In a research report released on Wednesday, the company behind the world's most popular AI chatbot said it had banned a cluster of accounts likely based in China for attempting to "manipulate a legitimate debate about American AI". Among other content, the accounts generated a comic strip showing a cigar-chomping businessman holding bags marked with dollar signs as a family reacted in shock to their electricity bill, according to the San Francisco-based company. OpenAI said a second cluster of accounts had generated content casting US tariffs as an effort to "dominate technological competition" with China, and specified that the material should not mention Chinese leader Xi Jinping. While the campaign sought to "exploit and amplify existing public concerns" about energy prices, OpenAI found no evidence that it had a "meaningful" influence, the company said.
Fireworks illuminate Barcelona's Sagrada Famรญlia during Pope visit
Pope Leo XIV has described Barcelona's Sagrada Famรญlia as a masterpiece of stones, colours and light as he inaugurated its newest - and tallest - tower. The giant Tower of Jesus Christ, completed in February, has brought the church to a soaring height of 172.5m (566ft) - cementing it as the tallest church in the world. His visit to the iconic basilica also marks 100 years since the death of its architect, Antoni Gaudรญ. Among those attending the service were Spanish royals King Felipe VI and Queen Letizia, as well as Prime Minister Pedro Sรกnchez. The pope's week-long visit to Spain, which began on Saturday, is the first by a pope in some 15 years.
China Opens World's First Wind-Powered Underwater Data Center
With an initial capacity of 24 megawatts, the innovative data center uses seawater as a natural cooling system. China is submerging data centers into the ocean to keep them cool.Photograph: Shanghai Hailanyun Technology China has become the first country in the world to operate an underwater data center, or UDC, powered by wind. Located off the coast of Shanghai, the complex represents a significant advance in the country's strategy to secure energy supplies in the face of the accelerated growth of artificial intelligence, reduce dependence on fossil fuels, and reduce the environmental impact of its technology infrastructure. The initiative is the result of a collaboration between private company HiCloud Technology and state-owned China Communications Construction, which involved an investment of 1.6 billion yuan, equivalent to about $236 million. With an initial capacity of 24 megawatts, the facility is submerged at a depth of 10 meters in the Lin-gang Special Zone, within the China Pilot Free Trade Zone in Shanghai.
Most New US Data Centers Are Slated for Drought-Plagued Areas
To meet this moment, we need YOU. For five decades, has been exposing the corruption that the powerful would rather keep buried. That fight for the truth is at a pivotal point, and it takes readers like you to make it possible. To meet this moment, we need YOU. That fight for the truth is at a pivotal point, and it takes readers like you to make it possible. Amid public outcry over water-guzzling server farms, a Guardian analysis indicates trouble ahead.
CarbonGlobe: A Global-Scale, Multi-Decade Dataset and Benchmark for Carbon Forecasting in Forest Ecosystems
Forest ecosystems play a critical role in the Earth system as major carbon sinks that are essential for carbon neutralization and climate change mitigation. However, the Earth has undergone significant deforestation and forest degradation, and the remaining forested areas are also facing increasing pressures from socioeconomic factors and climate change, potentially pushing them towards tipping points.Responding to the grand challenge, a theory-based Ecosystem Demography (ED) model has been continuously developed over the past two decades and serves as a key component in major initiatives, including the Global Carbon Budget, NASA Carbon Monitoring System, and US Greenhouse Gas Center. Despite its growing importance in combating climate change and shaping carbon policies, ED's expensive computation significantly limits its ability to estimate carbon dynamics at the global scale with high spatial resolution.Recently, machine learning (ML) models have shown promising potential in approximating theory-based models with interesting success in various domains including weather forecasting, thanks to the open-source benchmark datasets made available.However, there are currently no publicly available ML-ready datasets for global carbon dynamics forecasting in forest ecosystems. The limited data availability hinders the development of corresponding ML emulators. Furthermore, the inputs needed for running ED are highly complex with over a hundred variables from various remote sensing products. To bridge the gap, we develop a new ML-ready benchmark dataset, \textit{CarbonGlobe}, for carbon dynamics forecasting, featuring that: (1) the data has a global-scale coverage at 0.5$^\circ$ resolution; (2) the temporal range spans 40 years; (3) the inputs integrate extensive multi-source data from different sensing products, with calibrated outputs from ED; (4) the data is formatted in ML-ready forms and split into different evaluation scenarios based on climate conditions, etc.; (5) a set of problem-driven metrics is designed to develop benchmarks using various ML models to best align with the needs of downstream applications.
DCcluster-Opt: Benchmarking Dynamic Multi-Objective Optimization for Geo-Distributed Data Center Workloads
The increasing energy demands and carbon footprint of large-scale AI require intelligent workload management in globally distributed data centers. Yet progress is limited by the absence of benchmarks that realistically capture the interplay of time-varying environmental factors (grid carbon intensity, electricity prices, weather), detailed data center physics (CPUs, GPUs, memory, HVAC energy), and geo-distributed network dynamics (latency and transmission costs). To bridge this gap, we present DCcluster-Opt: an open-source, high-fidelity simulation benchmark for sustainable, geo-temporal task scheduling. DCcluster-Opt combines curated real-world datasets, including AI workload traces, grid carbon intensity, electricity markets, weather across 20 global regions, cloud transmission costs, and empirical network delay parameters with physics-informed models of data center operations, enabling rigorous and reproducible research in sustainable computing. It presents a challenging scheduling problem where a top-level coordinating agent must dynamically reassign or defer tasks that arrive with resource and service-level agreement requirements across a configurable cluster of data centers to optimize multiple objectives. The environment also models advanced components such as heat recovery. A modular reward system enables an explicit study of trade-offs among carbon emissions, energy costs, service level agreements, and water use. It provides a Gymnasium API with baseline controllers, including reinforcement learning and rule-based strategies, to support reproducible ML research and a fair comparison of diverse algorithms. By offering a realistic, configurable, and accessible testbed, DCcluster-Opt accelerates the development and validation of next-generation sustainable computing solutions for geo-distributed data centers.
GM Wants Your Electric Car to Power Your House--and Your Neighborhood
The automaker today is turning on vehicle-to-grid charging for its GM Energy customers. Will people actually use it? Some 250,000 electric vehicles manufactured by General Motors are driving around the US today--right now!--with an oft-secret capability: Their big, powerful batteries can charge other things. Potentially appliances, homes, and now, thanks to a software update pushed by the automaker this week, an electrical grid . Twelve of GM's EVs have this "bidirectional charging" capability, way more than US competitors' battery-electrics.
Design tweaks promote responsible AI use for environmental protection, research shows
Artificial intelligence systems that ask users to pause to consider AI's energy consumption and environmental impacts are likely to reduce unnecessary AI use, new research by Oregon State University suggests. The findings, published in Science Communication, are important as AI is already using electricity on scales that can be meaningfully compared to households, factories and towns. For example, the electricity needed to train a large language model would power 120 homes for a year, the researchers note; one AI-generated image has roughly the same energy cost as charging a smartphone. With about 85% of the world's energy still coming from fossil fuels, every megawatt-hour that can be carved from AI's electricity profile is significant, says the study's leader, Cheng "Chris" Chen of the OSU College of Liberal Arts. "Despite AI's substantial environmental impacts, information about those impacts is rarely disclosed or effectively communicated to everyday users of AI systems," said Chen, assistant professor in the School of Communication.