colmena
Employing Artificial Intelligence to Steer Exascale Workflows with Colmena
Ward, Logan, Pauloski, J. Gregory, Hayot-Sasson, Valerie, Babuji, Yadu, Brace, Alexander, Chard, Ryan, Chard, Kyle, Thakur, Rajeev, Foster, Ian
Computational workflows are a common class of application on supercomputers, yet the loosely coupled and heterogeneous nature of workflows often fails to take full advantage of their capabilities. We created Colmena to leverage the massive parallelism of a supercomputer by using Artificial Intelligence (AI) to learn from and adapt a workflow as it executes. Colmena allows scientists to define how their application should respond to events (e.g., task completion) as a series of cooperative agents. In this paper, we describe the design of Colmena, the challenges we overcame while deploying applications on exascale systems, and the science workflows we have enhanced through interweaving AI. The scaling challenges we discuss include developing steering strategies that maximize node utilization, introducing data fabrics that reduce communication overhead of data-intensive tasks, and implementing workflow tasks that cache costly operations between invocations. These innovations coupled with a variety of application patterns accessible through our agent-based steering model have enabled science advances in chemistry, biophysics, and materials science using different types of AI. Our vision is that Colmena will spur creative solutions that harness AI across many domains of scientific computing.
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Illinois > Cook County > Lemont (0.04)
- Asia > Middle East > Jordan (0.04)
- Workflow (1.00)
- Research Report > Promising Solution (0.34)
- Health & Medicine > Therapeutic Area (0.68)
- Government > Regional Government (0.46)
- Energy > Energy Storage (0.46)
Mexico eagerly prepares for historic first Latin American lunar mission: 'Elevates the name of our country'
The United States and China explore the lunar presence of critical minerals. Mexico will launch its first lunar mission next month, a historic step for the country and Latin America as a whole, according to officials. "This project will make history and is the first of its kind in Latin America, which elevates the name of our country, confirming once again that Mexican engineering is at the level of the best in the world," Salvador Landeros, director of the Mexican Space Agency (AEM), said in a press release. A team of scientists and nearly 250 university students developed five microrobots that the AEM will launch from Cape Canaveral, Florida, between Jan. 8 and Jan. 11 as part of project Colmena. Each robot weighs 60 grams -- a little over one-tenth of a pound -- and measures just under 5 inches in diameter.
- North America > Central America (0.48)
- North America > United States > Florida > Brevard County > Cape Canaveral (0.30)
- Asia > China (0.26)
- (3 more...)
Colmena: Scalable Machine-Learning-Based Steering of Ensemble Simulations for High Performance Computing
Ward, Logan, Sivaraman, Ganesh, Pauloski, J. Gregory, Babuji, Yadu, Chard, Ryan, Dandu, Naveen, Redfern, Paul C., Assary, Rajeev S., Chard, Kyle, Curtiss, Larry A., Thakur, Rajeev, Foster, Ian
Scientific applications that involve simulation ensembles can be accelerated greatly by using experiment design methods to select the best simulations to perform. Methods that use machine learning (ML) to create proxy models of simulations show particular promise for guiding ensembles but are challenging to deploy because of the need to coordinate dynamic mixes of simulation and learning tasks. We present Colmena, an open-source Python framework that allows users to steer campaigns by providing just the implementations of individual tasks plus the logic used to choose which tasks to execute when. Colmena handles task dispatch, results collation, ML model invocation, and ML model (re)training, using Parsl to execute tasks on HPC systems. We describe the design of Colmena and illustrate its capabilities by applying it to electrolyte design, where it both scales to 65536 CPUs and accelerates the discovery rate for high-performance molecules by a factor of 100 over unguided searches.
- North America > United States > Illinois > Cook County > Chicago (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Illinois > Cook County > Lemont (0.04)
- North America > United States > California > San Diego County > Carlsbad (0.04)
- Workflow (0.95)
- Research Report > New Finding (0.67)