Energy
Sequence-to-sequence models for workload interference
Prats, David Buchaca, Marcual, Joan, Berral, Josep Lluís, Carrera, David
Co-scheduling of jobs in data-centers is a challenging scenario, where jobs can compete for resources yielding to severe slowdowns or failed executions. Efficient job placement on environments where resources are shared requires awareness on how jobs interfere during execution, to go far beyond ineffective resource overbooking techniques. Current techniques, most of them already involving machine learning and job modeling, are based on workload behavior summarization across time, instead of focusing on effective job requirements at each instant of the execution. In this work we propose a methodology for modeling co-scheduling of jobs on data-centers, based on their behavior towards resources and execution time, using sequence-to-sequence models based on recurrent neural networks. The goal is to forecast co-executed jobs footprint on resources along their execution time, from the profile shown by the individual jobs, to enhance resource managers and schedulers placement decisions. The methods here presented are validated using High Performance Computing benchmarks based on different frameworks (like Hadoop and Spark) and applications (CPU bound, IO bound, machine learning, SQL queries...). Experiments show that the model can correctly identify the resource usage trends from previously seen and even unseen co-scheduled jobs.
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Alshehri, Abdulelah S., Gani, Rafiqul, You, Fengqi
The optimal design of compounds through manipulating properties at the molecular level is often the key to considerable scientific advances and improved process systems performance. This paper highlights key trends, challenges, and opportunities underpinning the Computer-Aided Molecular Design (CAMD) problems. A brief review of knowledge-driven property estimation methods and solution techniques, as well as corresponding CAMD tools and applications, are first presented. In view of the computational challenges plaguing knowledge-based methods and techniques, we survey the current state-of-the-art applications of deep learning to molecular design as a fertile approach towards overcoming computational limitations and navigating uncharted territories of the chemical space. The main focus of the survey is given to deep generative modeling of molecules under various deep learning architectures and different molecular representations. Further, the importance of benchmarking and empirical rigor in building deep learning models is spotlighted. The review article also presents a detailed discussion of the current perspectives and challenges of knowledge-based and data-driven CAMD and identifies key areas for future research directions. Special emphasis is on the fertile avenue of hybrid modeling paradigm, in which deep learning approaches are exploited while leveraging the accumulated wealth of knowledge-driven CAMD methods and tools.
Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications
Perrin, Sarah, Perolat, Julien, Laurière, Mathieu, Geist, Matthieu, Elie, Romuald, Pietquin, Olivier
In this paper, we deepen the analysis of continuous time Fictitious Play learning algorithm to the consideration of various finite state Mean Field Game settings (finite horizon, $\gamma$-discounted), allowing in particular for the introduction of an additional common noise. We first present a theoretical convergence analysis of the continuous time Fictitious Play process and prove that the induced exploitability decreases at a rate $O(\frac{1}{t})$. Such analysis emphasizes the use of exploitability as a relevant metric for evaluating the convergence towards a Nash equilibrium in the context of Mean Field Games. These theoretical contributions are supported by numerical experiments provided in either model-based or model-free settings. We provide hereby for the first time converging learning dynamics for Mean Field Games in the presence of common noise.
Deep Learning's Climate Change Problem
The human brain is an incredibly efficient source of intelligence. Earlier this month, OpenAI announced it had built the biggest AI model in history. This astonishingly large model, known as GPT-3, is an impressive technical achievement. Yet it highlights a troubling and harmful trend in the field of artificial intelligence--one that has not gotten enough mainstream attention. Modern AI models consume a massive amount of energy, and these energy requirements are growing at a breathtaking rate.
Remote Sensing Scientist at Leidos in Arlington, VA
Want to be a part of an elite team where our innovative technical solutions are delivered to customers that advance the state of the art while addressing long-term problems of importance to national security? At our Leidos' Multi-Spectrum Warfare Research and Analytics Systems (MSWRAS) Division, an organization in the Leidos Innovation Center (LInC), we are looking for you, our next Scientist who specializes in remote sensing data analytics. Join our team of Ph.D. level peers in designing and developing advanced technology-based solutions for contract research and development projects working in our Arlington, VA office. Fun roles you will have in this job: Describe instances of successful, proven, and demonstrable experience contributing to the technical work as part of cross-discipline teams in the development and integration of software-based solutions for competitive, contract-based applied research programs Work with teams composed of members from industry, small businesses, and academic-based researchers and should have experience working on projects focused on multiple technical fields such as machine learning, artificial intelligence, engineering, and software development and integration Describe how the work products to which they contributed had solved customers' problems in such domains as energy, health, and national security or in the commercial sector Work within the MSWRAS Division and across the LInC, performing basic and applied contract research and development projects both leading and working under the guidance of senior scientists and engineers. Processing, interpreting and analyzing large volumes of data collected by remote sensing platforms but may also include other types of phenomenological data such as field measurements, or weather data Independently design and undertake new research as well as partner in a team environment across organizations Contribute to the development of creative and innovative R&D approaches to solving major remote sensing analytics challenges and work with potential sponsors (customers or internal champions) to secure funding for new research efforts based on those topics Contribute to the productivity of teams composed of fellow researchers, data scientists, data engineers, and software engineers to execute complex R&D programs Under the guidance of a senior scientist or engineer, design and develop or integrate secure and scalable applications that are part of broader solutions, that are applicable across multiple domains.
AI is reinventing the way we invent
Drug discovery is a hugely expensive and often frustrating process. Medicinal chemists must guess which compounds might make good medicines, using their knowledge of how a molecule's structure affects its properties. They synthesize and test countless variants, and most are failures. "Coming up with new molecules is still an art, because you have such a huge space of possibilities," says Barzilay. "It takes a long time to find good drug candidates." By speeding up this critical step, deep learning could offer far more opportunities for chemists to pursue, making drug discovery much quicker.
AI's carbon footprint problem
For all the advances enabled by artificial intelligence, from speech recognition to self-driving cars, AI systems consume a lot of power and can generate high volumes of climate-changing carbon emissions. A study last year found that training an off-the-shelf AI language-processing system produced 1,400 pounds of emissions--about the amount produced by flying one person roundtrip between New York and San Francisco. The full suite of experiments needed to build and train that AI language system from scratch can generate even more: up to 78,000 pounds, depending on the source of power. But there are ways to make machine learning cleaner and greener, a movement that has been called "Green AI." Some algorithms are less power-hungry than others, for example, and many training sessions can be moved to remote locations that get most of their power from renewable sources.
How Having Bigger AI Models Can Have A Detrimental Impact On Environment
The COVID crisis has skyrocketed the applications of artificial intelligence -- from tackling this global pandemic, to being a vital tool in managing various business processes. Despite its benefits, AI has always been scrutinised for its ethical concerns like existing biases and privacy issues. However, this technology also has some significant sustainability issues – it is known to consume a massive amount of energy, creating a negative impact on the environment. As AI technology is getting advanced in predicting weather, understanding human speech, enhancing banking payments, and revolutionising healthcare, the advanced models are not only required to be trained on large datasets, but also require massive computing power to improve its accuracy. Such heavy computing and processing consumes a tremendous amount of energy and emits carbon dioxide, which has become an environmental concern. According to a report, it has been estimated that the power required for training AI models emits approximately 626,000 pounds (284 tonnes) of carbon dioxide, which is comparatively five times the lifetime emissions of the average US car.
Iran nuclear site fire hit centrifuge facility, analysts say
Secretary of State Mike Pompeo seized on a U.N. report confirming Iranian weapons were used to attack Saudi Arabia in September and were part of an arms shipment seized months ago off Yemen's coast; State Department correspondent Rich Edson reports. A fire and an explosion struck a centrifuge production plant above Iran's underground Natanz nuclear enrichment facility early Thursday, analysts said, one of the most-tightly guarded sites in all of the Islamic Republic after earlier acts of sabotage there. The Atomic Energy Organization of Iran sought to downplay the fire, calling it an "incident" that only affected an under-construction "industrial shed," spokesman Behrouz Kamalvandi said. However, both Kamalvandi and Iranian nuclear chief Ali Akbar Salehi rushed after the fire to Natanz, a facility earlier targeted by the Stuxnet computer virus and built underground to withstand enemy airstrikes. The fire threatened to rekindle wider tensions across the Middle East, similar to the escalation in January after a U.S. drone strike killed a top Iranian general in Baghdad and Tehran launched a retaliatory ballistic missile attack targeting American forces in Iraq. While offering no cause for Thursday's blaze, Iran's state-run IRNA news agency published a commentary addressing the possibility of sabotage by enemy nations such as Israel and the U.S. following other recent explosions in the country.
Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes
Humans learn all their life long. They accumulate knowledge from a sequence of learning experiences and remember the essential concepts without forgetting what they have learned previously. Artificial neural networks struggle to learn similarly. They often rely on data rigorously preprocessed to learn solutions to specific problems such as classification or regression. In particular, they forget their past learning experiences if trained on new ones. Therefore, artificial neural networks are often inept to deal with real-life settings such as an autonomous-robot that has to learn on-line to adapt to new situations and overcome new problems without forgetting its past learning-experiences. Continual learning (CL) is a branch of machine learning addressing this type of problem. Continual algorithms are designed to accumulate and improve knowledge in a curriculum of learning-experiences without forgetting. In this thesis, we propose to explore continual algorithms with replay processes. Replay processes gather together rehearsal methods and generative replay methods. Generative Replay consists of regenerating past learning experiences with a generative model to remember them. Rehearsal consists of saving a core-set of samples from past learning experiences to rehearse them later. The replay processes make possible a compromise between optimizing the current learning objective and the past ones enabling learning without forgetting in sequences of tasks settings. We show that they are very promising methods for continual learning. Notably, they enable the re-evaluation of past data with new knowledge and the confrontation of data from different learning-experiences. We demonstrate their ability to learn continually through unsupervised learning, supervised learning and reinforcement learning tasks.