doe
Nvidia will build AI supercomputers for US Department of Energy
Nvidia, the artificial intelligence (AI) chip leader, will build seven new supercomputers for the United States Department of Energy (DOE), CEO Jensen Huang has said. The company has $500bn in bookings for its AI chips, Huang said on Tuesday in a keynote address at the company's GTC event in Washington, DC, the US capital. It is striking deals around the world while also navigating a US-China trade war that could determine which country's technology is most used across the globe. Investors are looking for clarity on what chips the tech company will be able to sell to the vast Chinese market, but Huang in his keynote speech praised policies by US President Donald Trump while announcing new products and deals. These included network technology that will let Nvidia AI chips work with quantum computers.
- Asia > China (0.52)
- North America > United States > District of Columbia > Washington (0.26)
- South America (0.05)
- (6 more...)
- Government > Regional Government > North America Government > United States Government (1.00)
- Energy (1.00)
How much data do you need? Part 2: Predicting DL class specific training dataset sizes
Mühlenstädt, Thomas, Frtunikj, Jelena
This paper targets the question of predicting machine learning classification model performance, when taking into account the number of training examples per class and not just the overall number of training examples. This leads to the a combinatorial question, which combinations of number of training examples per class should be considered, given a fixed overall training dataset size. In order to solve this question, an algorithm is suggested which is motivated from special cases of space filling design of experiments. The resulting data are modeled using models like powerlaw curves and similar models, extended like generalized linear models i.e. by replacing the overall training dataset size by a parametrized linear combination of the number of training examples per label class. The proposed algorithm has been applied on the CIFAR10 and the EMNIST datasets.
High-dimensional mixed-categorical Gaussian processes with application to multidisciplinary design optimization for a green aircraft
Saves, Paul, Diouane, Youssef, Bartoli, Nathalie, Lefebvre, Thierry, Morlier, Joseph
Multidisciplinary design optimization (MDO) methods aim at adapting numerical optimization techniques to the design of engineering systems involving multiple disciplines. In this context, a large number of mixed continuous, integer, and categorical variables might arise during the optimization process, and practical applications involve a significant number of design variables. Recently, there has been a growing interest in mixed-categorical metamodels based on Gaussian Process (GP) for Bayesian optimization. In particular, to handle mixed-categorical variables, several existing approaches employ different strategies to build the GP. These strategies either use continuous kernels, such as the continuous relaxation or the Gower distance-based kernels, or direct estimation of the correlation matrix, such as the exponential homoscedastic hypersphere (EHH) or the Homoscedastic Hypersphere (HH) kernel. Although the EHH and HH kernels are shown to be very efficient and lead to accurate GPs, they are based on a large number of hyperparameters. In this paper, we address this issue by constructing mixed-categorical GPs with fewer hyperparameters using Partial Least Squares (PLS) regression. Our goal is to generalize Kriging with PLS, commonly used for continuous inputs, to handle mixed-categorical inputs. The proposed method is implemented in the open-source software SMT and has been efficiently applied to structural and multidisciplinary applications. Our method is used to effectively demonstrate the structural behavior of a cantilever beam and facilitates MDO of a green aircraft, resulting in a 439-kilogram reduction in the amount of fuel consumed during a single aircraft mission.
- Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization
Hua, Hang, Li, Xingjian, Dou, Dejing, Xu, Cheng-Zhong, Luo, Jiebo
The advent of large-scale pre-trained language models has contributed greatly to the recent progress in natural language processing. Many state-of-the-art language models are first trained on a large text corpus and then fine-tuned on downstream tasks. Despite its recent success and wide adoption, fine-tuning a pre-trained language model often suffers from overfitting, which leads to poor generalizability due to the extremely high complexity of the model and the limited training samples from downstream tasks. To address this problem, we propose a novel and effective fine-tuning framework, named Layerwise Noise Stability Regularization (LNSR). Specifically, we propose to inject the standard Gaussian noise or In-manifold noise and regularize hidden representations of the fine-tuned model. We first provide theoretical analyses to support the efficacy of our method. We then demonstrate the advantages of the proposed method over other state-of-the-art algorithms including L2-SP, Mixout and SMART. While these previous works only verify the effectiveness of their methods on relatively simple text classification tasks, we also verify the effectiveness of our method on question answering tasks, where the target problem is much more difficult and more training examples are available. Furthermore, extensive experimental results indicate that the proposed algorithm can not only enhance the in-domain performance of the language models but also improve the domain generalization performance on out-of-domain data.
- North America > United States > Georgia > Fulton County > Atlanta (0.05)
- Europe > United Kingdom > England (0.05)
Automatic Counterfactual Augmentation for Robust Text Classification Based on Word-Group Search
Song, Rui, Giunchiglia, Fausto, Li, Yingji, Xu, Hao
Despite large-scale pre-trained language models have achieved striking results for text classificaion, recent work has raised concerns about the challenge of shortcut learning. In general, a keyword is regarded as a shortcut if it creates a superficial association with the label, resulting in a false prediction. Conversely, shortcut learning can be mitigated if the model relies on robust causal features that help produce sound predictions. To this end, many studies have explored post-hoc interpretable methods to mine shortcuts and causal features for robustness and generalization. However, most existing methods focus only on single word in a sentence and lack consideration of word-group, leading to wrong causal features. To solve this problem, we propose a new Word-Group mining approach, which captures the causal effect of any keyword combination and orders the combinations that most affect the prediction. Our approach bases on effective post-hoc analysis and beam search, which ensures the mining effect and reduces the complexity. Then, we build a counterfactual augmentation method based on the multiple word-groups, and use an adaptive voting mechanism to learn the influence of different augmentated samples on the prediction results, so as to force the model to pay attention to effective causal features. We demonstrate the effectiveness of the proposed method by several tasks on 8 affective review datasets and 4 toxic language datasets, including cross-domain text classificaion, text attack and gender fairness test.
- North America > United States > Georgia > Fulton County > Atlanta (0.07)
- Europe > United Kingdom > England (0.07)
Perspectives on AI Architectures and Co-design for Earth System Predictability
Mudunuru, Maruti K., Ang, James A., Halappanavar, Mahantesh, Hammond, Simon D., Gokhale, Maya B., Hoe, James C., Krishna, Tushar, Sreepathi, Sarat S., Norman, Matthew R., Peng, Ivy B., Jones, Philip W.
Recently, the U.S. Department of Energy (DOE), Office of Science, Biological and Environmental Research (BER), and Advanced Scientific Computing Research (ASCR) programs organized and held the Artificial Intelligence for Earth System Predictability (AI4ESP) workshop series. From this workshop, a critical conclusion that the DOE BER and ASCR community came to is the requirement to develop a new paradigm for Earth system predictability focused on enabling artificial intelligence (AI) across the field, lab, modeling, and analysis activities, called ModEx. The BER's `Model-Experimentation', ModEx, is an iterative approach that enables process models to generate hypotheses. The developed hypotheses inform field and laboratory efforts to collect measurement and observation data, which are subsequently used to parameterize, drive, and test model (e.g., process-based) predictions. A total of 17 technical sessions were held in this AI4ESP workshop series. This paper discusses the topic of the `AI Architectures and Co-design' session and associated outcomes. The AI Architectures and Co-design session included two invited talks, two plenary discussion panels, and three breakout rooms that covered specific topics, including: (1) DOE HPC Systems, (2) Cloud HPC Systems, and (3) Edge computing and Internet of Things (IoT). We also provide forward-looking ideas and perspectives on potential research in this co-design area that can be achieved by synergies with the other 16 session topics. These ideas include topics such as: (1) reimagining co-design, (2) data acquisition to distribution, (3) heterogeneous HPC solutions for integration of AI/ML and other data analytics like uncertainty quantification with earth system modeling and simulation, and (4) AI-enabled sensor integration into earth system measurements and observations. Such perspectives are a distinguishing aspect of this paper.
- North America > United States > District of Columbia > Washington (0.14)
- North America > United States > Tennessee > Anderson County > Oak Ridge (0.04)
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
- (8 more...)
- Research Report (0.84)
- Instructional Material > Course Syllabus & Notes (0.68)
- Information Technology > Services (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Energy (1.00)
- Information Technology > Scientific Computing (1.00)
- Information Technology > Data Science > Data Mining (1.00)
- Information Technology > Communications > Networks (1.00)
- (2 more...)
Out-of-distribution Detection with Implicit Outlier Transformation
Wang, Qizhou, Ye, Junjie, Liu, Feng, Dai, Quanyu, Kalander, Marcus, Liu, Tongliang, Hao, Jianye, Han, Bo
Outlier exposure (OE) is powerful in out-of-distribution (OOD) detection, enhancing detection capability via model fine-tuning with surrogate OOD data. Thus, the performance of OE, when facing unseen OOD data, can be weakened. To address this issue, we propose a novel OE-based approach that makes the model perform well for unseen OOD situations, even for unseen OOD cases. It leads to a min-max learning scheme--searching to synthesize OOD data that leads to worst judgments and learning from such OOD data for uniform performance in OOD detection. In our realization, these worst OOD data are synthesized by transforming original surrogate ones. Specifically, the associated transform functions are learned implicitly based on our novel insight that model perturbation leads to data transformation. Our methodology offers an efficient way of synthesizing OOD data, which can further benefit the detection model, besides the surrogate OOD data. We conduct extensive experiments under various OOD detection setups, demonstrating the effectiveness of our method against its advanced counterparts. The code is publicly available at: github.com/qizhouwang/doe. Deep learning systems in the open world often encounter out-of-distribution (OOD) data whose label space is disjoint with that of the in-distribution (ID) samples. For many safety-critical applications, deep models should make reliable predictions for ID data, while OOD cases (Bulusu et al., 2020) should be reported as anomalies. It leads to the well-known OOD detection problem (Lee et al., 2018c; Fang et al., 2022), which has attracted intensive attention in reliable machine learning. OOD detection remains non-trivial since deep models can be over-confident when facing OOD data (Nguyen et al., 2015; Bendale & Boult, 2016), and many efforts have been made in pursuing reliable detection models (Yang et al., 2021; Salehi et al., 2021).
- North America > Canada > Ontario > Toronto (0.14)
- Asia > Middle East > Jordan (0.04)
- Asia > China > Hong Kong (0.04)
Unfolding-Aided Bootstrapped Phase Retrieval in Optical Imaging
Pinilla, Samuel, Mishra, Kumar Vijay, Shevkunov, Igor, Soltanalian, Mojtaba, Katkovnik, Vladimir, Egiazarian, Karen
Phase retrieval in optical imaging refers to the recovery of a complex signal from phaseless data acquired in the form of its diffraction patterns. These patterns are acquired through a system with a coherent light source that employs a diffractive optical element (DOE) to modulate the scene resulting in coded diffraction patterns at the sensor. Recently, the hybrid approach of model-driven network or deep unfolding has emerged as an effective alternative to conventional model-based and learning-based phase retrieval techniques because it allows for bounding the complexity of algorithms while also retaining their efficacy. Additionally, such hybrid approaches have shown promise in improving the design of DOEs that follow theoretical uniqueness conditions. There are opportunities to exploit novel experimental setups and resolve even more complex DOE phase retrieval applications. This paper presents an overview of algorithms and applications of deep unfolding for bootstrapped - regardless of near, middle, and far zones - phase retrieval.
- Europe > Finland > Pirkanmaa > Tampere (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Maryland > Prince George's County > Adelphi (0.04)
- (3 more...)
- Overview (0.68)
- Research Report (0.50)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
- Information Technology > Sensing and Signal Processing (0.68)
- Information Technology > Artificial Intelligence > Vision (0.68)
Energy Department launches council to coordinate AI activities
The Department of Energy established the Artificial Intelligence Advancement Council earlier this month to coordinate funding and development of algorithms and hold agencies accountable for how they are used. A lean team consisting of five members, AIAC will quickly approve task forces, implementation plans and organizational changes for the AI & Technology Office, DOE Executive Secretariat, and AI Program Committee to execute. DOE stood up the Responsible and Trustworthy AI Task Force ahead of AIAC's first meeting, tentatively planned for June, to suggest departmental principles and practices, particularly around equity and ethics. "We are very mindful of the fact that there are activities and initiatives that are underway, as well as initiatives that should be considered, in AI space," AITO Director Pamela Isom told FedScoop. "And we're also needing the support at the level that the council brings."
- Energy (1.00)
- Government > Regional Government (0.39)
How The US Department Of Energy Is Transforming AI
The US Department of Energy (DOE) has long stood out as one of the most science, technology, and innovation-focused US federal agencies. It should come as little surprise then that the DOE continues to invest in transformative technology such as artificial intelligence and machine learning. The DOE established the Artificial Intelligence and Technology (AITO) office to help transform the DOE into a world leading Artificial Intelligence (AI) enterprise by accelerating the research, development, delivery, and adoption of AI. Pamela Isom, the new Director of the AITO, will be presenting at the February 2022 AI in Government event to share how they are maximizing the impacts of AI through strategic coordination, planning, and customer service excellence. In this interview article Ms. Isom goes into greater detail about how the DOE is leveraging data, and transformative technologies to help advance the agency's core missions.
- Government > Regional Government > North America Government > United States Government (1.00)
- Energy (1.00)