AITopics | doe

Collaborating Authors

doe

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

US government wants to have a useful quantum computer by 2028

New ScientistJun-29-2026, 18:00:11 GMT

The US government wants to get hold of a quantum computer good enough to contribute to scientific breakthroughs in just two years. It will use it to try to accelerate the research and development of new materials, pharmaceuticals and molecules useful in agriculture and manufacturing. Once a dream of theoretical physicists, quantum computers are now undoubtedly real, but have yet to prove unambiguously useful or to have broad commercial value. Their computational power depends on their size - how many components called qubits they comprise - and how reliable they are. Existing devices are still too small and too error-prone.

artificial intelligence, quantum computer, social media, (15 more...)

New Scientist

Country: North America > United States (1.00)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Multi-fidelity approaches for general constrained Bayesian optimization with application to aircraft design

Cordelier, Oihan, Diouane, Youssef, Bartoli, Nathalie, Laurendeau, Eric

arXiv.org Machine LearningApr-1-2026

Aircraft design relies heavily on solving challenging and computationally expensive Multidisciplinary Design Optimization problems. In this context, there has been growing interest in multi-fidelity models for Bayesian optimization to improve the MDO process by balancing computational cost and accuracy through the combination of high- and low-fidelity simulation models, enabling efficient exploration of the design process at a minimal computational effort. In the existing literature, fidelity selection focuses only on the objective function to decide how to integrate multiple fidelity levels, balancing precision and computational cost using variance reduction criteria. In this work, we propose novel multi-fidelity selection strategies. Specifically, we demonstrate how incorporating information from both the objective and the constraints can further reduce computational costs without compromising the optimality of the solution. We validate the proposed multi-fidelity optimization strategy by applying it to four analytical test cases, showcasing its effectiveness. The proposed method is used to efficiently solve a challenging aircraft wing aero-structural design problem. The proposed setting uses a linear vortex lattice method and a finite element method for the aerodynamic and structural analysis respectively. We show that employing our proposed multi-fidelity approach leads to $86\%$ to $200\%$ more constraint compliant solutions given a limited budget compared to the state-of-the-art approach.

artificial intelligence, modeling & simulation, optimization, (17 more...)

arXiv.org Machine Learning

2603.28987

Country:

North America > United States > Virginia (0.04)
North America > Canada (0.04)
Europe > Germany (0.04)
(2 more...)

Genre: Research Report (0.69)

Industry: Aerospace & Defense > Aircraft (0.34)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.90)

Add feedback

Nvidia will build AI supercomputers for US Department of Energy

Al JazeeraOct-28-2025, 20:06:25 GMT

Nvidia, the artificial intelligence (AI) chip leader, will build seven new supercomputers for the United States Department of Energy (DOE), CEO Jensen Huang has said. The company has $500bn in bookings for its AI chips, Huang said on Tuesday in a keynote address at the company's GTC event in Washington, DC, the US capital. It is striking deals around the world while also navigating a US-China trade war that could determine which country's technology is most used across the globe. Investors are looking for clarity on what chips the tech company will be able to sell to the vast Chinese market, but Huang in his keynote speech praised policies by US President Donald Trump while announcing new products and deals. These included network technology that will let Nvidia AI chips work with quantum computers.

huang, nvidia, supercomputer, (14 more...)

Al Jazeera

Country:

Asia > China (0.52)
North America > United States > District of Columbia > Washington (0.26)
South America (0.05)
(6 more...)

Genre: Press Release (0.35)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.30)

Add feedback

How much data do you need? Part 2: Predicting DL class specific training dataset sizes

Mühlenstädt, Thomas, Frtunikj, Jelena

arXiv.org Machine LearningMar-10-2024

This paper targets the question of predicting machine learning classification model performance, when taking into account the number of training examples per class and not just the overall number of training examples. This leads to the a combinatorial question, which combinations of number of training examples per class should be considered, given a fixed overall training dataset size. In order to solve this question, an algorithm is suggested which is motivated from special cases of space filling design of experiments. The resulting data are modeled using models like powerlaw curves and similar models, extended like generalized linear models i.e. by replacing the overall training dataset size by a parametrized linear combination of the number of training examples per label class. The proposed algorithm has been applied on the CIFAR10 and the EMNIST datasets.

dataset, experiment, training dataset size, (15 more...)

arXiv.org Machine Learning

2403.06311

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Add feedback

High-dimensional mixed-categorical Gaussian processes with application to multidisciplinary design optimization for a green aircraft

Saves, Paul, Diouane, Youssef, Bartoli, Nathalie, Lefebvre, Thierry, Morlier, Joseph

arXiv.org Machine LearningNov-10-2023

Multidisciplinary design optimization (MDO) methods aim at adapting numerical optimization techniques to the design of engineering systems involving multiple disciplines. In this context, a large number of mixed continuous, integer, and categorical variables might arise during the optimization process, and practical applications involve a significant number of design variables. Recently, there has been a growing interest in mixed-categorical metamodels based on Gaussian Process (GP) for Bayesian optimization. In particular, to handle mixed-categorical variables, several existing approaches employ different strategies to build the GP. These strategies either use continuous kernels, such as the continuous relaxation or the Gower distance-based kernels, or direct estimation of the correlation matrix, such as the exponential homoscedastic hypersphere (EHH) or the Homoscedastic Hypersphere (HH) kernel. Although the EHH and HH kernels are shown to be very efficient and lead to accurate GPs, they are based on a large number of hyperparameters. In this paper, we address this issue by constructing mixed-categorical GPs with fewer hyperparameters using Partial Least Squares (PLS) regression. Our goal is to generalize Kriging with PLS, commonly used for continuous inputs, to handle mixed-categorical inputs. The proposed method is implemented in the open-source software SMT and has been efficiently applied to structural and multidisciplinary applications. Our method is used to effectively demonstrate the structural behavior of a cantilever beam and facilitates MDO of a green aircraft, resulting in a 439-kilogram reduction in the amount of fuel consumed during a single aircraft mission.

artificial intelligence, machine learning, optimization, (17 more...)

arXiv.org Machine Learning

2311.0613

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.67)

Industry: Transportation > Air (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Modeling & Simulation (0.86)

Add feedback

Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization

Hua, Hang, Li, Xingjian, Dou, Dejing, Xu, Cheng-Zhong, Luo, Jiebo

arXiv.org Artificial IntelligenceNov-8-2023

The advent of large-scale pre-trained language models has contributed greatly to the recent progress in natural language processing. Many state-of-the-art language models are first trained on a large text corpus and then fine-tuned on downstream tasks. Despite its recent success and wide adoption, fine-tuning a pre-trained language model often suffers from overfitting, which leads to poor generalizability due to the extremely high complexity of the model and the limited training samples from downstream tasks. To address this problem, we propose a novel and effective fine-tuning framework, named Layerwise Noise Stability Regularization (LNSR). Specifically, we propose to inject the standard Gaussian noise or In-manifold noise and regularize hidden representations of the fine-tuned model. We first provide theoretical analyses to support the efficacy of our method. We then demonstrate the advantages of the proposed method over other state-of-the-art algorithms including L2-SP, Mixout and SMART. While these previous works only verify the effectiveness of their methods on relatively simple text classification tasks, we also verify the effectiveness of our method on question answering tasks, where the target problem is much more difficult and more training examples are available. Furthermore, extensive experimental results indicate that the proposed algorithm can not only enhance the in-domain performance of the language models but also improve the domain generalization performance on out-of-domain data.

arxiv, png, ps 2206, (5 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TNNLS.2023.3330926

2206.05658

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
Europe > United Kingdom > England (0.05)

Genre: Research Report (0.76)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.53)

Add feedback

Automatic Counterfactual Augmentation for Robust Text Classification Based on Word-Group Search

Song, Rui, Giunchiglia, Fausto, Li, Yingji, Xu, Hao

arXiv.org Artificial IntelligenceJun-30-2023

Despite large-scale pre-trained language models have achieved striking results for text classificaion, recent work has raised concerns about the challenge of shortcut learning. In general, a keyword is regarded as a shortcut if it creates a superficial association with the label, resulting in a false prediction. Conversely, shortcut learning can be mitigated if the model relies on robust causal features that help produce sound predictions. To this end, many studies have explored post-hoc interpretable methods to mine shortcuts and causal features for robustness and generalization. However, most existing methods focus only on single word in a sentence and lack consideration of word-group, leading to wrong causal features. To solve this problem, we propose a new Word-Group mining approach, which captures the causal effect of any keyword combination and orders the combinations that most affect the prediction. Our approach bases on effective post-hoc analysis and beam search, which ensures the mining effect and reduces the complexity. Then, we build a counterfactual augmentation method based on the multiple word-groups, and use an adaptive voting mechanism to learn the influence of different augmentated samples on the prediction results, so as to force the model to pay attention to effective causal features. We demonstrate the effectiveness of the proposed method by several tasks on 8 affective review datasets and 4 toxic language datasets, including cross-domain text classificaion, text attack and gender fairness test.

artificial intelligence, automatic counterfactual augmentation, natural language, (6 more...)

arXiv.org Artificial Intelligence

2307.01214

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.07)
Europe > United Kingdom > England (0.07)

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)

Add feedback

Perspectives on AI Architectures and Co-design for Earth System Predictability

Mudunuru, Maruti K., Ang, James A., Halappanavar, Mahantesh, Hammond, Simon D., Gokhale, Maya B., Hoe, James C., Krishna, Tushar, Sreepathi, Sarat S., Norman, Matthew R., Peng, Ivy B., Jones, Philip W.

arXiv.org Artificial IntelligenceApr-7-2023

Recently, the U.S. Department of Energy (DOE), Office of Science, Biological and Environmental Research (BER), and Advanced Scientific Computing Research (ASCR) programs organized and held the Artificial Intelligence for Earth System Predictability (AI4ESP) workshop series. From this workshop, a critical conclusion that the DOE BER and ASCR community came to is the requirement to develop a new paradigm for Earth system predictability focused on enabling artificial intelligence (AI) across the field, lab, modeling, and analysis activities, called ModEx. The BER's `Model-Experimentation', ModEx, is an iterative approach that enables process models to generate hypotheses. The developed hypotheses inform field and laboratory efforts to collect measurement and observation data, which are subsequently used to parameterize, drive, and test model (e.g., process-based) predictions. A total of 17 technical sessions were held in this AI4ESP workshop series. This paper discusses the topic of the `AI Architectures and Co-design' session and associated outcomes. The AI Architectures and Co-design session included two invited talks, two plenary discussion panels, and three breakout rooms that covered specific topics, including: (1) DOE HPC Systems, (2) Cloud HPC Systems, and (3) Edge computing and Internet of Things (IoT). We also provide forward-looking ideas and perspectives on potential research in this co-design area that can be achieved by synergies with the other 16 session topics. These ideas include topics such as: (1) reimagining co-design, (2) data acquisition to distribution, (3) heterogeneous HPC solutions for integration of AI/ML and other data analytics like uncertainty quantification with earth system modeling and simulation, and (4) AI-enabled sensor integration into earth system measurements and observations. Such perspectives are a distinguishing aspect of this paper.

ai architecture, data mining, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2304.03748

Country:

North America > United States > District of Columbia > Washington (0.14)
North America > United States > Tennessee > Anderson County > Oak Ridge (0.04)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
(8 more...)

Genre:

Research Report (0.84)
Instructional Material > Course Syllabus & Notes (0.68)

Industry:

Information Technology > Services (1.00)
Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Energy (1.00)

Technology:

Information Technology > Scientific Computing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Networks (1.00)
(2 more...)

Add feedback

Out-of-distribution Detection with Implicit Outlier Transformation

Wang, Qizhou, Ye, Junjie, Liu, Feng, Dai, Quanyu, Kalander, Marcus, Liu, Tongliang, Hao, Jianye, Han, Bo

arXiv.org Artificial IntelligenceMar-8-2023

Outlier exposure (OE) is powerful in out-of-distribution (OOD) detection, enhancing detection capability via model fine-tuning with surrogate OOD data. Thus, the performance of OE, when facing unseen OOD data, can be weakened. To address this issue, we propose a novel OE-based approach that makes the model perform well for unseen OOD situations, even for unseen OOD cases. It leads to a min-max learning scheme--searching to synthesize OOD data that leads to worst judgments and learning from such OOD data for uniform performance in OOD detection. In our realization, these worst OOD data are synthesized by transforming original surrogate ones. Specifically, the associated transform functions are learned implicitly based on our novel insight that model perturbation leads to data transformation. Our methodology offers an efficient way of synthesizing OOD data, which can further benefit the detection model, besides the surrogate OOD data. We conduct extensive experiments under various OOD detection setups, demonstrating the effectiveness of our method against its advanced counterparts. The code is publicly available at: github.com/qizhouwang/doe. Deep learning systems in the open world often encounter out-of-distribution (OOD) data whose label space is disjoint with that of the in-distribution (ID) samples. For many safety-critical applications, deep models should make reliable predictions for ID data, while OOD cases (Bulusu et al., 2020) should be reported as anomalies. It leads to the well-known OOD detection problem (Lee et al., 2018c; Fang et al., 2022), which has attracted intensive attention in reliable machine learning. OOD detection remains non-trivial since deep models can be over-confident when facing OOD data (Nguyen et al., 2015; Bendale & Boult, 2016), and many efforts have been made in pursuing reliable detection models (Yang et al., 2021; Salehi et al., 2021).

data mining, machine learning, ood data, (20 more...)

arXiv.org Artificial Intelligence

2303.05033

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > Jordan (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

Add feedback

Unfolding-Aided Bootstrapped Phase Retrieval in Optical Imaging

Pinilla, Samuel, Mishra, Kumar Vijay, Shevkunov, Igor, Soltanalian, Mojtaba, Katkovnik, Vladimir, Egiazarian, Karen

arXiv.org Artificial IntelligenceOct-9-2022

Phase retrieval in optical imaging refers to the recovery of a complex signal from phaseless data acquired in the form of its diffraction patterns. These patterns are acquired through a system with a coherent light source that employs a diffractive optical element (DOE) to modulate the scene resulting in coded diffraction patterns at the sensor. Recently, the hybrid approach of model-driven network or deep unfolding has emerged as an effective alternative to conventional model-based and learning-based phase retrieval techniques because it allows for bounding the complexity of algorithms while also retaining their efficacy. Additionally, such hybrid approaches have shown promise in improving the design of DOEs that follow theoretical uniqueness conditions. There are opportunities to exploit novel experimental setups and resolve even more complex DOE phase retrieval applications. This paper presents an overview of algorithms and applications of deep unfolding for bootstrapped - regardless of near, middle, and far zones - phase retrieval.

artificial intelligence, machine learning, phase retrieval, (18 more...)

arXiv.org Artificial Intelligence

2203.01695

Country:

Europe > Finland > Pirkanmaa > Tampere (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
(3 more...)

Genre:

Overview (0.68)
Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Sensing and Signal Processing (0.68)
Information Technology > Artificial Intelligence > Vision (0.68)

Add feedback