AITopics | Tartakovsky, Daniel M.

Collaborating Authors

Tartakovsky, Daniel M.

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Transfer Learning on Multi-Dimensional Data: A Novel Approach to Neural Network-Based Surrogate Modeling

Propp, Adrienne M., Tartakovsky, Daniel M.

arXiv.org Artificial IntelligenceJan-11-2025

The development of efficient surrogates for partial differential equations (PDEs) is a critical step towards scalable modeling of complex, multiscale systems-of-systems. Convolutional neural networks (CNNs) have gained popularity as the basis for such surrogate models due to their success in capturing high-dimensional input-output mappings and the negligible cost of a forward pass. However, the high cost of generating training data -- typically via classical numerical solvers -- raises the question of whether these models are worth pursuing over more straightforward alternatives with well-established theoretical foundations, such as Monte Carlo methods. To reduce the cost of data generation, we propose training a CNN surrogate model on a mixture of numerical solutions to both the $d$-dimensional problem and its ($d-1$)-dimensional approximation, taking advantage of the efficiency savings guaranteed by the curse of dimensionality. We demonstrate our approach on a multiphase flow test problem, using transfer learning to train a dense fully-convolutional encoder-decoder CNN on the two classes of data. Numerical results from a sample uncertainty quantification task demonstrate that our surrogate model outperforms Monte Carlo with several times the data generation budget.

artificial intelligence, machine learning, neural network-based surrogate modeling, (3 more...)

arXiv.org Artificial Intelligence

doi: 10.1615/JMachLearnModelComput.2024057138

2410.12241

Genre:

Research Report > Promising Solution (0.40)
Overview > Innovation (0.40)

Industry: Energy > Oil & Gas > Upstream (0.53)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.60)

Add feedback

Baseflow identification via explainable AI with Kolmogorov-Arnold networks

Liu, Chuyang, Roy, Tirthankar, Tartakovsky, Daniel M., Dwivedi, Dipankar

arXiv.org Artificial IntelligenceOct-10-2024

Hydrological models often involve constitutive laws that may not be optimal in every application. We propose to replace such laws with the Kolmogorov-Arnold networks (KANs), a class of neural networks designed to identify symbolic expressions. We demonstrate KAN's potential on the problem of baseflow identification, a notoriously challenging task plagued by significant uncertainty. KAN-derived functional dependencies of the baseflow components on the aridity index outperform their original counterparts. On a test set, they increase the Nash-Sutcliffe Efficiency (NSE) by 67%, decrease the root mean squared error by 30%, and increase the Kling-Gupta efficiency by 24%. This superior performance is achieved while reducing the number of fitting parameters from three to two. Next, we use data from 378 catchments across the continental United States to refine the water-balance equation at the mean-annual scale. The KAN-derived equations based on the refined water balance outperform both the current aridity index model, with up to a 105% increase in NSE, and the KAN-derived equations based on the original water balance. While the performance of our model and tree-based machine learning methods is similar, KANs offer the advantage of simplicity and transparency and require no specific software or computational tools. This case study focuses on the aridity index formulation, but the approach is flexible and transferable to other hydrological processes.

activation function, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2410.11587

Country:

North America > United States > Nebraska (0.28)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.68)

Industry:

Energy (0.68)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.91)

Add feedback

High-Precision Geosteering via Reinforcement Learning and Particle Filters

Muhammad, Ressi Bonti, Srivastava, Apoorv, Alyaev, Sergey, Bratvold, Reidar Brumer, Tartakovsky, Daniel M.

arXiv.org Artificial IntelligenceFeb-9-2024

Geosteering, a key component of drilling operations, traditionally involves manual interpretation of various data sources such as well-log data. This introduces subjective biases and inconsistent procedures. Academic attempts to solve geosteering decision optimization with greedy optimization and Approximate Dynamic Programming (ADP) showed promise but lacked adaptivity to realistic diverse scenarios. Reinforcement learning (RL) offers a solution to these challenges, facilitating optimal decision-making through reward-based iterative learning. State estimation methods, e.g., particle filter (PF), provide a complementary strategy for geosteering decision-making based on online information. We integrate an RL-based geosteering with PF to address realistic geosteering scenarios. Our framework deploys PF to process real-time well-log data to estimate the location of the well relative to the stratigraphic layers, which then informs the RL-based decision-making process. We compare our method's performance with that of using solely either RL or PF. Our findings indicate a synergy between RL and PF in yielding optimized geosteering decisions.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2402.06377

Country: North America > United States > California > Santa Clara County (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

Neural oscillators for generalization of physics-informed machine learning

Kapoor, Taniya, Chandra, Abhishek, Tartakovsky, Daniel M., Wang, Hongrui, Nunez, Alfredo, Dollevoet, Rolf

arXiv.org Artificial IntelligenceDec-18-2023

A primary challenge of physics-informed machine learning (PIML) is its generalization beyond the training domain, especially when dealing with complex physical problems represented by partial differential equations (PDEs). This paper aims to enhance the generalization capabilities of PIML, facilitating practical, real-world applications where accurate predictions in unexplored regions are crucial. We leverage the inherent causality and temporal sequential characteristics of PDE solutions to fuse PIML models with recurrent neural architectures based on systems of ordinary differential equations, referred to as neural oscillators. Through effectively capturing long-time dependencies and mitigating the exploding and vanishing gradient problem, neural oscillators foster improved generalization in PIML tasks. Extensive experimentation involving time-dependent nonlinear PDEs and biharmonic beam equations demonstrates the efficacy of the proposed approach. Incorporating neural oscillators outperforms existing state-of-the-art methods on benchmark problems across various metrics. Consequently, the proposed method improves the generalization capabilities of PIML, providing accurate solutions for extrapolation and prediction beyond the training data.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2308.08989

Country: Europe > Netherlands (0.28)

Genre:

Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Neural oscillators for magnetic hysteresis modeling

Chandra, Abhishek, Kapoor, Taniya, Daniels, Bram, Curti, Mitrofan, Tiels, Koen, Tartakovsky, Daniel M., Lomonova, Elena A.

arXiv.org Artificial IntelligenceAug-23-2023

Hysteresis is a ubiquitous phenomenon in science and engineering; its modeling and identification are crucial for understanding and optimizing the behavior of various systems. We develop an ordinary differential equation-based recurrent neural network (RNN) approach to model and quantify the hysteresis, which manifests itself in sequentiality and history-dependence. Our neural oscillator, HystRNN, draws inspiration from coupled-oscillatory RNN and phenomenological hysteresis models to update the hidden states. The performance of HystRNN is evaluated to predict generalized scenarios, involving first-order reversal curves and minor loops. The findings show the ability of HystRNN to generalize its behavior to previously untrained regions, an essential feature that hysteresis models must have. This research highlights the advantage of neural oscillators over the traditional RNN-based methods in capturing complex hysteresis patterns in magnetic materials, where traditional rate-dependent methods are inadequate to capture intrinsic nonlinearity.

artificial intelligence, machine learning, neural oscillator, (18 more...)

arXiv.org Artificial Intelligence

2308.12002

Country: Europe > Netherlands (0.29)

Genre: Research Report > New Finding (0.48)

Industry: Materials > Chemicals > Commodity Chemicals > Petrochemicals (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning Nonautonomous Systems via Dynamic Mode Decomposition

Lu, Hannah, Tartakovsky, Daniel M.

arXiv.org Artificial IntelligenceJun-27-2023

We present a data-driven learning approach for unknown nonautonomous dynamical systems with time-dependent inputs based on dynamic mode decomposition (DMD). To circumvent the difficulty of approximating the time-dependent Koopman operators for nonautonomous systems, a modified system derived from local parameterization of the external time-dependent inputs is employed as an approximation to the original nonautonomous system. The modified system comprises a sequence of local parametric systems, which can be well approximated by a parametric surrogate model using our previously proposed framework for dimension reduction and interpolation in parameter space (DRIPS). The offline step of DRIPS relies on DMD to build a linear surrogate model, endowed with reduced-order bases (ROBs), for the observables mapped from training data. Then the offline step constructs a sequence of iterative parametric surrogate models from interpolations on suitable manifolds, where the target/test parameter points are specified by the local parameterization of the test external time-dependent inputs. We present a number of numerical examples to demonstrate the robustness of our method and compare its performance with deep neural networks in the same settings.

artificial intelligence, machine learning, nonautonomous system, (19 more...)

arXiv.org Artificial Intelligence

2306.15618

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > California > Santa Clara County (0.14)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Discovery of sparse hysteresis models for piezoelectric materials

Chandra, Abhishek, Daniels, Bram, Curti, Mitrofan, Tiels, Koen, Lomonova, Elena A., Tartakovsky, Daniel M.

arXiv.org Artificial IntelligenceMay-15-2023

This article presents an approach for modelling hysteresis in piezoelectric materials, that leverages recent advancements in machine learning, particularly in sparse-regression techniques. While sparse regression has previously been used to model various scientific and engineering phenomena, its application to nonlinear hysteresis modelling in piezoelectric materials has yet to be explored. The study employs the least-squares algorithm with a sequential threshold to model the dynamic system responsible for hysteresis, resulting in a concise model that accurately predicts hysteresis for both simulated and experimental piezoelectric material data. Several numerical experiments are performed, including learning butterfly-shaped hysteresis and modelling real-world hysteresis data for a piezoelectric actuator. The presented approach is compared to traditional regression-based and neural network methods, demonstrating its efficiency and robustness.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.1063/5.0146134

2302.05313

Country: North America > United States > California (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Autonomous learning of nonlocal stochastic neuron dynamics

Maltba, Tyler E., Zhao, Hongli, Tartakovsky, Daniel M.

arXiv.org Machine LearningNov-22-2020

Neuronal dynamics is driven by externally imposed or internally generated random excitations/noise, and is often described by systems of stochastic ordinary differential equations. A solution to these equations is the joint probability density function (PDF) of neuron states. It can be used to calculate such information-theoretic quantities as the mutual information between the stochastic stimulus and various internal states of the neuron (e.g., membrane potential), as well as various spiking statistics. When random excitations are modeled as Gaussian white noise, the joint PDF of neuron states satisfies exactly a Fokker-Planck equation. However, most biologically plausible noise sources are correlated (colored). In this case, the resulting PDF equations require a closure approximation. We propose two methods for closing such equations: a modified nonlocal large-eddy-diffusivity closure and a data-driven closure relying on sparse regression to learn relevant features. The closures are tested for stochastic leaky integrate-and-fire (LIF) and FitzHugh-Nagumo (FHN) neurons driven by sine-Wiener noise. Mutual information and total correlation between the random stimulus and the internal states of the neuron are calculated for the FHN neuron.

closure, health & medicine, upstream oil & gas, (20 more...)

arXiv.org Machine Learning

2011.10955

Country: North America > United States > California (0.14)

Genre: Research Report (1.00)

Industry:

Energy > Oil & Gas > Upstream (1.00)
Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.45)

Add feedback

Mutual Information for Explainable Deep Learning of Multiscale Systems

Taverniers, Søren, Hall, Eric J., Katsoulakis, Markos A., Tartakovsky, Daniel M.

arXiv.org Machine LearningSep-7-2020

Timely completion of design cycles for multiscale and multiphysics systems ranging from consumer electronics to hypersonic vehicles relies on rapid simulation-based prototyping. The latter typically involves high-dimensional spaces of possibly correlated control variables (CVs) and quantities of interest (QoIs) with non-Gaussian and/or multimodal distributions. We develop a model-agnostic, moment-independent global sensitivity analysis (GSA) that relies on differential mutual information to rank the effects of CVs on QoIs. Large amounts of data, which are necessary to rank CVs with confidence, are cheaply generated by a deep neural network (DNN) surrogate model of the underlying process. The DNN predictions are made explainable by the GSA so that the DNN can be deployed to close design loops. Our information-theoretic framework is compatible with a wide variety of black-box models. Its application to multiscale supercapacitor design demonstrates that the CV rankings facilitated by a domain-aware Graph-Informed Neural Network are better resolved than their counterparts obtained with a physics-based model for a fixed computational budget. Consequently, our information-theoretic GSA provides an "outer loop" for accelerated product design by identifying the most and least sensitive input directions and performing subsequent optimization over appropriately reduced parameter subspaces.

confidence interval, deep learning, neural network, (20 more...)

arXiv.org Machine Learning

2009.0457

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
North America > United States > California > Santa Clara County (0.14)

Genre: Research Report (0.63)

Industry: Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

GINNs: Graph-Informed Neural Networks for Multiscale Physics

Hall, Eric J., Taverniers, Søren, Katsoulakis, Markos A., Tartakovsky, Daniel M.

arXiv.org Machine LearningJun-26-2020

Typically this requires casting the original deterministic physics-based model into a probabilistic framework where inputs or control variables (CVs) are treated as random variables with probability distributions derived from available experimental data, manufacturing constraints, design criteria, expert judgment, and/or other domain knowledge (e.g., see [1]). Running the physics-based model with CVs sampled according to these distributions yields corresponding realizations of the system response as characterized by quantities of interest (QoIs). Analysis of the uncertainty propagation from the CVs to the QoIs informs decision-making, e.g., it informs engineering decisions aimed at improving the quality and reliability of designed products and helps identify potential risks at early stages in the design and manufacturing process. Quantitatively assessing uncertainty propagation presents a fundamental challenge due to the computational cost of the underlying physics-based model. Even for a low number of CVs and QoIs, uncertainty quantification (UQ) for, e.g., accelerating the simulation-aided design of multiscale systems and data-centric engineering tasks more generally ([2]), requires a large number of repeated observations of QoIs to achieve a high degree of confidence in such an analysis. The sampling cost is further exacerbated in real-world applications where distributions on QoIs are typically non-Gaussian, skewed, and/or mutually correlated, and therefore need to be characterized by their full probability density function (PDF) rather than through summary statistics such as mean and variance. The computational cost of nonparametric methods to estimate these densities can become prohibitively high when using a fully-featured physics-based model to compute each sample. One approach to alleviate the computational burden is to derive a cheaper-to-compute surrogate for the physicsbased model's response enabling much faster generation of output data and thus overcoming computational bottlenecks.

deep learning, ginn, upstream oil & gas, (19 more...)

arXiv.org Machine Learning

2006.14807

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
North America > United States > California > Santa Clara County (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.64)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.88)

Add feedback