AITopics | Mishra, Siddhartha

Mishra, Siddhartha

A universal approximation theorem for nonlinear resistive networks

Scellier, Benjamin, Mishra, Siddhartha

arXiv.org Artificial Intelligence, Dec-22-2023

Resistor networks have recently attracted a surge of interest as substrates for energy-efficient self-learning machines. This work studies the computational capabilities of these resistor networks. We show that electrical networks composed of voltage sources, linear resistors, diodes and voltage-controlled voltage sources (VCVS) can implement any continuous function. To prove it, we assume that the circuit elements are ideal and that the conductances of variable resistors and the amplification factors of the VCVSs can take arbitrary values, arbitrarily small or arbitrarily large. The constructive nature of our proof could also inform the design of such self-learning electrical networks.

artificial intelligence, machine learning, voltage source, (17 more...)

arXiv.org Artificial Intelligence

2312.15063

Country: North America (0.14)

Genre: Research Report (0.40)

Industry: Energy > Power Industry (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Estimates on the generalization error of Physics Informed Neural Networks (PINNs) for approximating a class of inverse problems for PDEs

Mishra, Siddhartha, Molinaro, Roberto

arXiv.org Artificial Intelligence, Dec-6-2023

Physics informed neural networks (PINNs) have recently been very successfully applied for efficiently approximating inverse problems for PDEs. We focus on a particular class of inverse problems, the so-called data assimilation or unique continuation problems, and prove rigorous estimates on the generalization error of PINNs approximating them. An abstract framework is presented and conditional stability estimates for the underlying inverse problem are employed to derive the estimate on the PINN generalization error, providing rigorous justification for the use of PINNs in this context. The abstract framework is illustrated with examples of four prototypical linear PDEs. Numerical experiments, validating the proposed theory, are also presented.

artificial intelligence, inverse problem, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2007.01138

Country: Europe > Switzerland (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Estimates on the generalization error of Physics Informed Neural Networks (PINNs) for approximating PDEs

Mishra, Siddhartha, Molinaro, Roberto

arXiv.org Artificial Intelligence, Dec-6-2023

Physics informed neural networks (PINNs) have recently been widely used for robust and accurate approximation of PDEs. We provide rigorous upper bounds on the generalization error of PINNs approximating solutions of the forward problem for PDEs. An abstract formalism is introduced and stability properties of the underlying PDE are leveraged to derive an estimate for the generalization error in terms of the training error and number of training samples. This abstract framework is illustrated with several examples of nonlinear PDEs. Numerical experiments, validating the proposed theory, are also presented.
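
The training error mentioned in this abstract can be illustrated with a toy sketch. It is not the authors' code: the ODE u' + u = 0 with u(0) = 1, the one-parameter trial function u(x) = exp(-theta * x) standing in for a neural network, and all function names are assumptions chosen for illustration. The residual is evaluated at random collocation points and its root mean square plays the role of the training error.

```python
import math
import random
from typing import Sequence


def residual(theta: float, x: float) -> float:
    """Residual of u' + u = 0 for the trial function u(x) = exp(-theta * x)."""
    u = math.exp(-theta * x)
    du = -theta * u  # derivative taken by hand; a real PINN would use autodiff
    return du + u


def training_error(theta: float, points: Sequence[float]) -> float:
    """Root-mean-square residual over the collocation points."""
    return math.sqrt(sum(residual(theta, x) ** 2 for x in points) / len(points))


random.seed(0)
collocation = [random.random() for _ in range(256)]  # uniform samples on [0, 1]

exact = training_error(1.0, collocation)  # theta = 1 solves the ODE
wrong = training_error(2.0, collocation)  # theta = 2 leaves a nonzero residual
print(exact, wrong)
```

The trial function satisfies the initial condition by construction, so only the interior residual contributes here; a full PINN loss would normally add boundary or initial terms.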

artificial intelligence, generalization error, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2006.16144

Country: Europe > Switzerland (0.14)

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas > Upstream (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Convolutional Neural Operators for robust and accurate learning of PDEs

Raonić, Bogdan, Molinaro, Roberto, De Ryck, Tim, Rohner, Tobias, Bartolucci, Francesca, Alaifari, Rima, Mishra, Siddhartha, de Bézenac, Emmanuel

arXiv.org Artificial Intelligence, Dec-1-2023

Although used very successfully in conventional machine learning, convolution-based neural network architectures (believed to be inconsistent in function space) have been largely ignored in the context of learning solution operators of PDEs. Here, we present novel adaptations for convolutional neural networks to demonstrate that they are indeed able to process functions as inputs and outputs. The resulting architecture, termed convolutional neural operators (CNOs), is designed specifically to preserve its underlying continuous nature, even when implemented in a discretized form on a computer. We prove a universality theorem to show that CNOs can approximate operators arising in PDEs to desired accuracy. CNOs are tested on a novel suite of benchmarks, encompassing a diverse set of PDEs with possibly multi-scale solutions, and are observed to significantly outperform baselines, paving the way for an alternative framework for robust and accurate operator learning. Our code is publicly available at https://github.com/bogdanraonic3/ConvolutionalNeuralOperator

artificial intelligence, convolutional neural operator, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2302.01178

Country:

Europe > United Kingdom (0.14)
Europe > Sweden (0.14)
Europe > Germany (0.14)

Genre: Research Report > New Finding (0.45)

Industry: Energy > Oil & Gas > Upstream (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Representation Equivalent Neural Operators: a Framework for Alias-free Operator Learning

Bartolucci, Francesca, de Bézenac, Emmanuel, Raonić, Bogdan, Molinaro, Roberto, Mishra, Siddhartha, Alaifari, Rima

arXiv.org Artificial Intelligence, Nov-2-2023

Recently, operator learning, or learning mappings between infinite-dimensional function spaces, has garnered significant attention, notably in relation to learning partial differential equations from data. Conceptually clear when outlined on paper, neural operators necessitate discretization in the transition to computer implementations. This step can compromise their integrity, often causing them to deviate from the underlying operators. This research offers a fresh take on neural operators with a framework, Representation equivalent Neural Operators (ReNO), designed to address these issues. At its core is the concept of operator aliasing, which measures inconsistency between neural operators and their discrete representations. We explore this for widely used operator learning techniques. Our findings detail how aliasing introduces errors when handling different discretizations and grids, and how it leads to the loss of crucial continuous structures. More generally, this framework not only sheds light on existing challenges but, given its constructive and broad nature, also potentially offers tools for developing new neural operators.
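
As background for the aliasing discussed in this abstract, the sketch below shows classical sampling aliasing, which is a simpler notion than the operator aliasing the paper defines. The grid sizes, frequencies and the helper name `sample` are assumptions made for illustration. Two sine waves of different frequency become indistinguishable on a coarse equispaced grid, yet differ on a finer one.

```python
import math


def sample(freq: int, n: int) -> list:
    """Sample sin(2*pi*freq*x) on the equispaced grid x_j = j / n."""
    return [math.sin(2 * math.pi * freq * j / n) for j in range(n)]


n = 8
low = sample(1, n)       # frequency 1
high = sample(1 + n, n)  # frequency 9, beyond what 8 points can resolve

# On the coarse grid the two signals agree up to rounding error.
coarse_gap = max(abs(a - b) for a, b in zip(low, high))

# On a finer grid the same two functions are clearly different.
fine_gap = max(abs(a - b) for a, b in zip(sample(1, 32), sample(1 + n, 32)))
print(coarse_gap, fine_gap)
```

Any computation that only sees the 8 samples cannot tell the two inputs apart, which is one way a discretized model can depart from the continuous object it is meant to represent.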

artificial intelligence, machine learning, operator, (12 more...)

arXiv.org Artificial Intelligence

2305.19913

Country: Europe (0.28)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

An operator preconditioning perspective on training in physics-informed machine learning

De Ryck, Tim, Bonnet, Florent, Mishra, Siddhartha, de Bézenac, Emmanuel

arXiv.org Artificial Intelligence, Oct-9-2023

In this paper, we investigate the behavior of gradient descent algorithms in physics-informed machine learning methods like PINNs, which minimize residuals connected to partial differential equations (PDEs). Our key result is that the difficulty in training these models is closely related to the conditioning of a specific differential operator. This operator, in turn, is associated with the Hermitian square of the differential operator of the underlying PDE. If this operator is ill-conditioned, training becomes slow or infeasible. Therefore, preconditioning this operator is crucial. We employ both rigorous mathematical analysis and empirical evaluations to investigate various strategies, explaining how they better condition this critical operator and consequently improve training.
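
The role of the Hermitian square can be illustrated with a finite-difference toy, which is an assumption of this sketch and not the paper's analysis. For the standard second-difference matrix A of the 1D Dirichlet Laplacian on n interior points, the eigenvalues are known in closed form, and because A is symmetric the Hermitian square A^T A has the squared eigenvalues. Its condition number is therefore the square of that of A, and both grow as the grid is refined.

```python
import math


def laplacian_eigenvalues(n: int) -> list:
    """Eigenvalues of tridiag(-1, 2, -1) / h^2 with h = 1 / (n + 1)."""
    h = 1.0 / (n + 1)
    return [(2.0 - 2.0 * math.cos(k * math.pi / (n + 1))) / h ** 2
            for k in range(1, n + 1)]


def condition_number(eigs: list) -> float:
    """Ratio of largest to smallest eigenvalue (all eigenvalues are positive)."""
    return max(eigs) / min(eigs)


results = {}
for n in (10, 100):
    eigs = laplacian_eigenvalues(n)
    kappa = condition_number(eigs)                      # conditioning of A
    kappa_sq = condition_number([e * e for e in eigs])  # conditioning of A^T A
    results[n] = (kappa, kappa_sq)
    print(n, kappa, kappa_sq)
```

Gradient descent on the least-squares loss 0.5 * ||A u - f||^2 has Hessian A^T A, so in this toy setting the squared condition number is the quantity that governs how slowly the iteration converges.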

artificial intelligence, condition number, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2310.05801

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > California (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Structured Matrix Method for Nonequispaced Neural Operators

Lingsch, Levi, Michelis, Mike, de Bézenac, Emmanuel, Perera, Sirani M., Katzschmann, Robert K., Mishra, Siddhartha

arXiv.org Artificial Intelligence, Oct-6-2023

The computational efficiency of many neural operators, widely used for learning solutions of PDEs, relies on the fast Fourier transform (FFT) for performing spectral computations. However, as FFT is limited to equispaced (rectangular) grids, this limits the efficiency of such neural operators when applied to problems where the input and output functions need to be processed on general non-equispaced point distributions. We address this issue by proposing a novel method that leverages batch matrix multiplications to efficiently construct Vandermonde-structured matrices and compute forward and inverse transforms, on arbitrarily distributed points. An efficient implementation of such structured matrix methods is coupled with existing neural operator models to allow the processing of data on arbitrary non-equispaced distributions of points. With extensive empirical evaluation, we demonstrate that the proposed method allows one to extend neural operators to very general point distributions with significant gains in training speed over baselines, while retaining or improving accuracy.
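
The idea of replacing the FFT with a structured matrix product can be sketched as follows. This is a plain-Python illustration under stated assumptions, not the authors' batched implementation: the function names are hypothetical, the points are taken in [0, 2*pi), and only the forward transform is shown. Each row of the matrix holds the powers z_j^k with z_j = exp(-i * x_j), which is the Vandermonde structure; on equispaced points the result coincides with the ordinary discrete Fourier transform.

```python
import cmath
import math


def vandermonde(points, num_modes):
    """V[k][j] = exp(-1j * k * x_j), i.e. the k-th power of z_j = exp(-1j * x_j)."""
    return [[cmath.exp(-1j * k * x) for x in points] for k in range(num_modes)]


def forward_transform(values, points, num_modes):
    """Matrix-vector product V @ values, written out with plain loops."""
    matrix = vandermonde(points, num_modes)
    return [sum(v * f for v, f in zip(row, values)) for row in matrix]


def naive_dft(values):
    """Reference discrete Fourier transform on an equispaced grid."""
    n = len(values)
    return [sum(values[j] * cmath.exp(-2j * math.pi * k * j / n) for j in range(n))
            for k in range(n)]


n = 8
values = [math.cos(3 * 2 * math.pi * j / n) + 0.5 * j for j in range(n)]
equispaced = [2 * math.pi * j / n for j in range(n)]

# Equispaced points: the Vandermonde product reproduces the DFT.
gap = max(abs(a - b) for a, b in
          zip(forward_transform(values, equispaced, n), naive_dft(values)))

# Arbitrary points: the same code applies where an FFT would not.
scattered = [0.1, 0.7, 1.9, 2.2, 3.5, 4.1, 5.0, 6.0]
coeffs = forward_transform(values, scattered, 4)
print(gap, coeffs)
```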

artificial intelligence, data quality, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2305.19663

Country:

North America > United States (0.46)
Europe > Switzerland > Zürich > Zürich (0.15)

Genre: Research Report (0.84)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Data Science > Data Quality > Data Transformation (0.71)

How does over-squashing affect the power of GNNs?

Di Giovanni, Francesco, Rusch, T. Konstantin, Bronstein, Michael M., Deac, Andreea, Lackenby, Marc, Mishra, Siddhartha, Veličković, Petar

arXiv.org Artificial Intelligence, Aug-16-2023

Graph Neural Networks (GNNs) are the state-of-the-art model for machine learning on graph-structured data. The most popular class of GNNs operates by exchanging information between adjacent nodes, and is known as Message Passing Neural Networks (MPNNs). Given their widespread use, understanding the expressive power of MPNNs is a key question. However, existing results typically consider settings with uninformative node features. In this paper, we provide a rigorous analysis to determine which function classes of node features can be learned by an MPNN of a given capacity. We do so by measuring the level of pairwise interactions between nodes that MPNNs allow for. This measure provides a novel quantitative characterization of the so-called over-squashing effect, which is observed to occur when a large volume of messages is aggregated into fixed-size vectors. Using our measure, we prove that, to guarantee sufficient communication between pairs of nodes, the capacity of the MPNN must be large enough, depending on properties of the input graph structure, such as commute times. For many relevant scenarios, our analysis results in impossibility statements in practice, showing that over-squashing hinders the expressive power of MPNNs. We validate our theoretical findings through extensive controlled experiments and ablation studies.
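
The message-passing mechanism described in this abstract can be sketched with a stripped-down layer. The sum aggregation without learned weights or nonlinearity, the path graph and the function name are assumptions made for illustration; the sketch shows only that information travels one hop per layer, not the paper's measure of over-squashing.

```python
def message_passing_step(features, adjacency):
    """One sum-aggregation layer: each node adds its neighbours' features to its own."""
    return [features[v] + sum(features[u] for u in adjacency[v])
            for v in range(len(features))]


# Path graph 0 - 1 - 2 - 3, with a signal placed on node 0 only.
adjacency = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
features = [1.0, 0.0, 0.0, 0.0]

history = []
h = features
for _ in range(3):
    h = message_passing_step(h, adjacency)
    history.append(h)
    print(h)
```

Node 3 sits three hops from node 0, so its feature stays at zero until the third layer. On larger graphs, many such distant signals have to pass through the same fixed-size node states, which is the setting in which over-squashing is discussed.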

artificial intelligence, machine learning, mpnn, (18 more...)

arXiv.org Artificial Intelligence

2306.03589

Country: Europe > United Kingdom > England (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Inverse Operators for Solving PDE Inverse Problems

Molinaro, Roberto, Yang, Yunan, Engquist, Björn, Mishra, Siddhartha

arXiv.org Artificial Intelligence, Jun-3-2023

A large class of inverse problems for PDEs is only well-defined as mappings from operators to functions. Existing operator learning frameworks map functions to functions and need to be modified to learn inverse maps from data. We propose a novel architecture termed Neural Inverse Operators (NIOs) to solve these PDE inverse problems. Motivated by the underlying mathematical structure, NIO is based on a suitable composition of DeepONets and FNOs to approximate mappings from operators to functions. A variety of experiments are presented to demonstrate that NIOs significantly outperform baselines, solve PDE inverse problems robustly and accurately, and are several orders of magnitude faster than existing direct and PDE-constrained optimization methods.

artificial intelligence, coefficient, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2301.11167

Country: North America > United States (0.14)

Genre: Research Report > New Finding (0.45)

Industry: Energy > Oil & Gas > Upstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Data Science (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Neural Oscillators are Universal

Lanthaler, Samuel, Rusch, T. Konstantin, Mishra, Siddhartha

arXiv.org Artificial Intelligence, May-15-2023

Coupled oscillators are being increasingly used as the basis of machine learning (ML) architectures, for instance in sequence modeling, graph representation learning and in physical neural networks that are used in analog ML devices. We introduce an abstract class of neural oscillators that encompasses these architectures and prove that neural oscillators are universal, i.e., they can approximate any continuous and causal operator mapping between time-varying functions, to desired accuracy. This universality result provides theoretical justification for the use of oscillator-based ML systems. The proof builds on a fundamental result of independent interest, which shows that a combination of forced harmonic oscillators with a nonlinear read-out suffices to approximate the underlying operators.
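
The building block named at the end of this abstract, forced harmonic oscillators followed by a nonlinear read-out, can be sketched as below. The symplectic Euler time stepping, the tanh read-out, the fixed weights and frequencies and the function name are all assumptions made for illustration, not the construction used in the proof. The sketch also shows causality: the output at a given step depends only on the input up to that step.

```python
import math


def oscillator_readout(inputs, frequencies, weights, dt=0.01):
    """Drive oscillators y'' + omega^2 * y = u(t) and apply a tanh read-out."""
    y = [0.0] * len(frequencies)  # positions
    v = [0.0] * len(frequencies)  # velocities
    outputs = []
    for u in inputs:
        for i, omega in enumerate(frequencies):
            v[i] += dt * (u - omega ** 2 * y[i])  # symplectic Euler: velocity first
            y[i] += dt * v[i]                     # then position
        outputs.append(math.tanh(sum(w * yi for w, yi in zip(weights, y))))
    return outputs


frequencies = [1.0, 2.0]
weights = [1.0, 0.5]

signal_a = [math.sin(0.05 * t) for t in range(200)]
signal_b = signal_a[:100] + [0.0] * 100  # same input up to step 100, then silent

out_a = oscillator_readout(signal_a, frequencies, weights)
out_b = oscillator_readout(signal_b, frequencies, weights)
print(out_a[99] == out_b[99], out_a[100] == out_b[100])
```

The two outputs coincide while the inputs coincide and separate as soon as the inputs differ, which is the causal behaviour the abstract refers to.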

artificial intelligence, machine learning, oscillator, (16 more...)

arXiv.org Artificial Intelligence

2305.08753

Country: North America > United States (0.28)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
