approximation property
Approximation Rates of Shallow Neural Networks: Barron Spaces, Activation Functions and Optimality Analysis
This paper investigates the approximation properties of shallow neural networks with activation functions that are powers of exponential functions. It focuses on how the approximation rate depends on the dimension and on the smoothness of the function being approximated within the Barron function space. We examine the approximation rates of shallow networks with ReLU$^{k}$ activation functions, proving that the optimal rate cannot be achieved under $\ell^{1}$-bounded coefficients or insufficient smoothness conditions. We also establish optimal approximation rates in various norms for functions in Barron spaces and Sobolev spaces, confirming the curse of dimensionality. Our results clarify the limits of shallow neural networks' approximation capabilities and offer insights into the selection of activation functions and network structures.
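For orientation, a minimal sketch in standard notation (not quoted from the paper): the shallow networks in question and the classical Barron-type rate whose possible sharpening the paper analyses.

```latex
% Shallow (one-hidden-layer) network with n neurons and activation \sigma:
f_n(x) \;=\; \sum_{i=1}^{n} a_i\, \sigma(w_i \cdot x + b_i), \qquad x \in \mathbb{R}^d.

% Classical Barron-type estimate: for f in the Barron space with norm \|f\|_{\mathcal{B}},
% there exists such an f_n with
\| f - f_n \|_{L^2(\mu)} \;\lesssim\; \|f\|_{\mathcal{B}}\, n^{-1/2}.
```

The question addressed above is when rates sharper than $n^{-1/2}$ can, or provably cannot, be attained, in particular under $\ell^{1}$-bounded outer coefficients.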
Approximation theory for 1-Lipschitz ResNets
Murari, Davide, Furuya, Takashi, Schönlieb, Carola-Bibiane
1-Lipschitz neural networks are fundamental for generative modelling, inverse problems, and robust classifiers. In this paper, we focus on 1-Lipschitz residual networks (ResNets) based on explicit Euler steps of negative gradient flows and study their approximation capabilities. Leveraging the Restricted Stone-Weierstrass Theorem, we first show that these 1-Lipschitz ResNets are dense in the set of scalar 1-Lipschitz functions on any compact domain when width and depth are allowed to grow. We also show that these networks can exactly represent scalar piecewise affine 1-Lipschitz functions. We then prove a stronger statement: by inserting norm-constrained linear maps between the residual blocks, the same density holds when the hidden width is fixed. Because every layer obeys simple norm constraints, the resulting models can be trained with off-the-shelf optimisers. This paper provides the first universal approximation guarantees for 1-Lipschitz ResNets, laying a rigorous foundation for their practical use.
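A minimal numerical sketch of the kind of residual block described above, assuming the common "convex potential" parameterisation $x \mapsto x - h\,\nabla V(x)$ with $V(x) = \sum_i \varphi(w_i^\top x + b_i)$ and $\varphi(t) = \tfrac12 \max(t,0)^2$; the paper's exact blocks and norm constraints may differ.

```python
# Minimal sketch (one common construction, not necessarily the paper's exact
# parameterisation): a residual block x -> x - h * grad V(x) with the convex
# potential V(x) = sum_i phi(w_i^T x + b_i), phi(t) = 0.5 * max(t, 0)^2, so that
# grad V(x) = W^T relu(W x + b). Since grad V is the gradient of a convex function
# whose Lipschitz constant is ||W||_2^2, choosing h <= 2 / ||W||_2^2 makes the
# block 1-Lipschitz (nonexpansive).
import numpy as np

def lipschitz_residual_block(x, W, b):
    """One explicit Euler step of the negative gradient flow of V; 1-Lipschitz by construction."""
    h = 2.0 / (np.linalg.norm(W, 2) ** 2 + 1e-12)   # step-size bound (assumed constraint)
    return x - h * (W.T @ np.maximum(W @ x + b, 0.0))

# toy check of nonexpansiveness on random inputs
rng = np.random.default_rng(0)
W, b = rng.standard_normal((16, 8)), rng.standard_normal(16)
x, y = rng.standard_normal(8), rng.standard_normal(8)
fx, fy = lipschitz_residual_block(x, W, b), lipschitz_residual_block(y, W, b)
assert np.linalg.norm(fx - fy) <= np.linalg.norm(x - y) + 1e-9
```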
Distributionally robust approximation property of neural networks
Ceylan, Mihriban, Prömel, David J.
The universal approximation property uniformly with respect to weakly compact families of measures is established for several classes of neural networks. To that end, we prove that these neural networks are dense in Orlicz spaces, thereby extending classical universal approximation theorems even beyond the traditional $L^p$-setting. The covered classes of neural networks include widely used architectures like feedforward neural networks with non-polynomial activation functions, deep narrow networks with ReLU activation functions and functional input neural networks.
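For readers less familiar with the setting, the standard background definition (general, not specific to this paper): for a Young function $\Phi$, the Orlicz space and its Luxemburg norm are

```latex
% Orlicz space for a Young function \Phi (convex, \Phi(0) = 0, \Phi(t) \to \infty):
L^{\Phi}(\mu) \;=\; \Bigl\{ f \ \text{measurable} \;:\; \textstyle\int \Phi(|f|/\lambda)\, d\mu < \infty \ \text{for some } \lambda > 0 \Bigr\},

% with the Luxemburg norm
\|f\|_{\Phi} \;=\; \inf\Bigl\{ \lambda > 0 \;:\; \textstyle\int \Phi(|f|/\lambda)\, d\mu \le 1 \Bigr\}.

% The choice \Phi(t) = t^p, 1 <= p < \infty, recovers the usual L^p(\mu) spaces,
% which is why density in Orlicz spaces extends the classical L^p results.
```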
Convergence Analysis of Max-Min Exponential Neural Network Operators in Orlicz Space
Pradhan, Satyaranjan, Soren, Madan Mohan
In this work, we propose a Max-Min approach for approximating functions using exponential neural network operators. We extend this framework to develop Max-Min Kantorovich-type exponential neural network operators and investigate their approximation properties. We study both pointwise and uniform convergence for univariate functions. To analyze the order of convergence, we use the logarithmic modulus of continuity and estimate the corresponding rate of convergence. Furthermore, we examine the convergence behavior of the Max-Min Kantorovich-type exponential neural network operators in the Orlicz space setting. We provide graphical representations illustrating the approximation error for suitable kernels and sigmoidal activation functions.
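The rate estimates are phrased in terms of the logarithmic modulus of continuity; its standard form in the exponential-sampling literature (assumed here, not quoted from the paper) is

```latex
% Logarithmic modulus of continuity of f on (0, \infty):
\omega_{\log}(f, \delta) \;=\; \sup\bigl\{\, |f(u) - f(v)| \;:\; u, v > 0,\ |\log u - \log v| \le \delta \,\bigr\}, \qquad \delta > 0.
```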
A Dynamical Systems Perspective on the Analysis of Neural Networks
Chemnitz, Dennis, Engel, Maximilian, Kuehn, Christian, Kuntz, Sara-Viola
In this chapter, we utilize dynamical systems to analyze several aspects of machine learning algorithms. As an expository contribution we demonstrate how to re-formulate a wide variety of challenges from deep neural networks, (stochastic) gradient descent, and related topics into dynamical statements. We also tackle three concrete challenges. First, we consider the process of information propagation through a neural network, i.e., we study the input-output map for different architectures. We explain the universal embedding property for augmented neural ODEs representing arbitrary functions of given regularity, the classification of multilayer perceptrons and neural ODEs in terms of suitable function classes, and the memory-dependence in neural delay equations. Second, we consider the training aspect of neural networks dynamically. We describe a dynamical systems perspective on gradient descent and study stability for overdetermined problems. We then extend this analysis to the overparameterized setting and describe the edge of stability phenomenon, also in the context of possible explanations for implicit bias. For stochastic gradient descent, we present stability results for the overparameterized setting via Lyapunov exponents of interpolation solutions. Third, we explain several results regarding mean-field limits of neural networks. We describe a result that extends existing techniques to heterogeneous neural networks involving graph limits via digraph measures. This shows how large classes of neural networks naturally fall within the framework of Kuramoto-type models on graphs and their large-graph limits. Finally, we point out that similar strategies to use dynamics to study explainable and reliable AI can also be applied to settings such as generative models or fundamental issues in gradient training methods, such as backpropagation or vanishing/exploding gradients.
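A minimal illustration of the dynamical-systems viewpoint mentioned above (a generic example, not code from the chapter): gradient descent is the explicit Euler discretisation of the gradient flow, and on a quadratic its stability hinges on the step size relative to the sharpness threshold $2/\lambda_{\max}$.

```python
# Generic illustration (not code from the chapter): gradient descent is the
# explicit Euler discretisation of the gradient flow x'(t) = -grad L(x(t)).
# On the quadratic L(x) = 0.5 * x^T A x the iteration x_{k+1} = (I - eta*A) x_k
# is stable iff eta < 2 / lambda_max(A), the threshold behind "edge of stability".
import numpy as np

A = np.diag([1.0, 10.0])                    # toy Hessian, lambda_max(A) = 10
threshold = 2.0 / np.linalg.eigvalsh(A).max()

def run_gd(eta, steps=200):
    x = np.array([1.0, 1.0])
    for _ in range(steps):
        x = x - eta * (A @ x)               # one gradient-descent / Euler step
    return np.linalg.norm(x)

print(run_gd(0.95 * threshold))             # below the threshold: converges to 0
print(run_gd(1.05 * threshold))             # above the threshold: diverges
```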