pushforward


Adaptive Nonlinear Data Assimilation through P-Spline Triangular Measure Transport

Lunde, Berent Å. S., Ramgraber, Maximilian

arXiv.org Machine Learning

Non-Gaussian statistics are a challenge for data assimilation. Linear methods oversimplify the problem, yet fully nonlinear methods are often too expensive to use in practice. The best solution usually lies between these extremes. Triangular measure transport offers a flexible framework for nonlinear data assimilation. Its success, however, depends on how the map is parametrized. Too much flexibility leads to overfitting; too little misses important structure. To strike this balance, we develop an adaptation algorithm that selects a parsimonious parametrization automatically. Our method uses P-spline basis functions and an information criterion as a continuous measure of model complexity. This formulation enables gradient descent and allows efficient, fine-scale adaptation in high-dimensional settings. The resulting algorithm requires no hyperparameter tuning. It adjusts the transport map to the appropriate level of complexity based on the system statistics and ensemble size. We demonstrate its performance on nonlinear, non-Gaussian problems, including a high-dimensional distributed groundwater model.
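
As a minimal illustration of the machinery this abstract builds on (not the authors' method), the sketch below evaluates the pullback log-density induced by a lower-triangular Knothe-Rosenblatt map that pushes the target distribution to a standard normal; the paper's adaptive P-spline basis would replace the hand-written components S_k, which here are hypothetical toy choices.

    import jax
    import jax.numpy as jnp

    def triangular_pullback_logpdf(S_components, x):
        """Log-density induced by a lower-triangular (Knothe-Rosenblatt) map.

        S_components[k] maps x[:k+1] to a scalar and is monotone increasing
        in its last argument, so the map pushes the target forward to a
        standard normal:
            log p(x) = sum_k [ log N(S_k(x_{1:k}); 0, 1) + log dS_k/dx_k ].
        """
        logp = 0.0
        for k, S_k in enumerate(S_components):
            s = S_k(x[: k + 1])
            ds_dxk = jax.grad(S_k)(x[: k + 1])[-1]  # positive by monotonicity
            logp += -0.5 * (s ** 2 + jnp.log(2.0 * jnp.pi)) + jnp.log(ds_dxk)
        return logp

    # Toy 2-D "banana" map: S_1(x_1) = x_1, S_2(x_1, x_2) = x_2 - x_1^2.
    S = [lambda u: u[0], lambda u: u[1] - u[0] ** 2]
    print(triangular_pullback_logpdf(S, jnp.array([0.5, 1.0])))

Adapting the parametrization of each S_k, as the paper does, trades off exactly the flexibility the abstract describes: richer components capture more structure but overfit small ensembles.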


Geometric Gaussian Approximations of Probability Distributions

Da Costa, Nathaël, Mucsányi, Bálint, Hennig, Philipp

arXiv.org Artificial Intelligence

Approximating complex probability distributions, such as Bayesian posterior distributions, is of central interest in many applications. We study the expressivity of geometric Gaussian approximations: approximations by Gaussian pushforwards through diffeomorphisms or Riemannian exponential maps. We first review these two kinds of geometric Gaussian approximation and explore how they relate to one another. We then provide a constructive proof that such geometric Gaussian approximations are universal, in that they can capture any probability distribution. Finally, we discuss whether, given a family of probability distributions, a common diffeomorphism can be found to obtain uniformly high-quality geometric Gaussian approximations for that family.
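
To make the pushforward construction concrete, here is a small sketch of sampling from, and evaluating the density of, a Gaussian pushed forward through a diffeomorphism via the change-of-variables formula; the elementwise map phi below is an arbitrary illustrative choice, not one from the paper.

    import jax
    import jax.numpy as jnp

    def pushforward_sample(key, phi, dim):
        """Draw x = phi(z) with z ~ N(0, I): a geometric Gaussian sample."""
        return phi(jax.random.normal(key, (dim,)))

    def pushforward_logpdf(phi, phi_inv, x):
        """Change of variables:
        log p(x) = log N(phi_inv(x); 0, I) - log|det J_phi(phi_inv(x))|."""
        z = phi_inv(x)
        _, logdet = jnp.linalg.slogdet(jax.jacfwd(phi)(z))
        return -0.5 * (z @ z + z.size * jnp.log(2.0 * jnp.pi)) - logdet

    # An elementwise diffeomorphism of R^d with a closed-form inverse.
    phi, phi_inv = jnp.sinh, jnp.arcsinh
    x = pushforward_sample(jax.random.PRNGKey(0), phi, dim=3)
    print(pushforward_logpdf(phi, phi_inv, x))

The paper's universality result says that, with a sufficiently flexible diffeomorphism in place of this toy phi, the construction can capture any probability distribution.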


Rethinking Approximate Gaussian Inference in Classification

Mucsányi, Bálint, Da Costa, Nathaël, Hennig, Philipp

arXiv.org Machine Learning

In classification tasks, softmax functions are ubiquitously used as output activations to produce predictive probabilities. Such outputs only capture aleatoric uncertainty. To capture epistemic uncertainty, approximate Gaussian inference methods have been proposed, which output Gaussian distributions over the logit space. Predictives are then obtained as the expectations of the Gaussian distributions pushed forward through the softmax. However, such softmax Gaussian integrals cannot be solved analytically, and Monte Carlo (MC) approximations can be costly and noisy. We propose a simple change in the learning objective which allows the exact computation of predictives and enjoys improved training dynamics, with no runtime or memory overhead. This framework is compatible with a family of output activation functions that includes the softmax, as well as element-wise normCDF and sigmoid. Moreover, it allows for approximating the Gaussian pushforwards with Dirichlet distributions by analytic moment matching. We evaluate our approach combined with several approximate Gaussian inference methods (Laplace, HET, SNGP) on large- and small-scale datasets (ImageNet, CIFAR-10), demonstrating improved uncertainty quantification capabilities compared to softmax MC sampling. Code is available at https://github.com/bmucsanyi/probit.
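
The baseline and the closed-form alternative mentioned here can be sketched in a few lines: a Monte Carlo estimate of the softmax-Gaussian predictive, next to an analytic predictive for elementwise normCDF activations using the standard identity E[Phi(z)] = Phi(mu / sqrt(1 + var)) for z ~ N(mu, var). This illustrates the kind of closed form the paper exploits; it is not the authors' exact objective.

    import jax
    import jax.numpy as jnp
    from jax.scipy.stats import norm

    def mc_softmax_predictive(key, mu, var, n_samples=1024):
        """Monte Carlo predictive E[softmax(z)], z ~ N(mu, diag(var)):
        costly and noisy, as the abstract notes."""
        eps = jax.random.normal(key, (n_samples,) + mu.shape)
        return jax.nn.softmax(mu + jnp.sqrt(var) * eps, axis=-1).mean(axis=0)

    def normcdf_predictive(mu, var):
        """Exact Gaussian expectation of elementwise normCDF activations,
        E[Phi(z_i)] = Phi(mu_i / sqrt(1 + var_i)), renormalized to sum to 1."""
        p = norm.cdf(mu / jnp.sqrt(1.0 + var))
        return p / p.sum(axis=-1, keepdims=True)

    mu = jnp.array([2.0, 0.5, -1.0])
    var = jnp.array([1.0, 4.0, 0.25])
    print(mc_softmax_predictive(jax.random.PRNGKey(0), mu, var))
    print(normcdf_predictive(mu, var))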


Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators

Shi, Zekun, Hu, Zheyuan, Lin, Min, Kawaguchi, Kenji

arXiv.org Artificial Intelligence

Optimizing neural networks with losses that contain high-dimensional and high-order differential operators is expensive: evaluating them with back-propagation incurs $\mathcal{O}(d^{k})$ scaling in the derivative tensor size and $\mathcal{O}(2^{k-1}L)$ scaling in the computation graph, where $d$ is the dimension of the domain, $L$ is the number of ops in the forward computation graph, and $k$ is the derivative order. In previous works, the polynomial scaling in $d$ was addressed by amortizing the computation over the optimization process via randomization. Separately, the exponential scaling in $k$ for univariate functions ($d=1$) was addressed with high-order auto-differentiation (AD). In this work, we show how to efficiently perform arbitrary contractions of the derivative tensor of arbitrary order for multivariate functions by properly constructing the input tangents to univariate high-order AD, which can be used to efficiently randomize any differential operator. When applied to Physics-Informed Neural Networks (PINNs), our method provides a >1000$\times$ speed-up and >30$\times$ memory reduction over randomization with first-order AD, and we can now solve \emph{1-million-dimensional PDEs in 8 minutes on a single NVIDIA A100 GPU}. This work opens the possibility of using high-order differential operators in large-scale problems.
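
A minimal instance of the trick described in this abstract is the randomized Laplacian below: constructed input tangents are fed to Taylor-mode AD (jax.experimental.jet), whose second-order output for the input series (v, 0) is the contraction v^T H v, and averaging over random tangents estimates tr(H). This is a sketch in the spirit of the method, specialized to one operator, not the authors' general estimator.

    import jax
    import jax.numpy as jnp
    from jax.experimental import jet

    def stochastic_laplacian(f, x, key, n_samples=64):
        """Unbiased Laplacian estimate of a scalar function f at x.

        With input series (v, 0), jet propagates the path t -> x + t v and
        returns the derivatives (J_f v, v^T H_f v); for Rademacher v,
        E[v^T H v] = tr(H), so second-order jet calls replace the full
        Hessian computation."""
        def quad_form(v):
            _, (_, v_H_v) = jet.jet(f, (x,), ((v, jnp.zeros_like(x)),))
            return v_H_v
        vs = jax.random.rademacher(key, (n_samples, x.size), dtype=x.dtype)
        return jnp.mean(jnp.stack([quad_form(v) for v in vs]))

    # Sanity check: the Laplacian of sum(sin(x)) is -sum(sin(x)).
    f = lambda x: jnp.sum(jnp.sin(x))
    x = jnp.arange(1.0, 4.0)
    print(stochastic_laplacian(f, x, jax.random.PRNGKey(0)))
    print(-jnp.sum(jnp.sin(x)))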