AITopics | table 1

Collaborating Authors

table 1

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Fast Reconstruction of Exact Maxwell Dynamics from Sparse Data

DeGenaro, Dan, Li, Xin, Amo, Obed, Pokojovy, Michael, Bargal, Sarah Adel, Lange-Hegermann, Markus, Raiţă, Bogdan

arXiv.org Machine Learning, May-22-2026

We introduce FLASH-MAX, a shallow, exact-by-construction neural network architecture for predicting homogeneous electromagnetic fields from sparse pointwise observations. Each hidden neuron represents a separate exact solution to Maxwell's equations, so that the network satisfies the governing equations symbolically by construction and can be trained end-to-end from sparse data within seconds. We prove a universal approximation result showing that this exact model class remains universal on arbitrary domains. FLASH-MAX reaches sub-1% relative validation error from about 1K sparse pointwise observations in seconds, all while maintaining a zero PDE residual, and keeps single-digit errors even for only 100 observations sampled from 3D space. These results suggest that moving governing structure from the loss into the hypothesis class can dramatically improve the trade-off between precision and optimization speed in scientific machine learning.
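The idea of building the governing equations into each hidden unit can be illustrated with a small sketch. The abstract does not say how FLASH-MAX parameterizes its neurons, so this example swaps in vacuum plane waves, a familiar family of exact solutions to the source-free Maxwell equations, and sums them. The names `plane_wave_neuron`, `superpose`, and `faraday_residual` are hypothetical, units are chosen so that c = 1 by default, and the finite-difference check is only a numerical sanity test, not the paper's method.

```python
import math
from typing import Callable, Sequence, Tuple

Vec = Tuple[float, float, float]
Field = Callable[[Vec, float], Vec]


def dot(u: Vec, v: Vec) -> float:
    return sum(a * b for a, b in zip(u, v))


def cross(u: Vec, v: Vec) -> Vec:
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def plane_wave_neuron(k: Vec, a: Vec, phase: float = 0.0,
                      c: float = 1.0) -> Tuple[Field, Field]:
    """One assumed 'neuron': E = a_perp cos(k.x - w t + phase), B = (k x a_perp)/w cos(...)."""
    k2 = dot(k, k)
    if k2 == 0.0:
        raise ValueError("wave vector k must be non-zero")
    # Project the amplitude onto the plane orthogonal to k so that div E = 0.
    s = dot(a, k) / k2
    a_perp = tuple(ai - s * ki for ai, ki in zip(a, k))
    omega = c * math.sqrt(k2)  # vacuum dispersion relation
    b_amp = tuple(comp / omega for comp in cross(k, a_perp))

    def E(x: Vec, t: float) -> Vec:
        w = math.cos(dot(k, x) - omega * t + phase)
        return tuple(ai * w for ai in a_perp)

    def B(x: Vec, t: float) -> Vec:
        w = math.cos(dot(k, x) - omega * t + phase)
        return tuple(bi * w for bi in b_amp)

    return E, B


def superpose(neurons: Sequence[Tuple[Field, Field]]) -> Tuple[Field, Field]:
    """Sum the neurons; Maxwell's equations are linear, so the sum is also a solution."""
    def E(x: Vec, t: float) -> Vec:
        return tuple(sum(vals) for vals in zip(*(n[0](x, t) for n in neurons)))

    def B(x: Vec, t: float) -> Vec:
        return tuple(sum(vals) for vals in zip(*(n[1](x, t) for n in neurons)))

    return E, B


def faraday_residual(E: Field, B: Field, x: Vec, t: float, h: float = 1e-5) -> float:
    """Max-norm of curl E + dB/dt, estimated with central differences."""
    def dE(axis: int) -> Vec:
        xp = tuple(xi + h * (i == axis) for i, xi in enumerate(x))
        xm = tuple(xi - h * (i == axis) for i, xi in enumerate(x))
        return tuple((p - m) / (2 * h) for p, m in zip(E(xp, t), E(xm, t)))

    dEx, dEy, dEz = dE(0), dE(1), dE(2)  # dEx[j] is the x-derivative of E_j
    curl = (dEy[2] - dEz[1], dEz[0] - dEx[2], dEx[1] - dEy[0])
    dBdt = tuple((p - m) / (2 * h) for p, m in zip(B(x, t + h), B(x, t - h)))
    return max(abs(cu + db) for cu, db in zip(curl, dBdt))
```

In a trained model of this kind, the wave vectors, amplitudes, and phases would be the learnable parameters, and only the data-fitting loss would need to be optimized because the residual of the equations vanishes for every parameter setting.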

artificial intelligence, flash-max, machine learning, (18 more...)

arXiv.org Machine Learning

2605.20514

Country: Europe (0.67)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

On Uniform Error Bounds for Kernel Regression under Non-Gaussian Noise

Teutsch, Johannes, Molodchyk, Oleksii, Leibold, Marion, Faulwasser, Timm, Lederer, Armin

arXiv.org Machine Learning, May-12-2026

Providing non-conservative uncertainty quantification for function estimates derived from noisy observations remains a fundamental challenge in statistical machine learning, particularly for applications in safety-critical domains. In this work, we propose novel non-asymptotic probabilistic uniform error bounds for kernel-based regression. Compared to related bounds in the literature that are restricted to (conditionally) independent sub-Gaussian noise, our bounds accommodate a broad class of noise distributions, including sub-Gaussian, bounded, sub-exponential, and variance/moment-bounded noise. Moreover, our results apply to both correlated and uncorrelated noise. We compare our proposed error bounds with existing results in terms of the induced uncertainty region and their performance in safe control, demonstrating the tightness of the proposed bounds.

artificial intelligence, machine learning, theorem 3, (12 more...)

arXiv.org Machine Learning

2605.09757

Country: Europe > Germany (0.46)

Genre: Research Report (0.83)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Compositional Generalization from First Principles

Neural Information Processing Systems, May-1-2026, 02:03:27 GMT

Leveraging the compositional nature of our world to expedite learning and facilitate generalization is a hallmark of human perception. In machine learning, on the other hand, achieving compositional generalization has proven to be an elusive goal, even for models with explicit compositional priors. To get a better handle on compositional generalization, we here approach it from the bottom up: Inspired by identifiable representation learning, we investigate compositionality as a property of the data-generating process rather than the data itself. This reformulation enables us to derive mild conditions on only the support of the training distribution and the model architecture, which are sufficient for compositional generalization. We further demonstrate how our theoretical framework applies to real-world scenarios and validate our findings empirically. Our results set the stage for a principled theoretical study of compositional generalization.

artificial intelligence, machine learning, object-oriented architecture, (19 more...)

Neural Information Processing Systems

Country: Europe > Germany (0.28)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.34)

It consists of an image encoder with a Vision Transformer [17] architecture, a text encoder with a similar Transformer architecture, and heads that predict bounding boxes and label scores from provided images and text queries. Input(s): An image and a list of free-text object descriptions (queries).

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Europe > Luxembourg (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)

e6c2e85db1f1039177c4495ccd399ac4-Supplemental-Conference.pdf

Neural Information Processing Systems, Apr-30-2026, 03:23:00 GMT

A.1 Preliminary Study

The basic GPT-2 model, which has 12 transformer blocks and 12 attention heads with 768 hidden dimensions, is trained from scratch on each corpus. The Huggingface transformers [4] and PyTorch toolkit [2] are used to train the GPT-2 model in a distributed manner on an A100 GPU server. The hyper-parameters used during training are shown in Table 1.

Table 1: The hyper-parameters during the GPT-2 training procedure.

Hyper-parameter            Value
Optimization steps         100K
Test interval              10K
Dropout rate               0.1
Grad clipping              1.0
Learning rate              5e-5
Batch size                 128
Maximum sequence length    256
Warmup steps               10K
Learning scheduler         Linear decay
Random seed                0
Number of GPUs             4
Learning objective         Cross-Entropy Loss

Most of the hyper-parameters for our proposed method are the same as those in Table 1, to better control variables. The hyper-parameters specific to our proposed method are the length of the repetitive n-gram and its repetition dropout rate p, which are set to 2 and 0.6, respectively.
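The schedule implied by Table 1 (10K warmup steps, a linear-decay scheduler, 100K optimization steps) can be sketched in plain Python. The excerpt does not state the warmup shape or the final learning rate, so the linear warmup from zero and the decay to exactly zero below are assumptions, as is reading the table's learning rate as 5e-5. `TrainConfig` and `learning_rate_at` are hypothetical names, not the authors' code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    """Values copied from Table 1; the field names are illustrative."""
    total_steps: int = 100_000
    warmup_steps: int = 10_000
    peak_lr: float = 5e-5      # assumed reading of the table's learning rate
    batch_size: int = 128
    max_seq_len: int = 256
    dropout: float = 0.1
    grad_clip: float = 1.0
    seed: int = 0


def learning_rate_at(step: int, cfg: TrainConfig = TrainConfig()) -> float:
    """Linear warmup from 0 to peak_lr, then linear decay to 0 (assumed shape)."""
    if step < 0:
        raise ValueError("step must be non-negative")
    if step < cfg.warmup_steps:
        return cfg.peak_lr * step / cfg.warmup_steps
    # After warmup, the rate falls linearly and reaches 0 at total_steps.
    remaining = max(cfg.total_steps - step, 0)
    return cfg.peak_lr * remaining / (cfg.total_steps - cfg.warmup_steps)
```

Under these assumptions the rate peaks at step 10K and is halved again at step 55K, midway through the decay phase.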

category, large language model, machine learning, (11 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)

e242660df1b69b74dcc7fde711f924ff-Supplemental.pdf

Neural Information Processing Systems, Apr-27-2026, 12:00:50 GMT

artificial intelligence, machine learning, variant, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

564127c03caab942e503ee6f810f54fd-Supplemental.pdf

Neural Information Processing Systems, Apr-26-2026, 00:06:03 GMT

This paper solves three NP-hard routing problems: the traveling salesman problem (TSP), the prize collecting TSP (PCTSP), and the capacitated vehicle routing problem (CVRP). This section provides detailed descriptions of PCTSP and CVRP (for TSP, see Section 3). The PCTSP is similar to the TSP, except that we do not have to visit all the nodes and that the destination is not the first node but the depot node, i.e., a tour is not a cycle. Let $N$ be the number of nodes. A PCTSP problem instance is $s = \{(x_i, \lambda_i, \mu_i)\}_{i=1}^{N+1}$, where $x_i \in \mathbb{R}^2$ is the 2D Euclidean coordinate of node $i$, $\lambda_i \in \mathbb{R}$ is the penalty incurred if the node is unvisited, and $\mu_i \in \mathbb{R}$ is the prize collected if the node is visited. $L(\pi|s)$ is the tour length, and $\lambda(\pi|s)$ is the total penalty of the unvisited nodes.
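The two quantities named above, the tour length and the total penalty of unvisited nodes, can be computed for a given tour with a few lines of Python. This is a minimal sketch under stated assumptions: the excerpt does not say where the depot sits in the instance, so the depot index is passed explicitly, and the tour is taken to be an open path that starts at its first listed node and ends at the depot. The names `tour_length`, `unvisited_penalty`, and `collected_prize` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

# One node: (2D coordinate x_i, penalty lambda_i, prize mu_i)
Node = Tuple[Tuple[float, float], float, float]


def tour_length(tour: Sequence[int], nodes: List[Node], depot: int) -> float:
    """Euclidean length of the open path through `tour` that ends at the depot."""
    path = list(tour) + [depot]
    return sum(math.dist(nodes[a][0], nodes[b][0]) for a, b in zip(path, path[1:]))


def unvisited_penalty(tour: Sequence[int], nodes: List[Node], depot: int) -> float:
    """Sum of penalties over nodes that are neither on the tour nor the depot."""
    visited = set(tour) | {depot}
    return sum(pen for i, (_, pen, _) in enumerate(nodes) if i not in visited)


def collected_prize(tour: Sequence[int], nodes: List[Node]) -> float:
    """Sum of prizes over the distinct nodes on the tour."""
    return sum(nodes[i][2] for i in set(tour))


# Toy instance (made-up numbers): three customer nodes plus a depot at index 3.
example_nodes: List[Node] = [
    ((0.0, 0.0), 1.0, 0.5),
    ((3.0, 0.0), 2.0, 0.25),
    ((3.0, 4.0), 4.0, 0.75),
    ((0.0, 4.0), 0.0, 0.0),
]
```

For the toy instance, the tour `[0, 1]` with depot `3` skips node 2, so its penalty is the only one counted, while the path length covers the legs from node 0 to node 1 and from node 1 to the depot.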

artificial intelligence, experiment, machine learning, (17 more...)

Neural Information Processing Systems

Industry: Transportation (0.76)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

