AITopics | forexample

Collaborating Authors

forexample

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Decision-Value Attribution in Predict-then-Optimize Systems

Ziliaskopoulos, Konstantinos, Vinel, Alexander, Smith, Alice E.

arXiv.org Machine LearningJun-30-2026

Predictive models are increasingly embedded in operational decision-making, yet standard explanation methods typically explain forecasts rather than the decisions those forecasts induce. This distinction is important in predict-then-optimize systems: large forecast changes may leave the optimizer's action unchanged, while small changes can alter the selected decision and its realized value. We propose Decision Value Attribution (DVA), a Shapley-based framework for attributing the value of a fixed prediction--optimization pipeline. The framework defines cooperative games whose payoff is the downstream decision value, allowing the players to be information sources, optimization or design parameters, or both. We present three variants: InfoDVA attributes value to features, DesignDVA attributes value to operational configurations, and Decision-Value Interactions (DVI) quantifies how information and design jointly create value. We further distinguish post-DVA, which evaluates decisions using realized outcomes, from pre-DVA, which evaluates decisions under the model's full prediction. This separation turns attribution into a decision-level diagnostic of whether the model's operational beliefs align with realized performance. The resulting attributions are expressed in the units of the operational objective and decompose the gain or loss relative to a baseline. Case studies in electricity storage arbitrage and emergency medical service coverage show that predictive explanations can be poor proxies for operational value, that DVA can guide targeted information-control interventions, and that optimization configurations determine when predictive information is decision-relevant.

artificial intelligence, modeling & simulation, total interaction, (16 more...)

arXiv.org Machine Learning

2606.29878

Country: North America > United States > Alabama (0.28)

Genre: Research Report (0.40)

Industry: Energy > Energy Storage (0.34)

Technology:

Information Technology > Artificial Intelligence (0.69)
Information Technology > Modeling & Simulation (0.54)
Information Technology > Data Science (0.54)
Information Technology > Information Management (0.54)

Add feedback

NoLimits.jl: Flexible and Composable Nonlinear Mixed-Effects Modeling in Julia

Huth, Manuel, Arruda, Jonas, Schmid, Nina, Gusinow, Roy, Wieland, Vincent, Peiter, Clemens, Hasenauer, Jan

arXiv.org Machine LearningJun-24-2026

Nonlinear mixed-effects models are widely used to analyze longitudinal data, but existing open-source software often supports only a limited subset of the model structures, inference methods, machine-learning components, automatic differentiation techniques, and random-effects distributions required in modern applications. We introduce NoLimits.jl, an open-source Julia package for flexible and composable nonlinear mixed-effects modeling. Its macro-based modeling language enables observation and latent-state models to be constructed from diverse building blocks, including ordinary differential equations, Markov models, and neural networks. NoLimits.jl supports flexible, covariate-dependent observation and random-effects distributions and provides a unified interface to frequentist inference through Laplace approximation, stochastic expectation maximization, and Bayesian Markov chain Monte Carlo methods. We demonstrate the package on three case studies showcasing its workflows, integration of differentiable machine-learning components, and data-driven estimation of random-effects distributions using normalizing flows. Together, these capabilities substantially expand the range of nonlinear mixed-effects models that can be specified, estimated, and compared within a single open-source framework.

artificial intelligence, machine learning, realnumber, (18 more...)

arXiv.org Machine Learning

2606.24427

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.86)

Add feedback

Ambient Diffusion Guided Recovery for Corruption Robust Reinforcement Learning

Neural Information Processing SystemsJun-23-2026, 01:58:05 GMT

Real-world datasets collected from sensors or human inputs are prone to noise and errors, posing significant challenges for applying offline reinforcement learning (RL). While existing methods have made progress in addressing corrupted actions and rewards, they remain insufficient for handling corruption in high-dimensional state spaces and for cases where multiple elements in the dataset are corrupted simultaneously. Diffusion models, known for their strong denoising capabilities, offer a promising direction for this problem--but their tendency to overfit noisy samples limits their direct applicability. To overcome this, we propose Ambient Diffusion-Guided Dataset Recovery (ADG), a novel approach that pioneers the use of diffusion models to tackle data corruption in offline RL. First, we introduce Ambient Denoising Diffusion Probabilistic Models (DDPM) from approximated distributions, which enable learning on partially corrupted datasets with theoretical guarantees.

justification, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

Taught Well Learned Ill Towards Distillation conditional Backdoor Attack

Neural Information Processing SystemsJun-23-2026, 00:36:57 GMT

Knowledge distillation (KD) is a vital technique for deploying deep neural networks (DNNs) on resource-constrained devices by transferring knowledge from large teacher models to lightweight student models. While teacher models from third-party platforms may undergo security verification (e.g., backdoor detection), we uncover a novel and critical threat: distillation-conditional backdoor attacks (DCBAs). DCBA injects dormant and undetectable backdoors into teacher models, which become activated in student models via the KD process, even with clean distillation datasets. While the direct extension of existing methods is ineffective for DCBA, we implement this attack by formulating it as a bilevel optimization problem and proposing a simple yet effective method (i.e., SCAR). Specifically, the inner optimization simulates the KD process by optimizing a surrogate student model, while the outer optimization leverages outputs from this surrogate to optimize the teacher model for implanting the conditional backdoor.

artificial intelligence, machine learning, mobilenet-v2, (19 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Towards Reasoning Centric Benchmark for Aerial Anomaly Understanding

Neural Information Processing SystemsJun-22-2026, 23:17:29 GMT

While unmanned aerial vehicles (UAVs) offer wide-area, high-altitude coverage for anomaly detection, they face challenges such as dynamic viewpoints, scale variations, and complex scenes. Existing datasets and methods, mainly designed for fixed ground-level views, struggle to adapt to these conditions, leading to significant performance drops in drone-view scenarios. To bridge this gap, we introduce A2Seek (Aerial Anomaly Seek), a large-scale, reasoning-centric benchmark dataset for aerial anomaly understanding. This dataset covers various scenarios and environmental conditions, providing high-resolution real-world aerial videos with detailed annotations, including anomaly categories, frame-level timestamps, region-level bounding boxes, and natural language explanations for causal reasoning. Building on this dataset, we propose A2Seek-R1, a novel reasoning framework that generalizes R1-style strategies to aerial anomaly understanding, enabling a deeper understanding of "Where" anomalies occur and "Why" they happen in aerial frames.

artificial intelligence, arxivpreprintarxiv, natural language, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Industry: Information Technology > Robotics & Automation (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.66)
Information Technology > Artificial Intelligence > Natural Language (0.66)

Add feedback

Bounds on the computational complexity of neurons due to dendritic morphology

Neural Information Processing SystemsJun-22-2026, 22:32:16 GMT

The simple linear threshold units used in many artificial neural networks have a limited computational capacity. Famously, a single unit cannot handle nonlinearly separable problems like XOR. In contrast, real neurons exhibit complex morphologies as well as active dendritic integration, suggesting that their computational capacities outperform those of simple linear units. Considering specific families of Boolean functions, we empirically examine the computational limits of single units that incorporate more complex dendritic structures. For random Boolean functions, we show that there is a phase transition in learnability as a function of the input dimension, with most random functions below a certain critical dimension being learnable and those above not.

artificial intelligence, justification, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Washington > King County > Seattle (0.15)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Linearly Constrained Diffusion Implicit Models

Neural Information Processing SystemsJun-22-2026, 22:28:08 GMT

We introduce Linearly Constrained Diffusion Implicit Models (CDIM), a fast and accurate approach to solving noisy linear inverse problems using diffusion models. Traditional diffusion-based inverse methods rely on numerous projection steps to enforce measurement consistency in addition to unconditional denoising steps. CDIM achieves a 10-50 reduction in projection steps by dynamically adjusting the number and size of projection steps to align a residual measurement energy with its theoretical distribution under the forward diffusion process. This adaptive alignment preserves measurement consistency while substantially accelerating constrained inference. For noise-free linear inverse problems, CDIM exactly satisfies the measurement constraints with few projection steps, even when existing methods fail. We demonstrate CDIM's effectiveness across a range of applications, including super-resolution, denoising, inpainting, deblurring, and 3D point cloud reprojection.

artificial intelligence, justification, machine learning, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Add feedback

d7a2222b8d41014e060cfeb0995501d0-Paper-Conference.pdf

Neural Information Processing SystemsJun-22-2026, 22:12:10 GMT

How can we trust the correctness of a learned model on a particular input of interest? Model accuracy is typically measured on average over a distribution of inputs, giving no guarantee for any fixed input. This paper proposes a theoreticallyfounded solution to this problem: to train Self-Proving models that prove the correctness of their output to a verification algorithm V via an Interactive Proof. SelfProving models satisfy that, with high probability over an input sampled from a given distribution, the model generates a correct output and successfully proves its correctness to V. The soundness property of V guarantees that, for every input, no model can convince V of the correctness of an incorrect output. Thus, a Self-Proving model proves correctness of most of its outputs, while all incorrect outputs (of any model) are detected by V. We devise and analyze two generic methods for learning Self-Proving models: Transcript Learning (TL) which relies on access to transcripts of accepting interactions, and Reinforcement Learning from Verifier Feedback (RLVF) which trains a model by emulating interactions with the verifier.

artificial intelligence, machine learning, urlhttp, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.70)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Rethinking Temporal Pattern Learning in Deep Learning Models for Time Series

Neural Information Processing SystemsJun-22-2026, 21:57:32 GMT

Recent advances in deep learning have driven rapid progress in time series forecasting, yet many state-of-the-art models continue to struggle with robust performance in real-world applications, even when they achieve strong results on standard benchmark datasets. This persistent gap can be attributed to the blackbox nature of deep learning architectures and the inherent limitations of current evaluation frameworks, which frequently lack the capacity to provide clear, quantitative insights into the specific strengths and weaknesses of different models, thereby complicating the selection of appropriate models for particular forecasting scenarios. To address these issues, we propose a synthetic data-driven evaluation paradigm, SynTSBench, that systematically assesses fundamental modeling capabilities of time series forecasting models through programmable feature configuration. Our framework isolates confounding factors and establishes an interpretable evaluation system with three core analytical dimensions: (1) temporal feature decomposition and capability mapping, which enables systematic evaluation of model capacities to learn specific pattern types; (2) robustness analysis under data irregularities, which quantifies noise tolerance thresholds and anomaly recovery capabilities; and (3) theoretical optimum benchmarking, which establishes performance boundaries for each pattern type--enabling direct comparison between model predictions and mathematical optima. Our experiments show that current deep learning models do not universally approach optimal baselines across all types of temporal features.

artificial intelligence, justification, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

forexample

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Decision-Value Attribution in Predict-then-Optimize Systems

NoLimits.jl: Flexible and Composable Nonlinear Mixed-Effects Modeling in Julia

3ff71ee3d8ec70e172be0b2d56e73481-Paper-Conference.pdf

Ambient Diffusion Guided Recovery for Corruption Robust Reinforcement Learning

Taught Well Learned Ill Towards Distillation conditional Backdoor Attack

Towards Reasoning Centric Benchmark for Aerial Anomaly Understanding

Bounds on the computational complexity of neurons due to dendritic morphology

Linearly Constrained Diffusion Implicit Models

d7a2222b8d41014e060cfeb0995501d0-Paper-Conference.pdf

Rethinking Temporal Pattern Learning in Deep Learning Models for Time Series