which implies that: $\Pr\big(\|\hat q - q\|_1 \geq \sqrt{d}\,(1/\sqrt{n} + \epsilon)\big) \leq e^{-n\epsilon^2}$
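This tail bound can be sanity-checked numerically. Below is a minimal Monte Carlo sketch, assuming the standard setting for bounds of this form: q is a fixed distribution over d outcomes and q-hat is the empirical distribution of n i.i.d. samples. All constants (d, n, eps, trial count) are illustrative choices, not values from the paper.

import numpy as np

rng = np.random.default_rng(0)
d, n, eps, trials = 10, 1000, 0.05, 2000
q = rng.dirichlet(np.ones(d))          # arbitrary ground-truth distribution

threshold = np.sqrt(d) * (1.0 / np.sqrt(n) + eps)
exceed = 0
for _ in range(trials):
    q_hat = rng.multinomial(n, q) / n  # empirical distribution of n samples
    exceed += np.abs(q_hat - q).sum() >= threshold

print("empirical tail probability:", exceed / trials)
print("stated bound e^(-n eps^2): ", np.exp(-n * eps**2))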

Neural Information Processing Systems

To extend this and adapt other results to our setting, we could now apply the Simulation Lemma [1] to bound the value difference given the model error, or alternatively, develop the theory in the direction of [55] and related work. Code is available at https://github.com/spitis/mocoda. For example, in 2D Navigation, the mask function was implemented as follows:

import torch

def Mask2dNavigation(input_tensor):
    """
    accepts B x num_sa_features, and returns B x num_parents x num_children
    """
    # base local mask; the tensor values are truncated in this excerpt, so
    # the entries below are illustrative placeholders (see the released
    # code for the actual parent-child adjacency)
    mask = torch.tensor([[1.0, 0.0],
                         [0.0, 1.0]])
    # broadcast the same local mask across the batch dimension
    return mask.unsqueeze(0).expand(input_tensor.shape[0], -1, -1)

The advantage of this approach is that we can easily do conditional sampling in case of overlapping parent sets. The CQL implementation uses SAC [17].
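As a hypothetical usage example (the batch size and feature count here are assumptions for illustration; the real feature layout is environment-specific):

# a batch of 4 state-action feature vectors with 2 features each
batch = torch.randn(4, 2)
local_mask = Mask2dNavigation(batch)
print(local_mask.shape)  # torch.Size([4, 2, 2]): B x num_parents x num_children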


95c7dfc5538e1ce71301cf92a9a96bd0-Supplemental.pdf

Neural Information Processing Systems

For regression, we model output noise as a zero-mean Gaussian: N(0, σ²), where σ² is the variance of the noise, treated as a hyperparameter. Neal [21] shows that in the regression setting, the isotropic Gaussian prior for a BNN with a single hidden layer approaches a Gaussian process prior as the number of hidden units tends to infinity, so long as the chosen activation function is bounded. We will use this prior in the baseline BNN for our experiments. In the context of BNNs, our Markov chain is a sequence of random parameters W(1), W(2), ... defined over W, which we construct by defining the transition kernel. BBB is scalable and fast, and therefore can be applied to high-dimensional and large datasets in real-life applications.
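To make Neal's limiting result concrete, here is a minimal sketch (not from the paper; the widths, the 1/sqrt(width) scaling, and the tanh activation are illustrative assumptions) that draws functions from a single-hidden-layer BNN prior with isotropic Gaussian weights and watches the prior's pointwise scale stabilize as the width grows, as the wide-limit Gaussian process predicts.

import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-3.0, 3.0, 50)[:, None]   # 1-D inputs

def sample_bnn_prior(width):
    """One prior draw of f(x) = tanh(x W + b) v / sqrt(width)."""
    W = rng.normal(size=(1, width))
    b = rng.normal(size=(width,))
    v = rng.normal(size=(width,))
    # the 1/sqrt(width) output scaling keeps the prior variance finite,
    # which is what drives convergence to a Gaussian process
    return np.tanh(x @ W + b) @ v / np.sqrt(width)

for width in (10, 100, 10_000):
    draws = np.stack([sample_bnn_prior(width) for _ in range(200)])
    # average pointwise prior std; it stabilizes as width grows (GP limit)
    print(width, draws.std(axis=0).mean().round(3))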


Incorporating Interpretable Output Constraints in Bayesian Neural Networks

Neural Information Processing Systems

The ability to encode informative functional beliefs in BNN priors can significantly reduce the bias and uncertainty of the posterior predictive, especially in regions of input space sparsely covered by training data [27].


2433fec2144ccf5fea1c9c5ebdbc3924-Supplemental-Conference.pdf

Neural Information Processing Systems

For each word, we use WordNet [7] to find its synonyms and build a list of word sets. In addition, to avoid replacement clash, we do not allow any word to appear in more than one word set. Eventually, the top 50 semantically matching pairs are retained for CATER. Since the training data of the victim model is unknown to the malicious users, we randomly select 5M sentences from Common Crawl data as the benign corpus. Numbers in parentheses are results on clean data.
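A minimal sketch of the word-set construction described above, under stated assumptions: it uses NLTK's WordNet interface (not necessarily the paper's tooling), and the clash rule greedily skips any word whose synonym set overlaps a set already taken, so no word ends up in more than one set.

import nltk
from nltk.corpus import wordnet

nltk.download("wordnet", quiet=True)

def build_word_sets(vocab):
    """Greedy WordNet word sets; no word may appear in more than one set."""
    used = set()
    word_sets = []
    for word in vocab:
        synonyms = {l.name() for s in wordnet.synsets(word) for l in s.lemmas()}
        candidate = {word} | synonyms
        if candidate & used:          # replacement clash: skip this word
            continue
        word_sets.append(candidate)
        used |= candidate
    return word_sets

# "film" is dropped because it already appears in the "movie" word set
print(build_word_sets(["movie", "film", "glad"]))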



Supplementary Material: Structured Prediction for Conditional Meta-Learning

Neural Information Processing Systems

This led to the derivation of a more general (and involved) characterization of the estimator $\hat f$. We recall that the distribution $\pi$ samples the two datasets according to the process described in Section 2, namely by first sampling a task-distribution $\rho$ (on $X \times Y$) from $\mu$ and then obtaining $D^{tr}$ and $D^{val}$ by independently sampling points $(x, y)$ from $\rho$. Therefore $\pi = \pi_\mu$ can be seen as implicitly induced by $\mu$. The loss $\triangle$ is of the form (A.5) and admits derivatives of any order, namely $\triangle \in C^\infty(Z \times Y \times X)$.

Assumption 2. Assume $\Theta \subset \mathbb{R}^{d_1}$ and $\mathcal{D} \subset \mathbb{R}^{d_2}$ are compact sets satisfying the cone condition, and assume that there exists a reproducing kernel $k : \mathcal{D} \times \mathcal{D} \to \mathbb{R}$ with associated RKHS $\mathcal{F}$ and $s > (d_1 + 2d_2)/2$ such that the function $g : \mathcal{D} \to \mathcal{H}$ with $\mathcal{H} = W^{s,2}(\Theta \times \mathcal{D})$, characterized by

$$g(D^{tr}) = \int \triangle(\cdot, D^{val} \mid \cdot)\, d\pi(D^{val} \mid D^{tr}) \qquad \forall\, D^{tr} \in \mathcal{D}, \tag{A.7}$$

is such that $g \in \mathcal{H} \otimes \mathcal{F}$ and, for any $D \in \mathcal{D}$, the application of the operator $T(g) : \mathcal{F} \to \mathcal{H}$ to the function $k(D, \cdot) \in \mathcal{F}$ is such that $T(g)\, k(D, \cdot) = g(D)$. The function $g$ in (A.7) can be interpreted as capturing the interaction between $\triangle$ and the meta-distribution $\pi$.
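For concreteness, the induced meta-distribution can be written out explicitly. The display below is a notational sketch (not verbatim from the supplementary material), under the assumption that $D^{tr}$ and $D^{val}$ consist of $n$ and $m$ i.i.d. draws respectively:

$$\pi_\mu\big(D^{tr}, D^{val}\big) = \int \rho^{\otimes n}\big(D^{tr}\big)\, \rho^{\otimes m}\big(D^{val}\big)\, d\mu(\rho),$$

so that conditioning on $D^{tr}$ in (A.7) averages the loss $\triangle$ over the validation sets that the same (unobserved) task $\rho$ would generate.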