A Direct Approach for Handling Contextual Bandits with Latent State Dynamics

Li, Zhen, Stoltz, Gilles

arXiv.org Machine Learning

We revisit the finite-armed linear bandit model of Nelson et al. (2022), where contexts and rewards are governed by a finite hidden Markov chain. Nelson et al. (2022) approach this model via a reduction to linear contextual bandits; to do so, however, they introduce a simplification in which rewards are linear functions of the posterior probabilities over the hidden states given the observed contexts, rather than functions of the hidden states themselves. Their analysis (but not their algorithm) also does not account for the estimation of the HMM parameters, and only provides expected, not high-probability, bounds, which additionally suffer from unnecessarily complex dependencies on the model (such as reward gaps). We instead study the more natural model incorporating direct dependencies on the hidden states (on top of dependencies on the observed contexts, as is natural for contextual bandits), and obtain stronger, high-probability regret bounds for a fully adaptive strategy that estimates the HMM parameters online. These bounds do not depend on the reward functions and depend on the model only through the estimation of the HMM parameters.
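
The reduction described in this abstract can be made concrete with a small sketch. The following is a minimal illustration, not the authors' algorithm: a hypothetical two-state HMM with Gaussian emissions, whose forward-filter posterior over hidden states is fed as the feature vector to a per-arm LinUCB rule. All names, parameters, and the placeholder reward signal are assumptions for illustration.

```python
import numpy as np

# --- Hypothetical 2-state HMM; all parameters are illustrative assumptions ---
rng = np.random.default_rng(0)
A = np.array([[0.9, 0.1],        # transition matrix P(s_t | s_{t-1})
              [0.2, 0.8]])
mu = np.array([-1.0, 1.0])       # emission: context ~ N(mu[s], 1) given state s

def emission_lik(x):
    """Likelihood of the observed context x under each hidden state."""
    return np.exp(-0.5 * (x - mu) ** 2)

def forward_step(belief, x):
    """One HMM forward-filter update: predict, then condition on context x."""
    predicted = belief @ A
    posterior = predicted * emission_lik(x)
    return posterior / posterior.sum()

# LinUCB over posterior features: one parameter vector per arm
n_arms, d, lam, alpha = 3, 2, 1.0, 1.0
V = [lam * np.eye(d) for _ in range(n_arms)]    # per-arm design matrices
b = [np.zeros(d) for _ in range(n_arms)]        # per-arm reward-weighted features

belief = np.array([0.5, 0.5])                   # prior over hidden states
for t in range(1000):
    x = rng.normal(mu[rng.integers(2)])         # observed context (stand-in)
    belief = forward_step(belief, x)            # posterior over hidden states
    # Optimistic arm choice: least-squares estimate plus exploration bonus
    ucb = []
    for a in range(n_arms):
        Vinv = np.linalg.inv(V[a])
        theta_hat = Vinv @ b[a]
        ucb.append(belief @ theta_hat + alpha * np.sqrt(belief @ Vinv @ belief))
    arm = int(np.argmax(ucb))
    r = rng.normal()                            # placeholder reward signal
    V[arm] += np.outer(belief, belief)
    b[arm] += r * belief
```

In this simplified picture, assuming rewards linear in `belief` recovers the simplification attributed to Nelson et al. (2022); the abstract's point is that rewards may depend on the hidden state itself, and that the HMM parameters (here `A` and `mu`, taken as known) must in fact be estimated online.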



On the Effective Number of Linear Regions in Shallow Univariate ReLU Networks: Convergence Guarantees and Implicit Bias

Neural Information Processing Systems

However, what is perhaps more surprising is that, in stark contrast to our classic understanding of generalization in machine learning models, this does not seem to degrade the generalization capabilities of the learned model, in spite of the significant increase in its capacity.
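
For context on the object counted in the title: a shallow univariate ReLU network x -> sum_i v_i * max(0, w_i * x + b_i) is piecewise linear, and its effective number of linear regions counts only the breakpoints at which the slope actually changes. A minimal sketch of that count (illustrative code, not from the paper):

```python
import numpy as np

def effective_linear_regions(w, b, v, tol=1e-8):
    """Count linear regions of x -> sum_i v[i]*relu(w[i]*x + b[i]),
    keeping only the breakpoints where the slope actually changes."""
    mask = np.abs(w) > tol
    knots = np.sort(-b[mask] / w[mask])         # candidate breakpoints

    def slope(x):
        active = (w * x + b) > 0                # units active at x
        return np.sum(v[active] * w[active])

    regions = 1
    for k in knots:
        # Probe the slope just left and just right of the candidate knot
        if abs(slope(k + 1e-6) - slope(k - 1e-6)) > tol:
            regions += 1
    return regions

rng = np.random.default_rng(0)
n = 50
w, b, v = rng.normal(size=n), rng.normal(size=n), rng.normal(size=n)
print(effective_linear_regions(w, b, v))        # at most n + 1, often far fewer
```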



d3e6cd9f66f2c1d3840ade4161cf7406-Paper.pdf

Neural Information Processing Systems

Our bounds hold in infinite-dimensional spaces, thereby showing that finer and finer discretizations do not make this learning problem harder.




On Reward-Free Reinforcement Learning with Linear Function Approximation

Neural Information Processing Systems

During the exploration phase, an agent collects samples without using a pre-specified reward function. After the exploration phase, a reward function is given, and the agent uses samples collected during the exploration phase to compute a near-optimal policy.
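
The two-phase protocol can be sketched as follows for a tabular MDP. Everything here is an illustrative assumption: the environment interface, the uniform exploration standing in for a proper exploration strategy, and planning by value iteration on the empirical model.

```python
import numpy as np

def reward_free_rl(env, n_episodes, horizon, reward_fn, n_states, n_actions):
    """Two-phase reward-free RL sketch for a tabular MDP.
    Phase 1: collect transitions with no reward signal.
    Phase 2: once reward_fn is revealed, plan on the empirical
    transition model via finite-horizon value iteration.
    Assumes env.reset() -> state and env.step(a) -> next state."""
    counts = np.zeros((n_states, n_actions, n_states))

    # --- Phase 1: exploration without rewards ---
    for _ in range(n_episodes):
        s = env.reset()
        for _ in range(horizon):
            a = np.random.randint(n_actions)    # placeholder exploration rule
            s_next = env.step(a)
            counts[s, a, s_next] += 1
            s = s_next

    # Empirical transition kernel (uniform fallback for unseen pairs)
    totals = counts.sum(axis=2, keepdims=True)
    P_hat = np.where(totals > 0, counts / np.maximum(totals, 1), 1.0 / n_states)

    # --- Phase 2: reward function revealed; plan by value iteration ---
    R = np.array([[reward_fn(s, a) for a in range(n_actions)]
                  for s in range(n_states)])
    V = np.zeros(n_states)
    policy = np.zeros((horizon, n_states), dtype=int)
    for h in reversed(range(horizon)):
        Q = R + P_hat @ V                       # shape (n_states, n_actions)
        policy[h] = Q.argmax(axis=1)
        V = Q.max(axis=1)
    return policy
```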


Escaping Saddle-Point Faster under Interpolation-like Conditions

Neural Information Processing Systems

One of the fundamental aspects of over-parametrized models is that they are capable of interpolating the training data. We show that, under interpolation-like assumptions satisfied by the stochastic gradients in an over-parametrization setting, the first-order oracle complexity of the Perturbed Stochastic Gradient Descent (PSGD) algorithm to reach an ε-local-minimizer matches the corresponding deterministic rate of O(1/ε²).
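
A minimal sketch of the PSGD scheme named above (the hyperparameters, the toy objective, and the multiplicative noise model are illustrative assumptions; the noise vanishes where the true gradient vanishes, mimicking an interpolation-like condition): take stochastic gradient steps, and inject an isotropic perturbation whenever the gradient is small, so that strict saddle points are escaped.

```python
import numpy as np

def psgd(stoch_grad, x0, eta=0.01, grad_tol=1e-3, radius=1e-2,
         escape_every=50, n_steps=10_000, seed=0):
    """Perturbed SGD sketch: plain stochastic gradient steps, plus an
    occasional random perturbation when the gradient is small, so that
    strict saddle points are escaped."""
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    last_perturb = -escape_every
    for t in range(n_steps):
        g = stoch_grad(x, rng)
        if np.linalg.norm(g) <= grad_tol and t - last_perturb >= escape_every:
            # Small gradient: possibly near a saddle; inject isotropic noise
            x = x + radius * rng.normal(size=x.shape)
            last_perturb = t
        else:
            x = x - eta * g
    return x

# Toy objective f(x, y) = x^4/4 - x^2/2 + y^2/2:
# strict saddle at the origin, minimizers at (+1, 0) and (-1, 0)
def toy_stoch_grad(w, rng):
    x, y = w
    g = np.array([x**3 - x, y])
    # Multiplicative noise: an interpolation-like condition holds, since
    # the stochastic gradient vanishes wherever the true gradient does
    return g * (1.0 + 0.5 * rng.normal())

print(psgd(toy_stoch_grad, x0=[0.0, 0.5]))  # escapes toward (+1, 0) or (-1, 0)
```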