AITopics | krishnamurthy

Collaborating Authors

krishnamurthy

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ExplicitExplore-ExploitAlgorithmsinContinuous StateSpaces

Neural Information Processing SystemsFeb-11-2026, 08:28:46 GMT

We then give a practical approximation using neural networks anddemonstrate itsperformance andsampleefficiencyinpractice.

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia > Arlington County > Arlington (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

ProvablyEfficientExplorationforReinforcement LearningUsingUnsupervisedLearning

Neural Information Processing SystemsFeb-11-2026, 06:55:18 GMT

Insomework,functionapproximation scheme is adopted such that essential quantities for policy improvement, e.g.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)

Add feedback

LowerBound

Neural Information Processing SystemsFeb-11-2026, 05:41:36 GMT

Then, we consider sufficient assumptions under which learning good policies requires polynomial number of episodes.

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

PlanningwithGeneralObjectiveFunctions: GoingBeyondTotalRewards

Neural Information Processing SystemsFeb-9-2026, 16:55:25 GMT

Note that inthis simple example, the state transition functionT and the reward functionr stillsatisfy theMarkovproperty.

artificial intelligence, machine learning, reward value, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Planningwith General Objective Functions: Going Beyond Total Rewards

Neural Information Processing SystemsFeb-9-2026, 16:55:18 GMT

O((|S ||A|+ T) H ( log ( 1/")/")). ItisalsoeasyV ( , )andQ ( , , )obtained algorithm.

artificial intelligence, machine learning, neural information processing system, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)
Asia > Middle East > Jordan (0.05)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.32)

Add feedback

11f9e78e4899a78dedd439fc583b6693-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 13:34:30 GMT

There, areward function isdrawn from one of multiple possible reward models atthebeginning ofeveryepisode, buttheidentity ofthechosen rewardmodel is not revealed to the agent. Hence, the latent state space, for which the dynamics are Markovian, is not given to the agent. We study the problem of learning a near optimal policy for two reward-mixing MDPs. Unlike existing approaches that rely on strong assumptions on the dynamics, we make no assumptions and study the problem in full generality.

artificial intelligence, arxivpreprintarxiv, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Improved RegretAnalysisforVariance-Adaptive LinearBanditsandHorizon-FreeLinearMixture MDPs

Neural Information Processing SystemsFeb-7-2026, 07:56:30 GMT

In online learning problems, exploiting low variance plays an important role in obtaining tight performance guarantees yet ischallenging because variances are often not known a priori. Recently, considerable progress has been made by Zhangetal.

artificial intelligence, machine learning, variance, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Arizona (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Education (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

analysis and our analysis of FRANCIS remains unchanged, we wish to note that in our own internal re-review we

Neural Information Processing SystemsAug-15-2025, 00:27:00 GMT

We thank the reviewers for their thoughtful reviews; below we address their main concerns. This allows us to express the misspecification error (e.g., eqn 37 in appendix) directly in every (null 1) Note that the results from Chi et al. We consider this work as a first step in this direction. Is a good representation sufficient for sample efficient reinforcement learning?

algorithm, assumption, international conference, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination

Neural Information Processing SystemsMay-27-2025, 01:08:11 GMT

A recurring theme in statistical learning, online learning, and beyond is that faster convergence rates are possible for problems with low noise, often quantified by the performance of the best hypothesis; such results are known as first-order or small-loss guarantees. While first-order guarantees are relatively well understood in statistical and online learning, adapting to low noise in contextual bandits (and more broadly, decision making) presents major algorithmic challenges. In a COLT 2017 open problem, Agarwal, Krishnamurthy, Langford, Luo, and Schapire asked whether first-order guarantees are even possible for contextual bandits and---if so---whether they can be attained by efficient algorithms. We give a resolution to this question by providing an optimal and efficient reduction from contextual bandits to online regression with the logarithmic (or, cross-entropy) loss. Our algorithm is simple and practical, readily accommodates rich function classes, and requires no distributional assumptions beyond realizability.

artificial intelligence, efficient first-order contextual bandit, machine learning, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.45)

Add feedback