- North America > United States > New York (0.05)
- Asia > Middle East > Jordan (0.04)
Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments
The different roles of c and P in this lower bound inspire us to design algorithms that estimate costs and transitions separately. Specifically, assuming the knowledge of c and P, we develop a simple but sub-optimal algorithm and another more involved minimax optimal algorithm (up to logarithmic terms). These algorithms combine the ideas of finite-horizon approximation [Chen et al., 2022a], special Bernstein-style bonuses of the MVP algorithm [Zhang et al., 2020], adaptive confidence widening [Wei and Luo, 2021], as well as some new techniques such as properly penalizing long-horizon policies. Finally, when c and P are unknown, we develop a variant of the MASTER algorithm [Wei and Luo, 2021] and integrate the aforementioned ideas into it to achieve O(min{B⋆S
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- North America > Barbados (0.04)
- Asia > Middle East > Jordan (0.04)
c74214a3877c4d8297ac96217d5189b7-Paper.pdf
However, the resulting methods often suffer from high computational complexity, which has reduced their practical applicability. For example, in the case of multiclass logistic regression, the aggregating forecaster (Foster et al. (2018)) achieves a regret of O(log(Bn)) whereas Online Newton Step achieves O(e^B log(n)), obtaining a double exponential gain in B (a bound on the norm of comparative functions).
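As a sanity check on the gap just described, one can plug moderate values of B and n into the two rates. This is a sketch for illustration only: all constant factors are assumed to be 1, which the O-notation does not actually license.

```python
import math

def aggregating_bound(B, n):
    # O(log(B*n)) regret rate of the aggregating forecaster (constants assumed 1)
    return math.log(B * n)

def ons_bound(B, n):
    # O(e^B * log(n)) regret rate of Online Newton Step (constants assumed 1)
    return math.exp(B) * math.log(n)

# The gap widens rapidly in B: already at B = 10 the ONS rate dwarfs
# the logarithmic one for any reasonable n.
for B in (1, 5, 10):
    print(B, aggregating_bound(B, 10**6), ons_bound(B, 10**6))
```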
- Europe > France > Île-de-France > Paris > Paris (0.05)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Understanding Global Feature Contributions With Additive Importance Measures
Most recent research has addressed this by focusing on local interpretability, which explains a model's individual predictions (e.g., the role of each feature in a patient's diagnosis) [25, 30, 34, 38]. Two special cases are S = ∅ and S = D, which respectively correspond to the mean prediction f_∅(x_∅) = E[f(X)] and the full model prediction f_D(x_D) = f(x).
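The restricted prediction behind these two special cases can be sketched with a marginal-expectation estimate over a background sample. The model f, its weights, and the data below are all hypothetical, chosen only to show that S = ∅ recovers the mean prediction E[f(X)] and S = D recovers the full prediction f(x):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))     # hypothetical background data for the expectation
w = np.array([1.0, -2.0, 0.5])     # hypothetical linear model weights

def f(x):
    return x @ w                   # the model being explained

def f_S(x, S):
    """Restricted prediction f_S(x): fix the features in S to x's values
    and average the model over the background sample for the rest
    (marginal-expectation form; a sketch, not the paper's exact estimator)."""
    Xs = X.copy()
    Xs[:, S] = x[S]                # condition on the subset S
    return f(Xs).mean()

x = np.array([0.3, 1.0, -0.5])
print(f_S(x, []))                  # S = empty set -> mean prediction E[f(X)]
print(f_S(x, [0, 1, 2]), f(x))     # S = D -> full model prediction f(x)
```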
- North America > United States > Washington > King County > Seattle (0.05)
- North America > United States > Washington > King County > Redmond (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- North America > United States > New York > New York County > New York City (0.05)
- North America > United States > District of Columbia > Washington (0.05)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Germany (0.04)
3e6260b81898beacda3d16db379ed329-Supplemental.pdf
Moreover, we set the initial distribution ξ₁ to be uniform over S. As mentioned in the discussion following Theorem 4.1, it holds that D_VA ≤ D_FQI. These findings also shed light on the minimax optimality of the OPE problem. The bound, ∑_{h=1}^H ‖v_h‖_{Λ_h⁻¹}, is tighter. Here, taking the maximum with 1 is to deal with the situation where V̂_h V̂^π_{h+1}(·,·) is close to zero or negative, and the second 1 is to account for the variance of the rewards.
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning -- Supplementary Material -- A Tabular Experiments
For all tabular experiments, we used ε-greedy exploration with ε = 0.1. Furthermore, during pretraining and training, we used a maximum episode length of 100. For evaluation, we set ε = 0, and ran 10 evaluation episodes. We used a fixed step-size α for all tabular experiments. Therefore, there is stochasticity in the update target even in deterministic environments due to exploration of the behavior policy.
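The action-selection rule in this setting (ε-greedy with ε = 0.1 during training, ε = 0 at evaluation) can be sketched as follows; the Q-values below are placeholders, not from the experiments:

```python
import random

def epsilon_greedy(q_values, epsilon):
    """With probability epsilon pick a uniformly random action,
    otherwise pick the greedy (argmax-Q) action.
    epsilon = 0.1 matches the training setting above; epsilon = 0
    recovers the purely greedy evaluation policy."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

q = [0.2, 0.9, 0.1]                    # placeholder Q-values for one state
print(epsilon_greedy(q, 0.0))          # evaluation: always the greedy action
print(epsilon_greedy(q, 0.1))          # training: occasionally explores
```

Because the behavior policy explores with probability ε, the bootstrapped update target varies across visits to the same state even when the environment itself is deterministic, which is the stochasticity the paragraph above refers to.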
VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Feng, Yu, Weir, Nathaniel, Bostrom, Kaj, Bayless, Sam, Cassel, Darion, Chaudhary, Sapana, Kiesl-Reiter, Benjamin, Rangwala, Huzefa
LLMs can perform multi-step reasoning through Chain-of-Thought (CoT), but they cannot reliably verify their own logic. Even when they reach correct answers, the underlying reasoning may be flawed, undermining trust in high-stakes scenarios. To mitigate this issue, we introduce VeriCoT, a neuro-symbolic method that extracts and verifies formal logical arguments from CoT reasoning. VeriCoT formalizes each CoT reasoning step into first-order logic and identifies premises that ground the argument in source context, commonsense knowledge, or prior reasoning steps. The symbolic representation enables automated solvers to verify logical validity while the NL premises allow humans and systems to identify ungrounded or fallacious reasoning steps. Experiments on the ProofWriter, LegalBench, and BioASQ datasets show VeriCoT effectively identifies flawed reasoning, and serves as a strong predictor of final answer correctness. We also leverage VeriCoT's verification signal for (1) inference-time self-reflection, (2) supervised fine-tuning (SFT) on VeriCoT-distilled datasets and (3) preference fine-tuning (PFT) with direct preference optimization (DPO) using verification-based pairwise rewards, further improving reasoning validity and accuracy.
- Europe > Austria > Vienna (0.14)
- Europe > Switzerland (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- (7 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)
- Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.75)
- Information Technology > Artificial Intelligence > Natural Language > Generation (0.67)
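The solver step at the core of a pipeline like VeriCoT's can be illustrated with a toy stand-in: a brute-force propositional entailment check in place of a first-order solver. The encoding of the reasoning step below is entirely hypothetical (invented variable names and formulas), meant only to show how a symbolic check flags whether a conclusion follows from its premises:

```python
from itertools import product

def entails(premises, conclusion, variables):
    """Brute-force propositional entailment: the step is valid iff the
    conclusion holds in every truth assignment satisfying all premises.
    (A toy stand-in for a first-order solver; formulas are Python
    lambdas over a dict mapping variable names to booleans.)"""
    for values in product([False, True], repeat=len(variables)):
        env = dict(zip(variables, values))
        if all(p(env) for p in premises) and not conclusion(env):
            return False               # counter-model found: step is unsound
    return True

# Hypothetical encoding of one reasoning step:
#   premise 1: "if the contract is signed, it is binding"  (signed -> binding)
#   premise 2: "the contract is signed"
premises = [lambda e: (not e["signed"]) or e["binding"],
            lambda e: e["signed"]]
conclusion = lambda e: e["binding"]
print(entails(premises, conclusion, ["signed", "binding"]))  # a valid step
```

A conclusion that does not follow (e.g., "the contract is not binding") would fail the check, which is the kind of signal the abstract describes using for self-reflection and fine-tuning rewards.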