AITopics | action value

Collaborating Authors

action value

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Integrated accounts of behavioral and neuroimaging data using flexible recurrent neural network models

Amir Dezfouli, Richard Morris, Fabio T. Ramos, Peter Dayan, Bernard Balleine

Neural Information Processing SystemsFeb-13-2026, 11:56:27 GMT

Neural Information Processing Systems http://nips.cc/

action value, brain region, probability, (14 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.14)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

bd31bfd4caa85bffe07a35568182cdfa-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 16:11:28 GMT

agent, coordination pattern, factorization, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
(2 more...)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

7f6e51d8298aa01b084b700ab91aff94-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 05:26:22 GMT

correlation, epistemic uncertainty, neural information processing system, (11 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (0.68)

Industry:

Leisure & Entertainment > Sports > Hockey (1.00)
Leisure & Entertainment > Sports > Soccer (0.69)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Integrated accounts of behavioral and neuroimaging data using flexible recurrent neural network models

Amir Dezfouli, Richard Morris, Fabio T. Ramos, Peter Dayan, Bernard Balleine

Neural Information Processing SystemsNov-20-2025, 17:51:37 GMT

Finally, we validated our method using a previously published dataset.

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.14)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

An Analysis of Action-Value Temporal-Difference Methods That Learn State Values

Daley, Brett, Nagarajan, Prabhat, White, Martha, Machado, Marlos C.

arXiv.org Artificial IntelligenceSep-5-2025

The hallmark feature of temporal-difference (TD) learning is bootstrapping: using value predictions to generate new value predictions. The vast majority of TD methods for control learn a policy by bootstrapping from a single action-value function (e.g., Q-learning and Sarsa). Significantly less attention has been given to methods that bootstrap from two asymmetric value functions: i.e., methods that learn state values as an intermediate step in learning action values. Existing algorithms in this vein can be categorized as either QV -learning or A V -learning. Though these algorithms have been investigated to some degree in prior work, it remains unclear if and when it is advantageous to learn two value functions instead of just one--and whether such approaches are theoretically sound in general. In this paper, we analyze these algorithmic families in terms of convergence and sample efficiency. We find that while both families are more efficient than Expected Sarsa in the prediction setting, only A V -learning methods offer any major benefit over Q-learning in the control setting. Finally, we introduce a new A V -learning algorithm called Regularized Dueling Q-learning (RDQ), which significantly outperforms Dueling DQN in the MinAtar benchmark.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2507.09523

Country: North America > Canada > Alberta (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

bd31bfd4caa85bffe07a35568182cdfa-Paper-Conference.pdf

Neural Information Processing SystemsAug-18-2025, 10:07:41 GMT

agent, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Robots (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Uncertainty-Aware Reinforcement Learning for Risk-Sensitive Player Evaluation in Sports Game

Neural Information Processing SystemsAug-16-2025, 11:03:06 GMT

A major task of sports analytics is player evaluation.

data mining, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (0.68)

Industry:

Leisure & Entertainment > Sports > Hockey (1.00)
Leisure & Entertainment > Sports > Soccer (0.69)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)