AITopics | point mdp

Collaborating Authors

point mdp

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

96ca792fddef7c1e3366c405022463cb-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 22:06:22 GMT

mdp, point mdp, point mdp distribution, (13 more...)

Neural Information Processing Systems

Country: North America > United States > Utah > Salt Lake County > Salt Lake City (0.05)

Industry: Transportation > Ground > Road (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

96ca792fddef7c1e3366c405022463cb-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 22:06:19 GMT

evaluation, mdp, point mdp, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Asia > Middle East > Jordan (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Ingolstadt (0.04)

Genre:

Research Report (0.68)
Instructional Material (0.46)

Industry:

Health & Medicine (0.68)
Transportation > Infrastructure & Services (0.50)
Transportation > Ground > Road (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

96ca792fddef7c1e3366c405022463cb-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 03:38:07 GMT

artificial intelligence, machine learning, point mdp, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Utah > Salt Lake County > Salt Lake City (0.05)
North America > United States > New York (0.04)

Industry: Transportation > Ground > Road (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

Add feedback

96ca792fddef7c1e3366c405022463cb-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 03:38:03 GMT

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Asia > Middle East > Jordan (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Ingolstadt (0.04)

Genre: Research Report (0.68)

Industry:

Health & Medicine (0.68)
Transportation > Infrastructure & Services (0.50)
Transportation > Ground > Road (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.98)
Information Technology > Artificial Intelligence > Robots (0.93)

Add feedback

The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning

Jayawardana, Vindula, Tang, Catherine, Li, Sirui, Suo, Dajiang, Wu, Cathy

arXiv.org Artificial IntelligenceOct-16-2022

Evaluations of Deep Reinforcement Learning (DRL) methods are an integral part of scientific progress of the field. Beyond designing DRL methods for general intelligence, designing task-specific methods is becoming increasingly prominent for real-world applications. In these settings, the standard evaluation practice involves using a few instances of Markov Decision Processes (MDPs) to represent the task. However, many tasks induce a large family of MDPs owing to variations in the underlying environment, particularly in real-world contexts. For example, in traffic signal control, variations may stem from intersection geometries and traffic flow levels. The select MDP instances may thus inadvertently cause overfitting, lacking the statistical power to draw conclusions about the method's true performance across the family. In this article, we augment DRL evaluations to consider parameterized families of MDPs. We show that in comparison to evaluating DRL methods on select MDP instances, evaluating the MDP family often yields a substantially different relative ranking of methods, casting doubt on what methods should be considered state-of-the-art. We validate this phenomenon in standard control benchmarks and the real-world application of traffic signal control. At the same time, we show that accurately evaluating on an MDP family is nontrivial. Overall, this work identifies new challenges for empirical rigor in reinforcement learning, especially as the outcomes of DRL trickle into downstream decision-making.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2210.08607

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
Asia > Middle East > Jordan (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback