Goto

Collaborating Authors

 Givan, Robert


Inductive Policy Selection for First-Order MDPs

arXiv.org Artificial Intelligence

We select policies for large Markov Decision Processes (MDPs) with compact first-order representations. We find policies that generalize well as the number of objects in the domain grows, potentially without bound. Existing dynamic-programming approaches based on flat, propositional, or first-order representations either are impractical here or do not naturally scale as the number of objects grows without bound. We implement and evaluate an alternative approach that induces first-order policies using training data constructed by solving small problem instances using PGraphplan (Blum & Langford, 1999). Our policies are represented as ensembles of decision lists, using a taxonomic concept language. This approach extends the work of Martin and Geffner (2000) to stochastic domains, ensemble learning, and a wider variety of problems. Empirically, we find "good" policies for several stochastic first-order MDPs that are beyond the scope of previous approaches. We also discuss the application of this work to the relational reinforcement-learning problem.
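To make the policy representation concrete, here is a minimal Python sketch of a decision-list policy over relational states, with a majority-vote ensemble on top. The blocks-world predicates and actions are invented for illustration and are not the paper's actual taxonomic concept language.

```python
# Illustrative sketch (not the paper's implementation): a decision-list
# policy over relational states, plus a simple majority-vote ensemble.
# Predicates and actions are hypothetical blocks-world examples.
from collections import Counter

def clear(state, x):
    return ("clear", x) in state

def on_table(state, x):
    return ("on-table", x) in state

def misplaced(state, x):
    return ("misplaced", x) in state   # hypothetical derived predicate

# Each rule pairs a concept (a conjunction of predicates over one
# object variable) with an action; the first rule with a match decides.
RULES = [
    (lambda s, x: clear(s, x) and misplaced(s, x), "put-on-table"),
    (lambda s, x: on_table(s, x) and not misplaced(s, x), "noop"),
]

def decision_list_policy(state, objects, rules=RULES):
    """Return the first (action, object) pair suggested by the rule list."""
    for concept, action in rules:
        for obj in objects:
            if concept(state, obj):
                return (action, obj)
    return ("noop", None)

def ensemble_policy(state, objects, policies):
    """Majority vote over a bag of learned decision-list policies."""
    votes = Counter(p(state, objects) for p in policies)
    return votes.most_common(1)[0][0]
```

The first-match semantics mirrors how decision lists order rules from most to least specific, and because rules quantify over objects rather than naming them, the same list applies unchanged as the number of objects grows.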


Estimating Densities with Non-Parametric Exponential Families

arXiv.org Machine Learning

We propose a novel approach for density estimation with exponential families for the case when the true density may not fall within the chosen family. Our approach augments the sufficient statistics with features designed to accumulate probability mass in the neighborhood of the observed points, resulting in a non-parametric model similar to kernel density estimators. We show that under mild conditions, the resulting model uses only the sufficient statistics if the density is within the chosen exponential family, and asymptotically, it approximates densities outside of the chosen exponential family. Using the proposed approach, we modify the exponential random graph model, commonly used for modeling distributions over small graphs, to address the well-known issue of model degeneracy.
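The idea of augmenting the sufficient statistics can be sketched in a few lines. The 1-D example below is illustrative only; the Gaussian base family, bump bandwidth, grid normalizer, and ridge penalty are assumptions for the sketch, not the paper's construction.

```python
# Illustrative 1-D sketch (not the paper's estimator): a Gaussian
# exponential family with sufficient statistics [x, x^2], augmented
# with RBF "bump" features centered at a subsample of the data, fit
# by penalized maximum likelihood with a grid-based normalizer.
import numpy as np
from scipy.optimize import minimize
from scipy.special import logsumexp

def features(x, centers, bw=0.3):
    x = np.atleast_1d(x)[:, None]
    base = np.hstack([x, x ** 2])                     # parametric part
    bumps = np.exp(-0.5 * ((x - centers) / bw) ** 2)  # non-parametric part
    return np.hstack([base, bumps])

def neg_log_lik(theta, X, centers, grid, lam=1e-2):
    F, G = features(X, centers), features(grid, centers)
    dx = grid[1] - grid[0]
    log_z = logsumexp(G @ theta) + np.log(dx)         # log-partition on grid
    # Penalizing only the bump weights lets the base family carry the
    # fit when it suffices; the bumps absorb any residual mass.
    return -(F @ theta).mean() + log_z + lam * np.sum(theta[2:] ** 2)

rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(-2, 0.5, 200), rng.normal(2, 0.5, 200)])
centers = X[rng.choice(len(X), 20, replace=False)]
grid = np.linspace(-6.0, 6.0, 1000)

theta = minimize(neg_log_lik, np.zeros(2 + len(centers)),
                 args=(X, centers, grid)).x
density = np.exp(features(grid, centers) @ theta)
density /= density.sum() * (grid[1] - grid[0])        # normalized estimate
```

On this bimodal sample the Gaussian base family alone cannot fit both modes, so the fitted bump weights are nonzero; on a truly Gaussian sample the penalty drives them toward zero, matching the abstract's claim that the model falls back on the sufficient statistics when the family is correct.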


Approximate Policy Iteration with a Policy Language Bias

Neural Information Processing Systems

We explore approximate policy iteration, replacing the usual cost-function learning step with a learning step in policy space. We give policy-language biases that enable solution of very large relational Markov decision processes (MDPs) that no previous technique can solve. In particular, we induce high-quality domain-specific planners for classical planning domains (both deterministic and stochastic variants) by solving such domains as extremely large MDPs.
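The policy-space learning step can be sketched as follows. The simulator interface `sim.step` and the `fit_policy` learner are hypothetical stand-ins, and the real system uses a policy-language bias (such as taxonomic decision lists) rather than an arbitrary classifier.

```python
# Sketch of approximate policy iteration with a policy-space learning
# step (interfaces assumed, not the paper's system): rollouts under the
# current policy give Q-estimates, the rollout-greedy action labels each
# training state, and a policy induced from those labels replaces
# the usual learned cost function.

def rollout_q(sim, state, action, policy, depth=20, gamma=0.95, trials=5):
    """Monte-Carlo Q-estimate: take `action` once, then follow `policy`.
    `sim.step(state, action) -> (next_state, reward)` is an assumed API."""
    total = 0.0
    for _ in range(trials):
        s, r = sim.step(state, action)
        ret, disc = r, gamma
        for _ in range(depth):
            s, r = sim.step(s, policy(s))
            ret += disc * r
            disc *= gamma
        total += ret
    return total / trials

def policy_iteration_step(sim, states, actions, policy, fit_policy):
    """One iteration: label each state with its rollout-greedy action,
    then induce a new policy (e.g. a decision list) from the labels."""
    labeled = [(s, max(actions(s),
                       key=lambda a: rollout_q(sim, s, a, policy)))
               for s in states]
    return fit_policy(labeled)
```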

