Batch Value Function Approximation via Support Vectors

Dietterich, Thomas G., Wang, Xin

Neural Information Processing Systems 

This paper presents three kernel-based formulations of batch value function approximation for reinforcement learning. One formulation is based on SVM regression; the second is based on the Bellman equation; and the third seeks only to ensure that good moves have an advantage over bad moves. All formulations attempt to minimize the number of support vectors while fitting the data. Experiments in a difficult, synthetic maze problem show that all three formulations give excellent performance, but the advantage formulation is much easier to train. Unlike policy gradient methods, the kernel methods described here can easily adjust the complexity of the function approximator to fit the complexity of the value function.
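The first formulation lends itself to a quick illustration. The sketch below is a minimal, hypothetical example (not the paper's actual linear program) showing how epsilon-insensitive support-vector regression can fit a value function as a sparse kernel expansion over a batch of sampled states; the maze coordinates, return targets, and the use of scikit-learn's SVR are all assumptions for illustration.

```python
# Minimal sketch: value function approximation via SVM regression.
# Assumes scikit-learn is available; the maze states and return
# targets are toy data, not the paper's benchmark.
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Hypothetical batch of sampled 2-D maze states and Monte-Carlo
# return estimates gathered under some behavior policy.
states = rng.uniform(0.0, 10.0, size=(200, 2))
returns = -np.linalg.norm(states - np.array([9.0, 9.0]), axis=1)  # toy targets

# Epsilon-insensitive SVR with an RBF kernel: the fitted value function
# is a sparse kernel expansion, so the number of support vectors
# controls the complexity of the approximator.
value_fn = SVR(kernel="rbf", C=10.0, epsilon=0.1, gamma=0.5)
value_fn.fit(states, returns)

print("support vectors:", value_fn.support_vectors_.shape[0])
print("estimated V at (5, 5):", value_fn.predict([[5.0, 5.0]])[0])
```

Raising epsilon or lowering C trades fit accuracy for fewer support vectors, which mirrors the abstract's goal of matching approximator complexity to the complexity of the value function.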
