A Reinforcement Learning Algorithm in Partially Observable Environments Using Short-Term Memory

Dec-31-1999–Neural Information Processing Systems

We have proved that the model learned by BLHT converges to the optimal model in a given hypothesis space, H, that is, the model that provides the most accurate predictions of percepts and rewards given short-term memory. We believe this result provides a solid basis for BLHT, and that BLHT compares favorably with other methods that use short-term memory.

artificial intelligence, reinforcement learning, short-term memory