The Effect of Eligibility Traces on Finding Optimal Memoryless Policies in Partially Observable Markov Decision Processes

Loch, John

Neural Information Processing Systems 

Such agent-environment systems can be modeled as partially observable Markov decision processes, or POMDPs (Sondik, 1978).
