The Effect of Eligibility Traces on Finding Optimal Memoryless Policies in Partially Observable Markov Decision Processes
Neural Information Processing Systems
Such agent-environment systems can be modeled as partially observable Markov decision processes, or POMDPs (Sondik, 1978).
Dec-31-1999