Partially Observable Reinforcement Learning with Memory Traces