Goto

Collaborating Authors

 pomdp


Overleaf Example

Neural Information Processing Systems

We model episode sessions--parts of the episode where the latent state isfixed--and propose three keymodifications toexisting meta-RL methods: (i) consistency of latent information within sessions, (ii) session masking, and (iii) priorlatent conditioning.