Active Vision Reinforcement Learning under Limited Visual Observability