What should be observed for optimal reward in POMDPs?
