Solving POMDPs by Searching the Space of Finite Policies