Efficient Multi-Policy Evaluation for Reinforcement Learning