Biased Aggregation, Rollout, and Enhanced Policy Improvement for Reinforcement Learning