Scaling Autonomous Agents via Automatic Reward Modeling And Planning