SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation

Open in new window