Hierarchical Budget Policy Optimization for Adaptive Reasoning
