Scaling Optimal LR Across Token Horizons