Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Open in new window