Rethinking Training Dynamics in Scale-wise Autoregressive Generation