Semiparametric Token-Sequence Co-Supervision