Provably Learning from Modern Language Models via Low Logit Rank