How Well Does First-Token Entropy Approximate Word Entropy as a Psycholinguistic Predictor?