BOW: Reinforcement Learning for Bottlenecked Next Word Prediction
