models (LMs). Given a fixed budget of tokens, we study how to best select data