Improving Language Model Behavior by Training on a Curated Dataset