Data Caricatures: On the Representation of African American Language in Pretraining Corpora

Open in new window