Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
–arXiv.org Artificial Intelligence
This paper explores the use of latent bootstrapping, an alternative self-supervision technique, for pretraining language models. Unlike the typical practice of using self-supervision on discrete subwords, latent bootstrapping leverages contextualized embeddings for a richer supervision signal. We conduct experiments to assess how effective this approach is for acquiring linguistic knowledge from limited resources. Specifically, our experiments are based on the BabyLM shared task, which includes pretraining on two small curated corpora and an evaluation on four linguistic benchmarks.
arXiv.org Artificial Intelligence
Oct-30-2023
- Country:
- North America > United States
- Texas (0.04)
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Slovenia (0.04)
- Czechia > Prague (0.04)
- Croatia (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Faroe Islands > Streymoy
- Tórshavn (0.04)
- Estonia > Tartu County
- Tartu (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- Genre:
- Research Report (0.64)
- Industry:
- Education (0.68)
- Technology: