Project MOSLA: Recording Every Moment of Second Language Acquisition
Hagiwara, Masato, Tanner, Joshua
–arXiv.org Artificial Intelligence
Second language acquisition (SLA) is a complex and dynamic process. Studies that have attempted to record and analyze this process have typically focused on a single modality (e.g., textual output of learners), covered only a short period of time, and/or lacked control (e.g., failed to capture every aspect of the learning process). In Project MOSLA (Moments of Second Language Acquisition), we have created a longitudinal, multimodal, multilingual, and controlled dataset by inviting participants to learn one of three target languages (Arabic, Spanish, and Chinese) from scratch over a span of two years, exclusively through online instruction, and recording every lesson using Zoom. The dataset is semi-automatically annotated with speaker/language IDs and transcripts by both human annotators and fine-tuned state-of-the-art speech models. Our experiments reveal linguistic insights into learners' proficiency development over time, and show the potential for automatically detecting the areas of focus on the screen purely from the unannotated multimodal data. Our dataset is freely available for research purposes and can serve as a valuable resource for a wide range of applications, including but not limited to SLA, proficiency assessment, language and speech processing, pedagogy, and multimodal learning analytics.
Mar-25-2024
- Country:
- Asia (0.04)
- North America
- United States
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > San Diego County
- San Diego (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- Sweden > Västra Götaland
- Gothenburg (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Genre:
- Instructional Material (0.94)
- Research Report
- New Finding (0.93)
- Experimental Study (0.68)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language (1.00)
- Machine Learning (1.00)
- Cognitive Science (1.00)
- Speech > Speech Recognition (0.95)