Merging Language and Domain Specific Models: The Impact on Technical Vocabulary Acquisition
Rousset, Thibault; Kakibuchi, Taisei; Sasaki, Yusuke; Nomura, Yoshihide
arXiv.org Artificial Intelligence
This paper investigates the integration of technical vocabulary in merged language models. We explore the knowledge transfer mechanisms involved when combining a general-purpose language-specific model with a domain-specific model, focusing on the resulting model's comprehension of technical jargon. Our experiments analyze the impact of this merging process on the target model's proficiency in handling specialized terminology. We present a quantitative evaluation of the performance of the merged model, comparing it with that of the individual constituent models. The findings offer insights into the effectiveness of different model merging methods for enhancing domain-specific knowledge and highlight potential challenges and future directions in leveraging these methods for cross-lingual knowledge transfer in Natural Language Processing.
Feb-17-2025
- Country:
  - Asia > Japan
    - Honshū
      - Kantō > Tokyo Metropolis Prefecture
        - Tokyo (0.04)
      - Tōhoku (0.04)
  - Europe > France
    - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - United States
      - Florida > Miami-Dade County
        - Miami (0.04)
      - Illinois > Cook County
        - Chicago (0.04)
- Genre:
  - Research Report > New Finding (0.69)