Efficient Text Encoders for Labor Market Analysis
Decorte, Jens-Joris, Van Hautte, Jeroen, Develder, Chris, Demeester, Thomas
arXiv.org Artificial Intelligence
Labor market analysis relies on extracting insights from job advertisements, which provide valuable yet unstructured information on job titles and corresponding skill requirements. While state-of-the-art methods for skill extraction achieve strong performance, they depend on large language models (LLMs), which are computationally expensive and slow. In this paper, we propose ConTeXT-match, a novel contrastive learning approach with token-level attention that is well-suited for the extreme multi-label classification task of skill classification. ConTeXT-match significantly improves skill extraction efficiency and performance, achieving state-of-the-art results with a lightweight bi-encoder model. To support robust evaluation, we introduce Skill-XL, a new benchmark with exhaustive, sentence-level skill annotations that explicitly address the redundancy in the large label space. Finally, we present JobBERT V2, an improved job title normalization model that leverages extracted skills to produce high-quality job title representations. Experiments demonstrate that our models are efficient, accurate, and scalable, making them ideal for large-scale, real-time labor market analysis.
Jul-30-2025
- Country:
  - Asia
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.14)
    - Singapore > Central Region
      - Singapore (0.04)
    - Thailand > Bangkok
      - Bangkok (0.04)
  - Europe
    - Belgium > Flanders
      - East Flanders > Ghent (0.04)
    - France (0.04)
    - Iceland > Capital Region
      - Reykjavik (0.04)
    - Middle East > Malta
      - Eastern Region > Northern Harbour District > St. Julian's (0.04)
    - Netherlands (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - Sweden > Östergötland County
      - Linköping (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - California > Yolo County
      - Davis (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - New York > New York County
      - New York City (0.04)
    - Washington > King County
      - Seattle (0.04)
- Genre:
  - Research Report > New Finding (1.00)
- Industry:
  - Banking & Finance > Economy (0.91)
  - Information Technology (0.93)
  - Marketing (1.00)
- Technology: