Slot Induction via Pre-trained Language Model Probing and Multi-level Contrastive Learning
Nguyen, Hoang H., Zhang, Chenwei, Liu, Ye, Yu, Philip S.
–arXiv.org Artificial Intelligence
Recent advanced methods in Natural Language Understanding for Task-oriented Dialogue (TOD) Systems (e.g., intent detection and slot filling) require a large amount of annotated data to achieve competitive performance. In reality, token-level annotations (slot labels) are time-consuming and difficult to acquire. In this work, we study the Slot Induction (SI) task whose objective is to induce slot boundaries without explicit knowledge of token-level slot annotations. We propose leveraging Unsupervised Pre-trained Language Model (PLM) Probing and Contrastive Learning mechanism to exploit (1) unsupervised semantic knowledge extracted from PLM, and (2) additional sentence-level intent label signals available from TOD. Our approach is shown to be effective in SI task and capable of bridging the gaps with token-level supervised models on two NLU benchmark datasets. When generalized to emerging intents, our SI objectives also provide enhanced slot label representations, leading to improved performance on the Slot Filling tasks.
arXiv.org Artificial Intelligence
Aug-9-2023
- Country:
- Europe
- Denmark (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- North America
- Canada
- United States
- California > Santa Clara County
- Palo Alto (0.04)
- Illinois > Cook County
- Chicago (0.04)
- New York > New York County
- New York City (0.04)
- South Carolina (0.05)
- Washington > King County
- Seattle (0.04)
- Wyoming (0.04)
- California > Santa Clara County
- Europe
- Genre:
- Research Report (0.82)
- Technology: