Generating Hard-Negative Out-of-Scope Data with ChatGPT for Intent Classification
Li, Zhijian, Larson, Stefan, Leach, Kevin
–arXiv.org Artificial Intelligence
Intent classifiers must be able to distinguish when a user's utterance does not belong to any supported intent to avoid producing incorrect and unrelated system responses. Although out-of-scope (OOS) detection for intent classifiers has been studied, previous work has not yet studied changes in classifier performance against hard-negative out-of-scope utterances (i.e., inputs that share common features with in-scope data, but are actually out-of-scope). We present an automated technique to generate hard-negative OOS data using ChatGPT. We use our technique to build five new hard-negative OOS datasets, and evaluate each against three benchmark intent classifiers. We show that classifiers struggle to correctly identify hard-negative OOS utterances more than general OOS utterances. Finally, we show that incorporating hard-negative OOS data for training improves model robustness when detecting hard-negative OOS data and general OOS data. Our technique, datasets, and evaluation address an important void in the field, offering a straightforward and inexpensive way to collect hard-negative OOS data and improve intent classifiers' robustness.
arXiv.org Artificial Intelligence
Mar-8-2024
- Country:
- Europe (0.14)
- Asia (0.04)
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay > Golden Gate (0.04)
- North America
- Canada (0.04)
- United States
- New Jersey (0.04)
- Pennsylvania (0.04)
- New York (0.04)
- Tennessee > Davidson County
- Nashville (0.04)
- Genre:
- Research Report > New Finding (0.93)
- Technology: