Structured Language Generation Model for Robust Structure Prediction
Lee, Minho, Min, Junghyun, Lee, Woochul, Lee, Yeonsoo
–arXiv.org Artificial Intelligence
Previous work in structured prediction (e.g. NER, information extraction) using single model make use of explicit dataset information, which helps boost in-distribution performance but is orthogonal to robust generalization in real-world situations. To overcome this limitation, we propose the Structured Language Generation Model (SLGM), a framework that reduces sequence-to-sequence problems to classification problems via methodologies in loss calibration and decoding method. Our experimental results show that SLGM is able to maintain performance without explicit dataset information, follow and potentially replace dataset-specific fine-tuning.
arXiv.org Artificial Intelligence
Feb-18-2024
- Country:
- Europe
- North America > United States (0.14)
- Genre:
- Research Report > New Finding (0.88)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language
- Generation (0.84)
- Large Language Model (0.68)
- Text Processing (0.69)
- Information Technology > Artificial Intelligence