Unified Language Model Pre-training for Natural Language Understanding and Generation

Dec-25-2025, 22:49:34 GMT–Neural Information Processing Systems

This paper presents a new Unified pre-trained Language Model (UniLM) that can be fine-tuned for both natural language understanding and generation tasks. The model is pre-trained using three types of language modeling tasks: unidirectional, bidirectional, and sequence-to-sequence prediction. The unified modeling is achieved by employing a shared Transformer network and utilizing specific self-attention masks to control what context the prediction conditions on. UniLM compares favorably with BERT on the GLUE benchmark, and the SQuAD 2.0 and CoQA question answering tasks.

absolute improvement, name change, unified language model pre-training, (3 more...)

Neural Information Processing Systems

Dec-25-2025, 22:49:34 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.43)