Unified Pretraining Framework for Document Understanding

Apr-24-2026, 09:33:36 GMT–Neural Information Processing Systems

Document intelligence automates the extraction of information from documents and supports many business applications. Recent self-supervised learning methods trained on large-scale unlabeled document datasets have opened up promising directions for reducing annotation effort. However, most existing document pretraining methods are still language-dominated.

information retrieval, machine learning, natural language

Industry:
- Information Technology (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language
    - Text Processing (0.68)
    - Information Retrieval (0.46)
  - Machine Learning
    - Inductive Learning (0.68)
    - Neural Networks (0.68)

Authors: Jiuxiang Gu, Vlad I. Morariu
