Unified Pretraining Framework for Document Understanding

Neural Information Processing Systems 

UDoc is designed to support most document understanding tasks, extending the Transformer to take multimodal embeddings as input.