Document Intelligence in the Era of Large Language Models: A Survey
Wang, Weishi; Hu, Hengchang; Zhang, Zhijie; Li, Zhaochen; Shao, Hongxin; Dahlmeier, Daniel
arXiv.org Artificial Intelligence
Document AI (DAI) has emerged as a vital application area and has been significantly transformed by the advent of large language models (LLMs). While earlier approaches relied on encoder-decoder architectures, decoder-only LLMs have revolutionized DAI, bringing remarkable advancements in understanding and generation. This survey provides a comprehensive overview of DAI's evolution, highlighting current research efforts and the future prospects of LLMs in this field. We explore key advancements and challenges in multimodal, multilingual, and retrieval-augmented DAI, and suggest future research directions, including agent-based approaches and document-specific foundation models. The paper aims to provide a structured analysis of the state of the art in DAI and its implications for both academic research and practical applications.
Oct-16-2025
- Country:
  - Asia (1.00)
  - Europe > France (0.67)
  - North America > United States
    - California (0.67)
    - Florida > Miami-Dade County
      - Miami (0.14)
- Genre:
  - Research Report (1.00)
  - Overview (1.00)
- Industry:
  - Information Technology (0.45)
- Technology: