Arctic-Extract Technical Report
Chiliński, Mateusz; Ołtusek, Julita; Jaśkowski, Wojciech
arXiv.org Artificial Intelligence
Arctic-Extract is a state-of-the-art model designed for extracting structural data (question answering, entities, and tables) from scanned or digital-born business documents. Despite its SoTA capabilities, the model weighs only 6.6 GiB, making it deployable on resource-constrained hardware such as A10 GPUs with 24 GB of memory. Arctic-Extract can process up to 125 A4 pages on those GPUs, making it suitable for long-document processing. This paper highlights Arctic-Extract's training protocols and evaluation results, demonstrating its strong performance in document understanding.
Nov-21-2025
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America > United States (0.68)
- South America > Chile (0.04)
- Genre:
- Research Report > New Finding (0.88)
- Industry:
- Information Technology (0.46)
- Law (0.69)
- Leisure & Entertainment (0.46)
- Technology: