AITopics | text line

Collaborating Authors

text line

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

Theodore Bluche

Neural Information Processing SystemsMar-23-2026, 05:09:41 GMT

Neural Information Processing Systems http://nips.cc/

machine learning, natural language, segmentation, (17 more...)

Neural Information Processing Systems

Genre: Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Neural Information Processing SystemsFeb-18-2026, 04:41:08 GMT

Accurate identification and organizing of textual content is crucial for the automation of document processing in the field of form understanding. Existing datasets,

data mining, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Anhui Province > Hefei (0.04)
Asia > Indonesia (0.04)

Genre: Research Report > New Finding (0.67)

Industry: Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.94)
(2 more...)

Add feedback

A file format used in the

Neural Information Processing SystemsFeb-15-2026, 15:14:35 GMT

The keywords were extracted using the procedure described in SectionC. The restricted part of the Muharaf dataset has 428 images distributed under a proprietary license.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Industry:

Law (0.97)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.95)
Information Technology > Artificial Intelligence > Machine Learning (0.68)

Add feedback

Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

Neural Information Processing SystemsFeb-15-2026, 15:14:33 GMT

We present the Manuscripts of Handwritten Arabic (Muharaf) dataset, which is a machine learning dataset consisting of more than 1,600 historic handwritten page images transcribed by experts in archival Arabic.

machine learning, natural language, pattern recognition, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > North Carolina (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Iran (0.04)
(6 more...)

Genre: Research Report (0.46)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.83)

Add feedback

cbeaff878d6446ed06c3e0ffa53477f2-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-10-2025, 16:48:24 GMT

annotation, dataset, srfund dataset, (11 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Anhui Province > Hefei (0.04)
Asia > Indonesia (0.04)

Genre: Research Report > New Finding (0.67)

Industry: Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.94)
(2 more...)

Add feedback

6b8cb6b291045e217c3ff3f854f2fd0f-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-10-2025, 05:09:34 GMT

dataset, muharaf dataset, transcription, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > North Carolina (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Iran (0.04)
(6 more...)

Genre: Research Report (0.46)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.68)

Add feedback

Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets

Kišš, Martin, Hradiš, Michal

arXiv.org Artificial IntelligenceMar-28-2025

Self-supervised learning has emerged as a powerful approach for leveraging large-scale unlabeled data to improve model performance in various domains. In this paper, we explore masked self-supervised pre-training for text recognition transformers. Specifically, we propose two modifications to the pre-training phase: progressively increasing the masking probability, and modifying the loss function to incorporate both masked and non-masked patches. We conduct extensive experiments using a dataset of 50M unlabeled text lines for pre-training and four differently sized annotated datasets for fine-tuning. Furthermore, we compare our pre-trained models against those trained with transfer learning, demonstrating the effectiveness of the self-supervised pre-training. In particular, pre-training consistently improves the character error rate of models, in some cases up to 30 % relatively. It is also on par with transfer learning but without relying on extra annotated text lines.

large language model, machine learning, pattern recognition, (18 more...)

arXiv.org Artificial Intelligence

2503.22513

Country:

Europe > Switzerland (0.05)
Europe > Czechia > South Moravian Region > Brno (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Text Recognition (0.65)

Add feedback

MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs

Zhang, Tianyang, Jiang, Zhuoxuan, Zhang, Haotian, Lin, Lin, Zhang, Shaohua

arXiv.org Artificial IntelligenceMar-6-2025

We propose a novel system, MathMistake Checker, designed to automate step-by-step mistake finding in mathematical problems with lengthy answers through a two-stage process. The system aims to simplify grading, increase efficiency, and enhance learning experiences from a pedagogical perspective. It integrates advanced technologies, including computer vision and the chain-of-thought capabilities of the latest large language models (LLMs). Our system supports open-ended grading without reference answers and promotes personalized learning by providing targeted feedback. We demonstrate its effectiveness across various types of math problems, such as calculation and word problems.

arxiv preprint arxiv, mathmistake checker, student, (11 more...)

arXiv.org Artificial Intelligence

2503.04291

Country: Asia > China > Shanghai > Shanghai (0.07)

Genre: Research Report (0.40)

Industry: Education > Assessment & Standards > Student Performance (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition

Zheng, Jinhui, Liu, Zhiquan, Si, Yain-Whar, Li, Jianqing, Zhang, Xinyuan, Li, Xiaofan, Huang, Haozhi, Gong, Xueyuan

arXiv.org Artificial IntelligenceMar-4-2025

Handwritten Paragraph Text Recognition (HPTR) is a challenging task in Computer Vision, requiring the transformation of a paragraph text image, rich in handwritten text, into text encoding sequences. One of the most advanced models for this task is Vertical Attention Network (VAN), which utilizes a Vertical Attention Module (VAM) to implicitly segment paragraph text images into text lines, thereby reducing the difficulty of the recognition task. However, from a network structure perspective, VAM is a single-branch module, which is less effective in learning compared to multi-branch modules. In this paper, we propose a new module, named Re-parameterizing Vertical Attention Fusion Module (RVAFM), which incorporates structural re-parameterization techniques. RVAFM decouples the structure of the module during training and inference stages. During training, it uses a multi-branch structure for more effective learning, and during inference, it uses a single-branch structure for faster processing. The features learned by the multi-branch structure are fused into the single-branch structure through a special fusion method named Re-parameterization Fusion (RF) without any loss of information. As a result, we achieve a Character Error Rate (CER) of 4.44% and a Word Error Rate (WER) of 14.37% on the IAM paragraph-level test set. Additionally, the inference speed is slightly faster than VAN.

dual-parameter layer, recognition, rvafm, (15 more...)

arXiv.org Artificial Intelligence

2503.03104

Country:

Asia > Macao (0.04)
Asia > China > Guangdong Province (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Text Recognition (0.63)

Add feedback

Filters

Collaborating Authors

text line

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

A file format used in the

Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

cbeaff878d6446ed06c3e0ffa53477f2-Paper-Datasets_and_Benchmarks_Track.pdf

6b8cb6b291045e217c3ff3f854f2fd0f-Supplemental-Datasets_and_Benchmarks_Track.pdf

6b8cb6b291045e217c3ff3f854f2fd0f-Paper-Datasets_and_Benchmarks_Track.pdf

Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets

MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs

RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition