What Shape Is Optimal for Masks in Text Removal?
Nakada, Hyakka, Kubota, Marika
arXiv.org Artificial Intelligence
The advent of generative models has dramatically improved the accuracy of image inpainting. In particular, reconstructing the original image by removing specific text from a document image is extremely important for industrial applications. However, most existing text-removal methods focus on deleting simple scene text that appears in images captured by a camera outdoors. Little research addresses complex, practical images with dense text. We therefore created a benchmark dataset for text removal from images containing a large amount of text. Using this data, we found that text-removal performance is sensitive to perturbations of the mask profile. Thus, for practical text-removal tasks, precise tuning of the mask shape is essential. This study developed a method that models highly flexible mask profiles and learns their parameters using Bayesian optimization. The resulting profiles turned out to be character-wise masks. We also found that the minimum cover of a text region is not optimal. Our research is expected to pave the way for user-friendly guidelines for manual masking.
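To make the terms "character-wise mask" and "minimum cover" concrete, the following is a minimal sketch, not the authors' method. It assumes a much simpler parameterization than the flexible profiles described above: each mask is a rectangle grown by a single hypothetical `margin` parameter. The names `Box`, `expand`, `character_mask`, and `line_mask` are illustrative only. A search procedure such as Bayesian optimization would tune parameters of this kind against inpainting quality.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned box in pixel coordinates; right and bottom are exclusive."""
    left: int
    top: int
    right: int
    bottom: int


def expand(box, margin, width, height):
    """Grow a box by `margin` pixels on every side, clipped to the image."""
    return Box(
        max(box.left - margin, 0),
        max(box.top - margin, 0),
        min(box.right + margin, width),
        min(box.bottom + margin, height),
    )


def rasterize(boxes, width, height):
    """Return a binary mask (list of rows) that is 1 inside any box."""
    mask = [[0] * width for _ in range(height)]
    for b in boxes:
        for y in range(b.top, b.bottom):
            for x in range(b.left, b.right):
                mask[y][x] = 1
    return mask


def character_mask(char_boxes, margin, width, height):
    """Character-wise mask: one expanded rectangle per character."""
    grown = [expand(b, margin, width, height) for b in char_boxes]
    return rasterize(grown, width, height)


def line_mask(char_boxes, margin, width, height):
    """Line-wise mask: a single expanded rectangle around all characters."""
    hull = Box(
        min(b.left for b in char_boxes),
        min(b.top for b in char_boxes),
        max(b.right for b in char_boxes),
        max(b.bottom for b in char_boxes),
    )
    return rasterize([expand(hull, margin, width, height)], width, height)


def area(mask):
    """Number of masked pixels."""
    return sum(map(sum, mask))


# Two hypothetical character boxes on a 12x6 image (assumed data).
chars = [Box(1, 2, 4, 4), Box(7, 2, 10, 4)]
tight = character_mask(chars, 0, 12, 6)   # minimum cover of the characters
padded = character_mask(chars, 1, 12, 6)  # character-wise mask with a margin
line = line_mask(chars, 1, 12, 6)         # one rectangle for the whole line
```

In this toy setup, `margin=0` corresponds to the minimum cover, which the abstract reports is not optimal. The padded character-wise mask leaves the gap between characters unmasked, whereas the line-wise mask covers it.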
Dec-1-2025
- Country:
  - Asia
    - India (0.04)
    - Japan
      - Honshū
        - Kansai > Kyoto Prefecture
          - Kyoto (0.04)
        - Kantō > Tokyo Metropolis Prefecture
          - Tokyo (0.14)
      - Kyūshū & Okinawa > Okinawa (0.04)
    - South Korea > Seoul
      - Seoul (0.04)
  - Europe
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - United States
      - Alaska > Anchorage Municipality
        - Anchorage (0.04)
      - California
        - Los Angeles County > Long Beach (0.04)
        - San Diego County > San Diego (0.04)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
  - Oceania > Australia
    - New South Wales > Sydney (0.04)
- Genre:
  - Research Report > New Finding (0.47)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning > Neural Networks (0.94)
      - Natural Language (1.00)
      - Representation & Reasoning > Optimization (0.93)
      - Vision (1.00)
    - Data Science (0.93)
    - Sensing and Signal Processing > Image Processing (1.00)