Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification
Khatavkar, Vaibhav, Velankar, Makarand, Petkar, Sneha
–arXiv.org Artificial Intelligence
Captcha are widely used to secure systems from automatic responses by distinguishing computer responses from human responses. Text, audio, video, picture picture-based Optical Character Recognition (OCR) are used for creating captcha. Text-based OCR captcha are the most often used captcha which faces issues namely, complex and distorted contents. There are attempts to build captcha detection and classification-based systems using machine learning and neural networks, which need to be tuned for accuracy. The existing systems face challenges in the recognition of distorted characters, handling variable-length captcha and finding sequential dependencies in captcha. In this work, we propose a segmentation-free OCR model for text captcha classification based on the connectionist temporal classification loss technique. The proposed model is trained and tested on a publicly available captcha dataset. The proposed model gives 99.80\% character level accuracy, while 95\% word level accuracy. The accuracy of the proposed model is compared with the state-of-the-art models and proves to be effective. The variable length complex captcha can be thus processed with the segmentation-free connectionist temporal classification loss technique with dependencies which will be massively used in securing the software systems.
arXiv.org Artificial Intelligence
Feb-8-2024
- Country:
- Africa > Middle East
- Tunisia (0.04)
- Asia
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.04)
- China
- Guangdong Province > Guangzhou (0.04)
- Hubei Province > Wuhan (0.04)
- Zhejiang Province > Hangzhou (0.04)
- India
- Madhya Pradesh > Bhopal (0.04)
- Maharashtra > Pune (0.04)
- NCT
- Japan > Honshū
- Kansai > Kyoto Prefecture > Kyoto (0.04)
- Macao (0.04)
- Singapore (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Thailand > Chiang Rai
- Chiang Rai (0.04)
- Bangladesh > Dhaka Division
- Europe
- Germany
- Baden-Württemberg > Karlsruhe Region
- Heidelberg (0.04)
- Berlin (0.04)
- North Rhine-Westphalia > Cologne Region
- Aachen (0.04)
- Baden-Württemberg > Karlsruhe Region
- Poland > Masovia Province
- Warsaw (0.04)
- Germany
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- California > San Francisco County
- San Francisco (0.14)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York > New York County
- New York City (0.04)
- California > San Francisco County
- Canada > Ontario
- Africa > Middle East
- Genre:
- Research Report (0.70)
- Industry:
- Information Technology > Security & Privacy (0.88)
- Technology: