AITopics | Optical Character Recognition

Collaborating Authors

Optical Character Recognition

Our second example deals with a more challenging problem: the recognition of hand-printed letters of the alphabet. The characters that people print in the ordinary course of filling out forms and questionnaires are surprisingly varied. Gaps abound wherecontinuous lines might be expected; curves and sharp angles appear interchangeably; there is almost every imaginable distortion of slant, shape and size. Even human readers cannot always identify such characters; their error rate is about 3 per cent on randomly selected letters and numbers, seen out of context.
– from Oliver G. Selfridge & Ulric Neisser. PATTERN RECOGNITION BY MACHINE . In Computers & thought, Edward A. Feigenbaum and Julian Feldman (Eds.). MIT Press, Cambridge, MA, USA, 1963. pp. 8-30.

News Overviews Instructional Materials AI-Alerts Classics

what-is-the-use-of-machine-learning-handwriting-recognition

#artificialintelligenceFeb-17-2022, 20:13:23 GMT

Recent Deep Learning advancements, such as the introduction of transformer topologies, have helped us accelerate our handwritten character recognition. Intelligent Character Recognition (ICR), is a term used to describe the process for recognizing handwritten content. ICR algorithms require more intelligence than ordinary OCR. This post will cover the challenges of handwritten text identification and the techniques that can be used to tackle them using deep learning and machine learning. In the healthcare/pharmaceutical industry, patient medication digitization is a serious issue. Roche processes millions of PDFs each day, processing petabytes in medical PDFs.

character recognition, digitization, feature extraction, (3 more...)

#artificialintelligence

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision > Handwriting Recognition (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

Hindi Character Recognition

#artificialintelligenceFeb-13-2022, 19:45:34 GMT

Character recognition is a process that allows computers to recognize written or printed characters such as numbers or letters and to change them into a form that computers can use. As a part of this case study, we are going to recognize "Hindi characters". It is a Character Recognition problem related to computer vision, where our task is to predict the Hindi character present in the image. The Model should predict or recognize the character present in the image in real-time. So the latency of the model should be low.

dataset, dense layer, hindi character recognition, (11 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.84)

Add feedback

Omnifont Persian OCR System Using Primitives

Keipour, Azarakhsh, Eshghi, Mohammad, Ghadikolaei, Sina Mohammadzadeh, Mohammadi, Negin, Ensafi, Shahab

arXiv.org Artificial IntelligenceFeb-13-2022

In this paper, we introduce a model-based omnifont Persian OCR system. The system uses a set of 8 primitive elements as structural features for recognition. First, the scanned document is preprocessed. After normalizing the preprocessed image, text rows and sub-words are separated and then thinned. After recognition of dots in sub-words, strokes are extracted and primitive elements of each sub-word are recognized using the strokes. Finally, the primitives are compared with a predefined set of character identification vectors in order to identify sub-word characters. The separation and recognition steps of the system are concurrent, eliminating unavoidable errors of independent separation of letters. The system has been tested on documents with 14 standard Persian fonts in 6 sizes. The achieved precision is 97.06%.

algorithm, ocr system, recognition, (15 more...)

arXiv.org Artificial Intelligence

2202.06371

Country:

Asia > Middle East > Iran > Tehran Province > Tehran (0.05)
Asia > Singapore (0.05)
Asia > Middle East > UAE > Sharjah Emirate > Sharjah (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.89)

Add feedback

ANPR System, Number Plate Recognition

#artificialintelligenceFeb-11-2022, 12:01:35 GMT

Plate.Vision is a vehicle identification software through license plate recognition,

anpr system, number plate recognition

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.40)

Add feedback

Online Text to Speech platform for generating realistic Voiceover using AI

#artificialintelligenceJan-28-2022, 08:25:15 GMT

Online text to speech converter with realistic natural sounding voices. AI voiceover generator for your videos, free TTS solution for Voiceover

generating realistic voiceover, online text, speech platform

#artificialintelligence

Genre:

Instructional Material > Online (0.60)
Instructional Material > Course Syllabus & Notes (0.60)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (1.00)
Information Technology > Artificial Intelligence > Assistive Technologies (1.00)

Add feedback

Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training

Yang, J., He, Lei

arXiv.org Artificial IntelligenceJan-20-2022

In cross-lingual speech synthesis, the speech in various languages can be synthesized for a monoglot speaker. Normally, only the data of monoglot speakers are available for model training, thus the speaker similarity is relatively low between the synthesized cross-lingual speech and the native language recordings. Based on the multilingual transformer text-to-speech model, this paper studies a multi-task learning framework to improve the cross-lingual speaker similarity. To further improve the speaker similarity, joint training with a speaker classifier is proposed. Here, a scheme similar to parallel scheduled sampling is proposed to train the transformer model efficiently to avoid breaking the parallel training mechanism when introducing joint training. By using multi-task learning and speaker classifier joint training, in subjective and objective evaluations, the cross-lingual speaker similarity can be consistently improved for both the seen and unseen speakers in the training set.

joint training, speaker classifier, speaker similarity, (13 more...)

arXiv.org Artificial Intelligence

2201.08124

Country: North America > United States (0.04)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (0.93)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.62)

Add feedback

A guide to text detection and recognition using MMOCR

#artificialintelligenceJan-16-2022, 04:00:07 GMT

Optical character recognition (OCR) is a sort of image conversion that basically extracts text from a given image, a document photo, etc. Various applications and technologies, such as Adobe Acrobat and the ML-based tool, such as Tesseract OCR, have been developed to aid with this process. In this article, we will go over tasks performed in the OCR method. Thereafter, we will look into MMOCR, a Python-based application that centralizes all OCR-related operations. Below are major points listed that are to be discussed in this article. Let's first discuss text detection.

recognition, text detection, text detection and recognition, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.83)
Information Technology > Data Science > Data Quality > Data Transformation (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

Solving CAPTCHAs With Machine Learning to Enable Dark Web Research

#artificialintelligenceJan-12-2022, 00:35:21 GMT

A joint academic research project from the United States has developed a method to foil CAPTCHA* tests, reportedly outperforming similar state-of-the-art machine learning solutions by using Generative Adversarial Networks (GANs) to decode the visually complex challenges. Testing the new system against the best current frameworks, the researchers found that their method achieves more than 94.4% success on a carefully curated real-world benchmark dataset, and has proved capable of'eliminating human involvement' when navigating a highly CAPTCHA-protected emerging Dark Net Marketplace, automatically resolving CAPTCHA challenges in a maximum of three attempts. The authors contend that their approach represents a breakthrough for cybersecurity researchers, who traditionally have had to bear the costs of supplying humans-in-the-loop to manually solve CAPTCHAs, usually via crowdsourcing platforms such as Amazon Mechanical Turk (AMT). If the system can prove adaptable and resilient, it may further pave the way for more automated oversight systems, and for the indexing and web-scraping of TOR networks. This could enable scalable and high-volume analyses, as well as the development of new cybersecurity approaches and techniques, which have been hamstrung, to date, by CAPTCHA firewalls.

captcha, dw-gan, experiment, (14 more...)

#artificialintelligence

Country:

North America > United States > Florida (0.05)
North America > United States > Arizona (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
(2 more...)

Genre: Research Report > Promising Solution (0.35)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.56)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.35)

Add feedback

XPeng upgrades EV voice assistant with Microsoft text-to-speech tech – FutureIoT

#artificialintelligenceJan-10-2022, 06:20:54 GMT

With a deep understanding of urban mobility, we are finding many more scenarios to leverage AI technology for a high level of driver-machine …

futureiot, xpeng upgrade ev voice assistant

#artificialintelligence

Industry: Media > News (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.40)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (0.40)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.40)

Add feedback

Artificial Intelligence: FinTech's innovation driver - BusinessWorld Online

#artificialintelligenceJan-6-2022, 13:45:16 GMT

FinTech refers to any idea or innovation that improves or optimizes the way individuals or companies conduct financial activities. Early FinTech concentrated on developing add-on products to complement existing financial services. This combination of finance and technology has spawned a slew of valuable goods and services that redefine financial services and make them more accessible to the general public. Some of these products and services include insurance aggregators, mobile wallets, AI investment management advisers, peer-to-peer (P2P) lending and crowdfunding tools, and platforms for trading financial assets. The cutting-edge solutions that contributed to such technologies include Blockchain, Deep Learning, and Artificial Intelligence (AI).

artificial intelligence, businessworld online, fintech, (11 more...)

#artificialintelligence

Genre: Research Report (0.35)

Industry:

Banking & Finance > Financial Services (0.73)
Information Technology > Security & Privacy (0.49)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.30)

Add feedback