AITopics | Optical Character Recognition

Collaborating Authors

Optical Character Recognition

Our second example deals with a more challenging problem: the recognition of hand-printed letters of the alphabet. The characters that people print in the ordinary course of filling out forms and questionnaires are surprisingly varied. Gaps abound wherecontinuous lines might be expected; curves and sharp angles appear interchangeably; there is almost every imaginable distortion of slant, shape and size. Even human readers cannot always identify such characters; their error rate is about 3 per cent on randomly selected letters and numbers, seen out of context.
– from Oliver G. Selfridge & Ulric Neisser. PATTERN RECOGNITION BY MACHINE . In Computers & thought, Edward A. Feigenbaum and Julian Feldman (Eds.). MIT Press, Cambridge, MA, USA, 1963. pp. 8-30.

News Overviews Instructional Materials AI-Alerts Classics

How to Classify Documents With OCR and Machine Learning

#artificialintelligenceSep-28-2021, 13:20:06 GMT

Yeelen Knegtering, CEO & Co-founder of Klippa, is passionate about developing digital products that help people to save time on administrative hassle and spend time on the things they love. With a degree in Information Technology at the University of Groningen, he started Klippa with the idea that there had to be a better way to organize and manage receipts. Now, Klippa is a document digitization company with a focus on digitizing and automating document streams for companies.

classify document, klippa, ocr and machine learning

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.85)
Information Technology > Artificial Intelligence > Machine Learning (0.85)

Add feedback

A Proposal of Automatic Error Correction in Text

Luna-Ramírez, Wulfrano A., Jaimez-González, Carlos R.

arXiv.org Artificial IntelligenceSep-24-2021

The great amount of information that can be stored in electronic media is growing up daily. Many of them is got mainly by typing, such as the huge of information obtained from web 2.0 sites; or scaned and processing by an Optical Character Recognition software, like the texts of libraries and goverment offices. Both processes introduce error in texts, so it is difficult to use the data for other purposes than just to read it, i.e. the processing of those texts by other applications like e-learning, learning of languages, electronic tutorials, data minning, information retrieval and even more specialized systems such as tiflologic software, specifically blinded people-oriented applications like automatic reading, where the text would be error free as possible in order to make easier the text to speech task, and so on. In this paper it is showed an application of automatic recognition and correction of ortographic errors in electronic texts. This task is composed of three stages: a) error detection; b) candidate corrections generation; and c) correction -selection of the best candidate. The proposal is based in part of speech text categorization, word similarity, word diccionaries, statistical measures, morphologic analisys and n-grams based language model of Spanish.

correction, error correction, knowledge, (13 more...)

arXiv.org Artificial Intelligence

2112.01846

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > New Jersey (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(5 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

Recognizing Handwritten Digits using scikit_learn

#artificialintelligenceSep-20-2021, 14:16:21 GMT

Recognizing handwritten text is a problem that can be traced back to the first automatic machines that needed to recognize individual characters in handwritten documents. Think about, for example, the ZIP codes on letters at the post office and the automation needed to recognize these five digits. Perfect recognition of these codes is necessary in order to sort mail automatically and efficiently. Included among the other applications that may come to mind is OCR (Optical Character Recognition) software. OCR software must read handwritten text, or pages of printed books, for general electronic documents in which each character is well defined.

dataset, digit, interpolation, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.90)
Information Technology > Artificial Intelligence > Vision > Handwriting Recognition (0.56)

Add feedback

The Machine Learning Overview -- Part I

#artificialintelligenceSep-5-2021, 17:51:12 GMT

Machine Learning, DeepLearning and artificial intelligence algorithms, in general, are attracting increasing attention in various industrial and social fields. However, many interesting algorithms were developed a few years ago.

algorithm, machine learning overview

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.49)

Add feedback

Highly accurate AWS machine learning based handwritten document scanner – IT Brief New Zealand

#artificialintelligenceAug-27-2021, 08:03:23 GMT

It uses Amazon Textract, a machine learning service that automatically extracts text, handwriting, and data from scanned documents to provide highly …

accurate aw machine, brief new zealand

#artificialintelligence

Country: Oceania > New Zealand (0.40)

Industry: Media > News (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.90)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.40)

Add feedback

HCR-Net: A deep learning based script independent handwritten character recognition network

Chauhan, Vinod Kumar, Singh, Sukhdeep, Sharma, Anuj

arXiv.org Artificial IntelligenceAug-15-2021

Handwritten character recognition (HCR) is a challenging learning problem in pattern recognition, mainly due to similarity in structure of characters, different handwriting styles, noisy datasets and a large variety of languages and scripts. HCR problem is studied extensively for a few decades but there is very limited research on script independent models. This is because of factors, like, diversity of scripts, focus of the most of conventional research efforts on handcrafted feature extraction techniques which are language/script specific and are not always available, and unavailability of public datasets and codes to reproduce the results. On the other hand, deep learning has witnessed huge success in different areas of pattern recognition, including HCR, and provides end-to-end learning, i.e., automated feature extraction and recognition. In this paper, we have proposed a novel deep learning architecture which exploits transfer learning and image-augmentation for end-to-end learning for script independent handwritten character recognition, called HCR-Net. The network is based on a novel transfer learning approach for HCR, where some of lower layers of a pre-trained VGG16 network are utilised. Due to transfer learning and image-augmentation, HCR-Net provides faster training, better performance and better generalisations. The experimental results on publicly available datasets of Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi, Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages prove the efficacy of HCR-Net and establishes several new benchmarks. For reproducibility of the results and for the advancements of the HCR research, complete code is publicly released at \href{https://github.com/jmdvinodjmd/HCR-Net}{GitHub}.

dataset, hcr-net, recognition, (15 more...)

arXiv.org Artificial Intelligence

2108.06663

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > India > Chandigarh (0.04)
North America > United States (0.04)
(6 more...)

Genre: Research Report (0.50)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Artificial intelligence technology to manage smart contracts

#artificialintelligenceAug-7-2021, 14:51:43 GMT

Choosing the right contract management software can increase productivity in any company. The main factors are cloud-based and the use of artificial intelligence. Contracts have a direct impact on the success of the company. In order to maintain an overview of the portfolio of contracts and the resulting rights and obligations, automated and clearly defined processes as well as clear lists and dashboards are required. This is especially true when the creation, conclusion and storage of contract documents is decentralized.

artificial intelligence technology, contract, intelligence technology, (5 more...)

#artificialintelligence

Country: Europe > Austria (0.06)

Industry:

Banking & Finance > Economy (0.42)
Information Technology > Services (0.40)

Technology:

Information Technology > Artificial Intelligence > Applied AI (0.54)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.32)

Add feedback

Cortical.io's AI makes bulk contract analysis faster and more accurate

#artificialintelligenceAug-6-2021, 19:40:07 GMT

All the sessions from Transform 2021 are available on-demand now. In the past, reviewing large stacks of documents was a mind-numbing chore for junior attorneys -- a process that could literally consume months of multiple employees' lives. But innovations in artificial intelligence have enabled Cortical.io Using large quantities of documents as inputs and a semantic folding theory-based natural language understanding system to parse content, Contract Intelligence can transform structured agreements and unstructured documents into comprehensible data. The software is able to search, extract, classify, and compare data from contracts, policies, financial reports, and other documents, including the ability to understand the meanings of concepts and whole sentences -- more than just keywords, which might previously have been extracted and searchable using basic optical character recognition.

ai make bulk contract analysis, contract intelligence, cortical, (2 more...)

#artificialintelligence

Industry: Banking & Finance (0.74)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.58)

Add feedback

OCR - [TheJavaSea] OCR - Convert image to text

#artificialintelligenceJul-29-2021, 06:20:15 GMT

Our OCR application allows you to perform basic OCR (Optical Character Recognition) in English and 100+ other languages. It is possible to recognize a...

ocr

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.53)

Add feedback

Searching for ROI in Artificial Intelligence Deployments

#artificialintelligenceJul-21-2021, 14:50:06 GMT

Anyone with any doubts about the interest in AI and its use across enterprise technologies only needs to look at the example of the Intelligent Document Processing (IDP) market and the kind of verticals that are investing in it to quash those doubts. According to the Everest Group's recently published report, Intelligent Document Processing (IDP) State of the Market Report 2021 (purchase required) the market for this segment alone is estimated at $700-750 million in 2020 and expected to grow at a rate of 55-65% over the next year. Cost impact is now the key driver for intelligent document processing adoption, closely followed by improving operational efficiency and productivity. These solutions blend AI technologies to efficiently process all types of documents and feed the output into downstream applications. Optical character recognition (OCR), computer vision, machine learning (ML) and deep learning models, and natural language processing (NLP) are the key core technologies powering IDP capabilities.

adoption, artificial intelligence deployment, respondent, (11 more...)

#artificialintelligence

Country:

Oceania > Australia (0.05)
North America > United States > Florida > Orange County > Orlando (0.05)
Europe > Spain (0.05)
Europe > Netherlands (0.05)

Industry: Information Technology > Security & Privacy (0.32)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.90)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

Add feedback