AITopics | Optical Character Recognition

Collaborating Authors

Optical Character Recognition

Our second example deals with a more challenging problem: the recognition of hand-printed letters of the alphabet. The characters that people print in the ordinary course of filling out forms and questionnaires are surprisingly varied. Gaps abound wherecontinuous lines might be expected; curves and sharp angles appear interchangeably; there is almost every imaginable distortion of slant, shape and size. Even human readers cannot always identify such characters; their error rate is about 3 per cent on randomly selected letters and numbers, seen out of context.
– from Oliver G. Selfridge & Ulric Neisser. PATTERN RECOGNITION BY MACHINE . In Computers & thought, Edward A. Feigenbaum and Julian Feldman (Eds.). MIT Press, Cambridge, MA, USA, 1963. pp. 8-30.

News Overviews Instructional Materials AI-Alerts Classics

Judge Dismisses Lawsuit Over Mail Delivery

U.S. NewsMay-5-2020, 07:00:31 GMT

The apartment complexes near Western Kentucky University sued the United States Postal Service and a postmaster in January after the agency began delivering mail in bulk to property management offices instead of tenants' mailboxes. The change came after the Postal Service reclassified the residences as dormitories, according to the lawsuit.

artificial intelligence, judge dismiss lawsuit, optical character recognition, (2 more...)

U.S. News

Country: North America > United States > Kentucky (0.80)

Industry:

Government > Post Office (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology: Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.40)

Add feedback

Shape Context descriptor and fast characters recognition

#artificialintelligenceApr-26-2020, 12:35:07 GMT

Matching shapes can be much difficult task then just matching images, for example recognition of hand-written text, or fingerprints. Because most of shapes that we trying to match is heavy augmented. I can bet that you will never write to identical letters for all your life. And look at this from the point of people detection algorithm based on handwriting matching -- it would be just hell. Of course in the age of Neural networks and RNNs it also can be solved in a different way then just straight mathematics, but not always you can use heavy and memory hungry things like NNs.

character recognition, descriptor and fast character recognition, shape context descriptor, (5 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.42)

Add feedback

How to Use Optical Character Recognition for Security System Development

#artificialintelligenceApr-19-2020, 06:44:07 GMT

Applying machine learning techniques to security solutions is one of the current AI trends. This article will cover the approach to developing OCR-based software using deep learning algorithms. This software can be used to analyze and process identification such as a US driver's license as part of a security system for verifying identity. OCR (Optical Character Recognition) technology is already used by machine learning companies for business processes automation and optimization, with use cases ranging from Dropbox using it to parse through pictures to Google Street view identifying different street signs to searching through text messages and translating text in real time. In this particular case, OCR can be used as part of an automated biometric verification system.

driver, ocr solution, optical character recognition, (11 more...)

#artificialintelligence

Country: North America > United States (0.05)

Industry:

Information Technology > Security & Privacy (1.00)
Transportation > Ground > Road (0.62)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

Add feedback

FastSpeech: Fast, Robust and Controllable Text to Speech

Ren, Yi, Ruan, Yangjun, Tan, Xu, Qin, Tao, Zhao, Sheng, Zhao, Zhou, Liu, Tie-Yan

Neural Information Processing SystemsMar-18-2020, 21:46:47 GMT

Neural network based end-to-end text to speech (TTS) has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron 2) usually first generate mel-spectrogram from text, and then synthesize speech from the mel-spectrogram using vocoder such as WaveNet. Compared with traditional concatenative and statistical parametric approaches, neural network based end-to-end models suffer from slow inference speed, and the synthesized speech is usually not robust (i.e., some words are skipped or repeated) and lack of controllability (voice speed or prosody control). In this work, we propose a novel feed-forward network based on Transformer to generate mel-spectrogram in parallel for TTS. Specifically, we extract attention alignments from an encoder-decoder based teacher model for phoneme duration prediction, which is used by a length regulator to expand the source phoneme sequence to match the length of the target mel-spectrogram sequence for parallel mel-spectrogram generation. Experiments on the LJSpeech dataset show that our parallel model matches autoregressive models in terms of speech quality, nearly eliminates the problem of word skipping and repeating in particularly hard cases, and can adjust voice speed smoothly.

fastspeech, mel-spectrogram generation, robust and controllable text, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (0.78)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.64)

Add feedback

Utopia Global Releases Cloud-Based Intelligent Data Capture and Control Software Platform Delivers High Quality Enriched Asset Master Data Leveraging Machine Learning

#artificialintelligenceFeb-27-2020, 17:43:16 GMT

IDCC uniquely leverages optical character recognition, Utopia's advanced machine learning code, intelligent online web search, and document search. Beginning simply with only a photo of a manufacturer's nameplate, IDCC can produce complete and accurate material and asset information. Manufacturer and model data is organized in ISO-14224 standards and can be delivered via a variety of easy-to-integrate methods, including SAP Asset Intelligence Network . The cloud-based nature of IDCC enables cost-effective, rapid deployments by large and small organizations alike. IDCC can be deployed in pure cloud environments, such as SAP Intelligent Asset Management, or hybrid deployments using SAP Master Data Governance, enterprise asset management extension by Utopia.

deployment, master data leveraging machine learning, release cloud-based intelligent data capture, (2 more...)

#artificialintelligence

Genre: Press Release (0.40)

Industry: Information Technology > Services (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.71)

Add feedback

Introducing The AI Reading Machine That Reconstructs Books As Illustrated Haikus

#artificialintelligenceFeb-26-2020, 06:30:57 GMT

The dynamic design duo of Karen Ann Donnachie and Andy Simionato are set to receive the Tokyo Type Directors Club award for their AI reading machine project – a machine that essentially transforms books into short Haikus accompanied by related images. It does this by using computer vision and optical character recognition to'read' books. Then with machine learning and natural language processing, it selects a poetic combination of words while erasing the rest to form an artsy-looking Haiku. While doing this, the reading machine also using Google to search up images that relate to said words. Donnachie and Simionato have released a series of books that we know and love with a slight twist.

haikus, illustrated haikus, reading machine, (5 more...)

#artificialintelligence

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.26)

Technology: Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)

Add feedback

6 strategies for building AI-based software TechBeacon

#artificialintelligenceFeb-19-2020, 17:16:03 GMT

Developing software that incorporates artificial intelligence (AI) can be unpredictable, and you need a unique set of knowledge and skills to code, test, and make sense of the data. What's more, tuning the system can take time, and the decisions AI-based software makes can sometimes be difficult to explain. My organization specializes in developing software test automation tools that help users develop tests that run on different platforms, such as desktop computers and mobile devices. We wanted to make it even easier to write and run these tests, and avoid having to customize the test for each platform. Our research led to adopt natural-language processing, which allows users of our software to describe a test using simple English, and computer vision with optical character recognition to identify the objects on a screen.

ai model, building ai-based software techbeacon, techbeacon, (11 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.55)

Add feedback

Iterative Learning for Reliable Crowdsourcing Systems

Karger, David R., Oh, Sewoong, Shah, Devavrat

Neural Information Processing SystemsFeb-14-2020, 23:26:50 GMT

Crowdsourcing systems, in which tasks are electronically distributed to numerous information piece-workers'', have emerged as an effective paradigm for human-powered solving of large scale problems in domains such as image classification, data entry, optical character recognition, recommendation, and proofreading. Because these low-paid workers can be unreliable, nearly all crowdsourcers must devise schemes to increase confidence in their answers, typically by assigning each task multiple times and combining the answers in some way such as majority voting. In this paper, we consider a general model of such rowdsourcing tasks, and pose the problem of minimizing the total price (i.e., number of task assignments) that must be paid to achieve a target overall reliability. We give new algorithms for deciding which tasks to assign to which workers and for inferring correct answers from the workers' answers. We show that our algorithm significantly outperforms majority voting and, in fact, are asymptotically optimal through comparison to an oracle that knows the reliability of every worker.

iterative learning, majority voting, reliable crowdsourcing system, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.65)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.64)

Add feedback

Volume Regularization for Binary Classification

Crammer, Koby, Wagner, Tal

Neural Information Processing SystemsFeb-14-2020, 21:42:02 GMT

We introduce a large-volume box classification for binary prediction, which maintains a subset of weight vectors, and specifically axis-aligned boxes. Our learning algorithm seeks for a box of large volume that contains simple'' weight vectors which most of are accurate on the training set. Two versions of the learning process are cast as convex optimization problems, and it is shown how to solve them efficiently. The formulation yields a natural PAC-Bayesian performance bound and it is shown to minimize a quantity directly aligned with it. The algorithm outperforms SVM and the recently proposed AROW algorithm on a majority of $30$ NLP datasets and binarized USPS optical character recognition datasets.

binary classification, regularization, weight vector

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.68)

Add feedback

Generalizing Analytic Shrinkage for Arbitrary Covariance Structures

Bartz, Daniel, Müller, Klaus-Robert

Neural Information Processing SystemsFeb-14-2020, 17:43:47 GMT

Analytic shrinkage is a statistical technique that offers a fast alternative to cross-validation for the regularization of covariance matrices and has appealing consistency properties. We show that the proof of consistency implies bounds on the growth rates of eigenvalues and their dispersion, which are often violated in data. We prove consistency under assumptions which do not restrict the covariance structure and therefore better match real world data. In addition, we propose an extension of analytic shrinkage --orthogonal complement shrinkage-- which adapts to the covariance structure. Finally we demonstrate the superior performance of our novel approach on data from the domains of finance, spoken letter and optical character recognition, and neuroscience.

arbitrary covariance structure, consistency, generalizing analytic shrinkage

Neural Information Processing Systems

Genre: Research Report (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.98)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.69)

Add feedback