 Optical Character Recognition


Artificial Intelligence Is Cracking Open the Vatican's Secret Archives

#artificialintelligence

But a new project could change all that. Known as In Codice Ratio, it uses a combination of artificial intelligence and optical-character-recognition (OCR) software to scour these neglected texts and make their transcripts available for the very first time. If successful, the technology could also open up untold numbers of other documents at historical archives around the world. OCR has been used to scan books and other printed documents for years, but it's not well suited for the material in the Secret Archives. Traditional OCR breaks words down into a series of letter-images by looking for the spaces between letters.
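The letter-splitting step described above can be illustrated with a small sketch. It assumes a binarized image of one text line, given as rows of 0 (background) and 1 (ink), and cuts wherever a column contains no ink. The function name `segment_columns` and the toy image are hypothetical, chosen only for illustration; this is not the method used by In Codice Ratio or by any particular OCR product.

```python
def segment_columns(image):
    """Split a binary text-line image into spans at blank columns.

    image: list of rows, each a list of 0 (background) or 1 (ink).
    Returns (start, end) column ranges, end exclusive.
    """
    if not image:
        return []
    width = len(image[0])
    # Vertical projection profile: amount of ink in each column.
    profile = [sum(row[x] for row in image) for x in range(width)]

    spans = []
    start = None
    for x, ink in enumerate(profile):
        if ink > 0 and start is None:
            start = x                      # a glyph begins
        elif ink == 0 and start is not None:
            spans.append((start, x))       # a blank column ends the glyph
            start = None
    if start is not None:
        spans.append((start, width))       # glyph runs to the right edge
    return spans


# Toy line image (hypothetical data): three glyphs separated by blank columns.
line = [
    [1, 1, 0, 0, 1, 0, 1, 1],
    [1, 0, 0, 0, 1, 0, 0, 1],
    [1, 1, 0, 0, 1, 0, 1, 1],
]
print(segment_columns(line))  # [(0, 2), (4, 5), (6, 8)]
```

When adjacent letters touch, as in connected handwriting, no blank column separates them and the sketch returns a single span for the whole run, so a splitter of this kind has nothing to cut on.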


TED 2018: Thought-Reading Machines and the Death of Love

WIRED

Ludwig Wittgenstein once imagined that everyone had a box with something in it called a "beetle." Denying the possibility of private language, the philosopher wrote, "No one can look into anyone else's box, and everyone says he knows what a beetle is only by looking at his beetle." Wittgenstein meant that we learn a word by observing the rules governing its use, but no one sees another person's beetle: "It would be quite possible for everyone to have something different in his box," or nothing at all. An apparently intractable fact of life is that our thoughts are inaccessible to one another. Our skulls are like space helmets; we are trapped in our heads, unable to convey the quiddity of our sensations. But how much longer will our thoughts be truly private?


How AI is taking the drudgery out of business processes

#artificialintelligence

The debate on how artificial intelligence (AI) could shape the way we work tends to take place on a grand scale. We talk about a future where driverless vehicles will deliver our goods from factories filled with armies of robot workers. Adam Reynolds, CEO of webexpenses, discusses how we may be missing the more mundane and practical ways that AI is already reshaping our everyday working lives, and transforming the way businesses operate. Thanks to a new generation of AI-based systems and tools we can eliminate a whole swathe of tedious and repetitive work and home tasks – bringing intuition, help and time-saving benefits to our lives. These can be found within every industry – from systems designed to root out important clauses from large volumes of legal documents to medical software that identifies potential risks in patient data.


Mind-reading machine can translate your thoughts and display as text

Daily Mail - Science & tech

Scientists have developed an astonishing mind-reading machine which can translate what you are thinking and instantly display it as text. They claim that it has an accuracy rate of 90 per cent or more and say that it works by interpreting consonants and vowels in our brains. The researchers believe that the machine could one day help patients who suffer from conditions that don't allow them to speak or move. The machine registers and analyses the combination of vowels and consonants that we use when constructing a sentence in our brains. It interprets these sentences based on neural signals and can translate them into text in real time.


Google's new text-to-speech service has more realistic voices

Engadget

Google will now let developers use the text-to-speech synthesis that powers the voices in Google Assistant and Maps. Cloud Text-to-Speech is available now through the Google Cloud Platform and the company says it can be used to power voice response systems in call centers, enable IoT device speech and convert media like news articles and books into a spoken format. There are 32 different voice options in 12 languages and users can customize pitch, speaking rate and volume gain. Additionally, a selection of the available voices were built with Google's WaveNet model. It was developed by Google's DeepMind team and the company first announced it in 2016. Rather than using fragments of speech and stringing them together to make words -- which often sounds very robotic -- WaveNet forms individual sound waves, creating more natural sounding speech.
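As a rough sketch of the customization mentioned above, the snippet below only builds a JSON request body in the shape used by the Cloud Text-to-Speech REST method `text:synthesize`, with `pitch`, `speakingRate` and `volumeGainDb` under `audioConfig`. It sends nothing over the network. The helper name `build_synthesis_request`, the sample text and the voice name `en-US-Wavenet-D` are assumptions for illustration; field names and allowed values should be checked against the current API reference.

```python
import json


def build_synthesis_request(text, language_code="en-US", voice_name=None,
                            pitch=0.0, speaking_rate=1.0, volume_gain_db=0.0):
    """Build a request body for text:synthesize (hypothetical helper)."""
    voice = {"languageCode": language_code}
    if voice_name is not None:
        voice["name"] = voice_name          # e.g. a WaveNet voice (assumed name)
    return {
        "input": {"text": text},
        "voice": voice,
        "audioConfig": {
            "audioEncoding": "MP3",
            "pitch": pitch,                 # semitones relative to the default
            "speakingRate": speaking_rate,  # 1.0 is the normal speed
            "volumeGainDb": volume_gain_db, # gain in dB relative to the default
        },
    }


body = build_synthesis_request(
    "Your call is important to us.",
    voice_name="en-US-Wavenet-D",
    pitch=-2.0,
    speaking_rate=1.25,
)
print(json.dumps(body, indent=2))
```

The resulting body would be sent as an authenticated POST to the service; authentication and the HTTP call are left out because they depend on the project setup.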


The best portable document scanner

Engadget

This post was done in partnership with Wirecutter. When readers choose to buy Wirecutter's independently chosen editorial picks, it may earn affiliate commissions that support its work. After putting in more than 100 hours for research and hands-on testing since 2013, we think the Epson ES-300W is the best portable document scanner for digitizing documents without taking up half of a desktop. It combines scan speeds usually found on full-size scanners with extremely accurate text recognition. And thanks to its built-in Wi-Fi and battery, you can use it almost anywhere--even with a phone or tablet.


Veritone Announces General Availability of Artificial Intelligence Developer Application - Veritone, Inc.

#artificialintelligence

Veritone, Inc. (NASDAQ: VERI), a leading provider of artificial intelligence (AI) insights and cognitive solutions, today announced the general availability of its Veritone Developer application. The application empowers developers of cognitive engines, applications and application programming interfaces (APIs) to bring new AI ideas to life through simple integration with the Veritone aiWARE platform. Veritone Developer is a self-service development environment that empowers developers to create, submit and deploy public and private applications and cognitive engines directly into the aiWARE architecture. After a successful limited beta release to a select group of partners, Veritone Developer is now publicly available as a unique resource for machine learning experts, application development firms, and system integrators. Veritone Developer supports RESTful and GraphQL API integrations as well as engine development in major categories of cognition, including: transcription, translation, face and object recognition, audio/video fingerprinting, optical character recognition (OCR), geolocation, transcoding, and logo recognition, among others.


Residents Blast Mail Delivery Service in Michigan Town

U.S. News

Residents complain that mail carriers have failed to deliver prescription drugs and pension checks. Some also say their mailboxes have been damaged or destroyed by carriers' trucks, that telephone complaint lines were never answered and that postal managers were rude when residents visited the post office in person.


Application of Image Processing in Intelligent Character Recognition

#artificialintelligence

Summary: Image processing is a rapidly evolving field with immense significance in science and engineering. One of the latest applications of image processing is Intelligent Character Recognition (ICR). Intelligent Character Recognition is the computer translation of handwritten text into machine-readable and machine-editable characters. It is an advanced version of an Optical Character Recognition system that allows fonts and different styles of handwriting to be recognized during processing with high accuracy and speed. ICR, in combination with OCR and OMR (Optical Mark Recognition), is used in forms processing. Forms processing is a process that captures the information entered into the data fields of a form and converts it to editable text.


Fooling OCR Systems with Adversarial Text Images

arXiv.org Artificial Intelligence

We demonstrate that state-of-the-art optical character recognition (OCR) based on deep learning is vulnerable to adversarial images. Minor modifications to images of printed text, which do not change the meaning of the text to a human reader, cause the OCR system to "recognize" a different text where certain words chosen by the adversary are replaced by their semantic opposites. This completely changes the meaning of the output produced by the OCR system and by the NLP applications that use OCR for preprocessing their inputs.