State of Optical Character Recognition in 2022 part1(Artificial Intelligence)

Oct-22-2022, 07:05:14 GMT–#artificialintelligence

Abstract: Synthetic image generation has recently experienced significant improvements in domains such as natural image or art generation. However, the problem of figure and diagram generation remains unexplored. A challenging aspect of generating figures and diagrams is effectively rendering readable texts within the images. To alleviate this problem, we present OCR-VQGAN, an image encoder, and decoder that leverages OCR pre-trained features to optimize a text perceptual loss, encouraging the architecture to preserve high-fidelity text and diagram structure. To explore our approach, we introduce the Paper2Fig100k dataset, with over 100k images of figures and texts from research papers. The figures show architecture diagrams and methodologies of articles available at arXiv.org from fields like artificial intelligence and computer vision.

artificial intelligence, optical character recognition, text recognition, (11 more...)

#artificialintelligence

Oct-22-2022, 07:05:14 GMT

News Web Page

Add feedback

Genre:
- Research Report (0.54)

Technology:
- Information Technology > Artificial Intelligence
  - Vision > Optical Character Recognition (0.93)
  - Machine Learning
    - Pattern Recognition (0.60)
    - Neural Networks (0.57)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found