GitHub - tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository)
This package contains an OCR engine - libtesseract and a command line program - tesseract. Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository. The lead developer is Ray Smith.
Nov-6-2022, 12:41:16 GMT
- Country:
- Europe > United Kingdom
- North America > United States
- Colorado > Weld County > Greeley (0.06)
- Technology: