Python
AlexBoss, 2020-04-07 11:43:47

How do I train Tesseract for my own font?

Good afternoon. I'm trying to train Tesseract v5 for my own font, because the standard model gives incorrect results even after image preprocessing. Can someone explain how this is done or point me to a manual? I've been searching for several days to no avail.

2 answers
Alex F, 2020-04-07
@delvin-fil

Tesseract 4.1.1 with pytesseract 0.3.0 recognizes everything (well, almost).
Code:

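from PIL import Image
import pytesseract

def ocr(self):
    # fname (path to the input image) is set elsewhere in the application
    global dtout
    pytesseract.pytesseract.tesseract_cmd = '/usr/bin/tesseract'
    # Recognize with several language models at once
    dtout = pytesseract.image_to_string(Image.open(fname), lang='rus+eng+chi_tra+jpn')
    self.ui.textOut.setText(dtout)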
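
The same call can be tried outside the GUI. This is only a rough sketch, not the code from the answer: the names `build_lang` and `ocr_file` are made up for illustration, and it assumes that Pillow, pytesseract, and the traineddata files for the listed languages are installed.

```python
from typing import Iterable


def build_lang(codes: Iterable[str]) -> str:
    """Join Tesseract language codes into the 'a+b+c' form used by the lang argument."""
    cleaned = [c.strip() for c in codes if c.strip()]
    if not cleaned:
        raise ValueError("at least one language code is required")
    return "+".join(cleaned)


def ocr_file(path: str, codes: Iterable[str] = ("rus", "eng")) -> str:
    """Run OCR on one image file and return the recognized text (hypothetical helper)."""
    # Imported here so build_lang stays usable on machines without Tesseract.
    from PIL import Image
    import pytesseract

    with Image.open(path) as img:
        return pytesseract.image_to_string(img, lang=build_lang(codes))
```

Calling `ocr_file("scan.png", ["rus", "eng", "chi_tra", "jpn"])` would pass the same language string as the snippet above; `scan.png` is a placeholder file name.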

Please show the image you are trying to recognize.

Roman Mirilaczvili, 2020-05-15
@2ord

For specialized tasks, the Gamera framework may be suitable. It can even recognize hieroglyphic scripts and various historical inscriptions, though it is aimed more at researchers.
Training is done interactively.
https://gamera.informatik.hsnr.de/docs/gamera-docs...
