0
0
0xC0CAC01A2015-12-19 03:49:53
System administration
0xC0CAC01A, 2015-12-19 03:49:53

How to reliably recognize old texts under Linux from the command line?

Like, for example, this text . There, by the way, there is a layer of recognition text, but you can see how this recognition is ugly, and you need a search by last name.

Answer the question

In order to leave comments, you need to log in

1 answer(s)
A
Andrey Ermachenok, 2015-12-19
@eapeap

It was a long time ago - it was necessary to recognize Belarusian texts (symbols i, ў in Cyrillic texts), and FineReader did not know how to do this then, but it has training. Conducted training on 2 pages, he recognized the rest without any problems.
So something like this: take a FineReader or another recognizer that is available for Linux, train it manually, and then set the trained recognizer on scans from the command line.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question