How to split a table from a scan into cells?
I wrote a Python script that splits a scanned document into blocks and saves each block as a separate file; these files are then fed to Tesseract. The problem arises when a table appears in the document: how can I detect that a block is a table and cut it into cells, taking colspan / rowspan into account?
Pavel, where exactly does "the problem is the appearance of a table in the document" come in?
Your code produces a NumPy array: that is a set of numbers, not table data.
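For what it's worth, one way to get from "a set of numbers" to cells could look like the sketch below. It is not the asker's code and not anything Tesseract provides; it is a plain projection-profile idea under several assumptions: the block is already binarized (1 = dark pixel), the table is not skewed, it has ruling lines, and each ruling line spans at least half of the table. The names `find_lines`, `split_table`, and `min_fraction` are made up for illustration. A block with no detected grid returns an empty list, which can serve as a rough "this is not a table" signal. A missing border segment between two grid cells is treated as a colspan / rowspan merge.

```python
def find_lines(profile, length, min_fraction=0.5):
    """Group consecutive indices whose dark-pixel count covers at least
    min_fraction of the span; return one (start, end) pair per ruling line."""
    lines, start = [], None
    for i, count in enumerate(profile):
        dark = count >= min_fraction * length
        if dark and start is None:
            start = i
        elif not dark and start is not None:
            lines.append((start, i - 1))
            start = None
    if start is not None:
        lines.append((start, len(profile) - 1))
    return lines


def split_table(img, min_fraction=0.5):
    """img: 2-D list of 0/1 values (1 = dark pixel).
    Returns cell interiors as (top, left, bottom, right), inclusive,
    with merged (colspan / rowspan) cells reported as one box."""
    height, width = len(img), len(img[0])
    row_profile = [sum(row) for row in img]
    col_profile = [sum(img[y][x] for y in range(height)) for x in range(width)]
    h_lines = find_lines(row_profile, width, min_fraction)
    v_lines = find_lines(col_profile, height, min_fraction)
    n_rows, n_cols = len(h_lines) - 1, len(v_lines) - 1
    if n_rows < 1 or n_cols < 1:
        return []  # no grid of ruling lines: probably not a ruled table

    def bounds(r, c):
        # Interior of grid cell (r, c), excluding the ruling lines themselves.
        return (h_lines[r][1] + 1, v_lines[c][1] + 1,
                h_lines[r + 1][0] - 1, v_lines[c + 1][0] - 1)

    # Union-find over grid cells; cells with no border between them are joined.
    parent = {(r, c): (r, c) for r in range(n_rows) for c in range(n_cols)}

    def find(cell):
        while parent[cell] != cell:
            cell = parent[cell]
        return cell

    for r in range(n_rows):
        for c in range(n_cols):
            top, left, bottom, right = bounds(r, c)
            if c + 1 < n_cols:
                # Right border: sample the vertical line along this cell's height.
                x = v_lines[c + 1][0]
                dark = sum(img[y][x] for y in range(top, bottom + 1))
                if 2 * dark < bottom - top + 1:
                    parent[find((r, c))] = find((r, c + 1))
            if r + 1 < n_rows:
                # Bottom border: sample the horizontal line along this cell's width.
                y = h_lines[r + 1][0]
                dark = sum(img[y][x] for x in range(left, right + 1))
                if 2 * dark < right - left + 1:
                    parent[find((r, c))] = find((r + 1, c))

    # Bounding box of every merged group (non-rectangular merges get their box).
    boxes = {}
    for r in range(n_rows):
        for c in range(n_cols):
            top, left, bottom, right = bounds(r, c)
            root = find((r, c))
            if root in boxes:
                t, l, b, rt = boxes[root]
                top, left = min(t, top), min(l, left)
                bottom, right = max(b, bottom), max(rt, right)
            boxes[root] = (top, left, bottom, right)
    return sorted(boxes.values())
```

If the block is a NumPy array, `block.tolist()` gives the nested list this sketch expects (after thresholding to 0/1). Each returned box can then be cropped and sent to Tesseract separately. Real scans would likely need deskewing and some noise tolerance first, and borderless tables are out of scope for this approach.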