P
P
Pavel Eroshevich2020-08-21 06:27:32
Python
Pavel Eroshevich, 2020-08-21 06:27:32

How to split a table into cells from a scan?

I wrote a python script that splits the scanned document into blocks and cuts them into separate files, these files are fed into Tesseract. The problem is the appearance of a table in the document, how can you understand that this is a table and cut it into cells (take into account collspan / rowspan)?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
S
Sergey Svetlov, 2020-08-22
@svetloffs

Pavel, where is "The problem is the appearance of the table in the document" is this?
Your code produces nparray - this is a set of numbers, but not table data.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question