A
A
AbaGardon2018-12-17 20:51:13
HTML
AbaGardon, 2018-12-17 20:51:13

Is it possible to read the table from PDF and transfer it to HTML, and how?

There is a task to transfer all the tables that are in the PDF file to the site so that everything is in HTML tables.
Question:
Is it possible to read a table from PDF and transfer it to HTML, and how?

Answer the question

In order to leave comments, you need to log in

3 answer(s)
W
Wentixon, 2018-12-17
@Wentixon

With a script on the server. Google pdf to html + your language

M
Moskus, 2018-12-18
@Moskus

PDF does not generally store the structure of a document, it is mostly a vector graphic format, not a semantic one. Therefore, the most effective way is recognition through OCR. All kinds of tools that try to extract tables simply based on the position of the text work, of course, faster, but the result is worse. So decide, checkers or go.

A
Alejandro Esquire, 2019-01-15
@A1ejandro

No, there is no such converter. Moreover, some PDFs are generally scans of documents (pictures). Therefore, what kind of universal converter can we talk about? Another thing is recognition (OCR). In principle, this is the most realistic that you can use. If the document is clearly digitized (text, vector graphics), then you can try to drag it out in fragments and paste it somewhere with an attempt to preserve the structure. But often such attempts end in failure. Although sometimes it doesn't work when using Acrobat Reader, it does when using Foxit Reader... Good luck.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question