K
K
Koreetz Koreetz2015-08-06 13:19:23
Automation
Koreetz Koreetz, 2015-08-06 13:19:23

How to transfer data from pdf catalog to excel?

Good afternoon! the crux of the matter is as follows:
There are .pdf catalogs of furniture factories, you need to transfer the data from these catalogs to excel.
Data means: picture + product description + price.
Of course, you can copy-paste from pdf to excel, but I want to automate it somehow !!
Please let me know if anyone has experienced this problem...

Answer the question

In order to leave comments, you need to log in

4 answer(s)
V
V Sh., 2015-08-06
@JuniorNoobie

PDF is, at its core, XML with a specific format. You can read the pdf into an XMLDocument and then use an XSLT transformation or some Excel library (Microsoft.Office.Interop.Excel for C#) to write the data into an Excel template.

V
vasRmba, 2015-08-06
@vasRmba

Try Nitro Pro.

D
Dmitry Kovalsky, 2015-08-06
@dmitryKovalskiy

There is one dirty hack that works for the xls format. You can scrape the markup

<html>
<body>
<table>...</table>
</body>
</html>

And save it to a file with .xls extension. Paradoxical, but it works. But only for tabular data and if you do not need to tie the cells with mathematical formulas.

E
elperus, 2016-09-11
@elperus

The transfer of information from catalogs to EXCEL implies an adequate reorganization of the data structure.
If you use a simple PDF to Excel converter, then you will get, at best, an EXCEL document that will visually look exactly the same as in PDF. It's just not what you need. You are not transferring them to EXCEL in order to look at them (you can also look in PDF). Such transformations are done in order to be able to process this information in the future, for example, import it into your accounting program or website. And here you will find a major disappointment. EXCEL files obtained in this way are completely unsuitable for these purposes.
What to do, you ask. There is only one answer - you need to order a specialized converter. I happened to make such converters for converting large directories. I can say one thing - ordering such a converter is much cheaper than manually transferring information, especially if this will need to be done more than once.
The exception to this is when tabular data needs to be migrated. Yes, ready-made converters can help you here. They know how to move tables. For example, for the case when you need to arrange a regular transfer from PDF to EXCEL of invoices, waybills, invoices (invoices), etc. you can use ASTAR PDF converter. It also has a Russian interface. You can see it here: ASTAR PDF converter

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question