Scan datamatrix from pdf and save to csv?

Y

Yana Mikhailenko2021-06-10 16:25:14

Python

Yana Mikhailenko, 2021-06-10 16:25:14

Task:
Scan datamatrix codes from a pdf file and save them to csv.

File

End result:
010466010514027621)ZPTsFWoUgqe,91009492ZCUruNv8/rQRlZyH/mZhkRY11D5aW4aLjpVn3DVxFIi7l9gV/pvguWxiVnpTRI0SFkNx1dPavcQYjiQ6DCSnNw=

I can't form the structure of this code in my head=.

I started to study libraries for working with PDF files, specifically PyPDF2, but I ran into a problem. PyPDF2 finds absolutely nothing in the file. I tried to find the sequence in the code of the pdf file but did not understand anything.

Help me understand any fragment of this code (except for writing to csv).
It may be possible to extract information from PDF even without rendering to an image, since large amounts of codes and code speed play a role.

If there are people who know the PDF structure, tell me if it will be possible to pull out the location of each pixel (black square) of the datamatrix code and whether it will be possible to translate all this into the final form.

I would be grateful for any information. Thank you.