How to parse a PDF as a structure in C#?
I'm interested in how to parse a PDF not just as a mountain of text, but as a structure. In particular, I want to read tables.
stackoverflow.com/questions/3424588/programmatically-extract-pdf-tables
Given your requirement, the straightforward answer is that this is not really possible. The reason is that, unlike Word or Excel, the PDF specification does not have an object called a table. The tables you see in PDF documents are just a series of rectangles drawn so that they look like a table, and how they are drawn is up to the PDF writer that created the file; some writers draw the table structure as a series of lines instead.
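Since the file carries no table object, a common workaround is to guess the rows and columns from where the text sits on the page. Below is a minimal sketch of that idea, not a definitive implementation. It assumes you already have each text fragment with its position from whatever PDF library you use; the `Word` record, the `TableGuess` class, and the `yTolerance` parameter are hypothetical names made up for illustration, not part of any library.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical input type: one text fragment plus the position
// that a PDF library reported for it (in page units).
public record Word(string Text, double X, double Y);

public static class TableGuess
{
    // Groups words into rows by similar Y, then orders each row by X.
    // yTolerance is an assumed threshold; tune it for your documents.
    public static List<List<string>> ToRows(IEnumerable<Word> words, double yTolerance = 2.0)
    {
        var rows = new List<List<Word>>();
        double lastY = 0;

        // PDF coordinates usually grow upward, so descending Y reads top to bottom.
        foreach (var w in words.OrderByDescending(w => w.Y))
        {
            // Start a new row when the vertical gap to the previous word is too large.
            if (rows.Count == 0 || Math.Abs(lastY - w.Y) > yTolerance)
                rows.Add(new List<Word>());

            rows[^1].Add(w);
            lastY = w.Y;
        }

        // Within a row, left-to-right order gives the cell order.
        return rows
            .Select(r => r.OrderBy(w => w.X).Select(w => w.Text).ToList())
            .ToList();
    }
}
```

For example, four words at (50, 700.5) "Item", (200, 700) "Price", (50, 680) "Pen" and (200, 680) "1.20" come back as two rows, "Item, Price" and "Pen, 1.20". This only handles simple grids; merged cells, wrapped text inside a cell, and multi-column pages need more work.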
Well, first you need to find a description of the PDF format, and then work from there.