How do I read separation names from a PDF file in C# Windows Forms?
I have a C# Windows Forms project.
It is a simple form with a button, a text field, and the Acrobat Reader COM component, which displays a PDF file on the form.
private void button2_Click(object sender, EventArgs e)
{
    string pFile = textBox1.Text;
    string filePath = @"\\TS\Obmen\Штампы\D\" + pFile + ".pdf";
    this.axAcroPDF1.LoadFile(filePath);
    this.axAcroPDF1.src = filePath;
    this.axAcroPDF1.setShowToolbar(true); // show/hide the toolbar
    this.axAcroPDF1.setView("FitH");
    this.axAcroPDF1.setLayoutMode("SinglePage");
    this.axAcroPDF1.Show();
}
In general, you can't. A PDF is essentially a picture with optional text information.
In your case, you could try converting the PDF pages to images, cropping the relevant area (with ImageMagick, for example), and sending the crop for recognition with something like Tesseract.
I was able to pull out the entire XMP metadata packet. It is essentially XML:
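// Requires: using iTextSharp.text.pdf; (iTextSharp 5.x)
PdfReader pdf = new PdfReader(filePath);
// XMP packets are normally UTF-8, so decode explicitly instead of using Encoding.Default
string metadataXml = System.Text.Encoding.UTF8.GetString(pdf.Metadata);
label1.Text = metadataXml;
pdf.Close();
// Path to the first ink name inside the XMP:
//xmpmeta/RDF/Description/inks/Seq/li[1]/egname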
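Once the XMP text is in a string, the ink names can probably be read with LINQ to XML instead of being displayed raw. The following is only a sketch: `XmpInkReader` and `ReadInkNames` are hypothetical names, elements are matched by local name so the namespace prefixes do not matter, and the leaf element name is passed in as a parameter because the path above shows it only as `egname`. Check your own XMP dump for the actual element name.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Xml.Linq;

// Hypothetical helper, not part of iTextSharp or Acrobat.
public static class XmpInkReader
{
    // Walks .../inks/Seq/li and returns the text of every element under each
    // <li> whose local name equals leafLocalName, ignoring namespace prefixes.
    public static List<string> ReadInkNames(string metadataXml, string leafLocalName)
    {
        // Drop a leading byte-order mark, which would make the parser fail.
        XDocument doc = XDocument.Parse(metadataXml.TrimStart('\uFEFF'));

        return doc.Descendants()
            .Where(e => e.Name.LocalName == "inks")
            .SelectMany(inks => inks.Elements().Where(e => e.Name.LocalName == "Seq"))
            .SelectMany(seq => seq.Elements().Where(e => e.Name.LocalName == "li"))
            .SelectMany(li => li.Descendants().Where(e => e.Name.LocalName == leafLocalName))
            .Select(e => e.Value.Trim())
            .ToList();
    }
}
```

In the button handler it could be used like this, where `leafName` is whatever element name your XMP actually uses and `listBox1` is an assumed extra control on the form:

```csharp
foreach (string ink in XmpInkReader.ReadInkNames(metadataXml, leafName))
{
    listBox1.Items.Add(ink);
}
```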