R
R
random2016-07-18 18:10:46
Java
random, 2016-07-18 18:10:46

How to parse xml into java object which is generated by abbyy ocr sdk?

Hello, the question is as follows, I need to parse the xml that is generated by ocr sdk by command:

java TestApp recognize test.pdf result.xml --lang=russian


And extract the necessary information from xml, not all of it, but for example I need to find out information from the document which bank is the recipient (in the example it is JSC Halyk Bank of Kazakhstan). I tried to determine by coordinates (bottom, right, left, top), but the error very large, because the document may not be scanned smoothly or not.Is

it possible to do this?

link to the received xml: https://www.dropbox.com/s/iofy6i4xjesrsyj/result.x...
link to pdf: https://www.dropbox.com/s/girz3it2ntt10fm/test.pdf?dl=0

Answer the question

In order to leave comments, you need to log in

1 answer(s)
C
coden55, 2016-07-18
@coden55

The beneficiary bank can be searched for by BIC (it is always the same length) and the name of the bank from the database can already be determined from it

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question