Python
Denis, 2018-07-20 14:58:11

How do I find the PDF documents on a site that contain certain keywords?

The arbitration court case index (kad.arbitr.ru) publishes various open documents as PDF files.
I need to collect the links to the documents that contain certain keywords.
I'm still learning Python, so please point me in the right direction: what to read or watch, and what to keep in mind. Are there any similar existing solutions?
As far as I understand, the case index site is built with JavaScript. Will that cause difficulties?
After several search queries a captcha pops up. Will that be a problem when parsing?
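
To make the task concrete, here is a rough sketch of the two steps I have in mind, using only the standard library. It assumes the page HTML has already been obtained somehow (since the site is rendered with JavaScript, a plain HTTP request may not return the links) and that the text of each PDF has already been extracted with some third-party library, which is not shown. The names `PdfLinkCollector`, `find_pdf_links` and `contains_keywords` are made up for illustration, and the assumption that document links end in `.pdf` may not hold for this site.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkCollector(HTMLParser):
    """Collects href values of <a> tags that point to .pdf files."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Assumption: document links end in ".pdf" (ignoring any query string).
        if href and href.lower().split("?")[0].endswith(".pdf"):
            # Resolve relative links against the page URL.
            self.links.append(urljoin(self.base_url, href))


def find_pdf_links(html, base_url):
    """Return absolute URLs of all PDF links found in the given HTML."""
    collector = PdfLinkCollector(base_url)
    collector.feed(html)
    return collector.links


def contains_keywords(text, keywords):
    """Return True if any keyword occurs in the text, ignoring case."""
    lowered = text.casefold()
    return any(word.casefold() in lowered for word in keywords)
```

The idea would be to call `find_pdf_links` on each results page, download every link, extract its text, and keep only the links for which `contains_keywords` returns `True`.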
