Answer the question
In order to leave comments, you need to log in
How to parse a specific field from a site?
Good afternoon! There is a site, it has about 250k of the same type of pages, each page has the necessary link in a certain place. I have all 250k landing pages in my txt file. In general, the task is this: parse links from a certain field on the page and then insert them, for example, also into a txt file, only into a new one already.
With what can this be done?
Answer the question
In order to leave comments, you need to log in
I would use nodejs + cheerio
although there are probably more suitable tools
If there is no javascript execution, then absolutely any language. Either python + requests or PHP + curl. And then, either parse HTML, or regular.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question