H
H
Horosh2942020-06-02 22:43:18
Parsing
Horosh294, 2020-06-02 22:43:18

How to parse a specific field from a site?

Good afternoon! There is a site, it has about 250k of the same type of pages, each page has the necessary link in a certain place. I have all 250k landing pages in my txt file. In general, the task is this: parse links from a certain field on the page and then insert them, for example, also into a txt file, only into a new one already.
With what can this be done?

Answer the question

In order to leave comments, you need to log in

2 answer(s)
P
PeeX, 2020-06-02
@PeeX

I would use nodejs + cheerio
although there are probably more suitable tools

A
alekssamos, 2020-06-03
@alekssamos

If there is no javascript execution, then absolutely any language. Either python + requests or PHP + curl. And then, either parse HTML, or regular.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question