Y
Y
yohiromoro2021-12-01 17:52:31
Parsing
yohiromoro, 2021-12-01 17:52:31

How to get the entire text of the site (from all tabs, etc.)?

I received an order to translate the site, but this requires the ENTIRE text. How to do it? Is there a program? Ideally, each new page of the site should be saved in a separate dock. Or only paid parsers?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
V
Vasily Chesnokov, 2021-12-02
@yohiromoro

Depending on the complexity of the task, the desire to poke around, and skills in surrounding technologies, you can offer a bunch of options, from banal download managers like wget:
wget -r -k -l 7 -p -E -nc http://site.com/
to heavy and capable libraries and whole frameworks like scrapy
Google using the words " web crawler "

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question