Python
Mikhail Kostikov, 2022-04-20 17:12:01

How to save HTML from a list of URLs asynchronously?

Given an array of URLs, I need to asynchronously save the HTML of each one into another array, bypassing Cloudflare protection.

This script https://github.com/MorvanZhou/easy-scraping-tutori... works, but after a while Cloudflare DDoS protection kicks in and some of the URLs are skipped. I tried the module https://github.com/pavlodvornikov/aiocfscrape, but it fails with the error "name 'domain' is not defined". There are other solutions, for example https://github.com/venomous/cloudscraper, but they are not asynchronous and I don't know how to plug them into asynchronous code.


1 answer(s)
Mikhail Kostikov, 2022-04-21
@kostikovmu

After connecting https://github.com/venomous/cloudscraper, it reports:

Detected a Cloudflare version 2 challenge, This feature is not available in the opensource (free) version.
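
On the part of the question about plugging a synchronous client into asynchronous code: one common approach is to run each blocking request in a worker thread and gather the results. Below is a minimal sketch under assumptions, not a tested solution for this site. The names `fetch_html`, `fetch_all` and `max_concurrency` are made up for illustration, and the default fetcher uses only the standard library, so it does nothing about the Cloudflare challenge itself. It needs Python 3.9+ for `asyncio.to_thread`.

```python
import asyncio
from typing import Callable, List, Optional
from urllib.request import urlopen


def fetch_html(url: str) -> str:
    """Blocking placeholder fetcher (hypothetical helper, stdlib only)."""
    with urlopen(url, timeout=30) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


async def fetch_all(
    urls: List[str],
    fetch: Callable[[str], str] = fetch_html,
    max_concurrency: int = 10,
) -> List[Optional[str]]:
    """Run a blocking `fetch` for every URL in worker threads.

    Returns a list in the same order as `urls`; a failed URL yields None
    instead of being silently skipped.
    """
    # Limit how many requests are in flight at the same time.
    semaphore = asyncio.Semaphore(max_concurrency)

    async def fetch_one(url: str) -> Optional[str]:
        async with semaphore:
            try:
                # to_thread keeps the event loop free while the call blocks.
                return await asyncio.to_thread(fetch, url)
            except Exception as exc:
                print(f"failed: {url}: {exc}")
                return None

    # gather preserves the order of the input URLs.
    return await asyncio.gather(*(fetch_one(url) for url in urls))
```

Usage would look like `pages = asyncio.run(fetch_all(urls))`. Any blocking callable that takes a URL and returns a string can be passed as `fetch`, for example a small wrapper around a session created with `cloudscraper.create_scraper()`. That only makes the calls concurrent; it would still hit the version 2 challenge error quoted above.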
