How do I scrape a site that has a very interesting anti-bot protection?
Scraping protection has been enabled on the rbt.ru website. After a request is sent to the site, it redirects to a third-party service, ohio8.vchecks.info/.....
At the end of the JS script served there, three parameters are generated that are needed to build a new URL and to generate cookies.
Can anyone help me bypass this protection?
As a 100% reliable option, you can use headless Chrome driven through WebDriver from Python, but this will require more resources than a regular HTTP client.
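from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.chrome.service import Service

# Selenium 4 style; both paths are placeholders for your local installation.
options = Options()
options.add_argument('--headless')  # run Chrome without a visible window
options.binary_location = '/path/to/custom/chrome'
service = Service(executable_path='/path/to/chromedriver')
driver = webdriver.Chrome(service=service, options=options)
try:
    driver.get('http://www.google.com/xhtml')  # replace with the target page
    html = driver.page_source  # HTML after the page's JavaScript has run
finally:
    driver.quit()  # always shut the browser down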
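If the browser is only needed to get past the JavaScript check, one possible way to cut the resource cost is to copy the cookies it received into a regular HTTP client. Below is a minimal sketch, assuming the protection accepts the same cookies outside the browser; this is not verified for rbt.ru, and such checks may also tie cookies to the User-Agent or IP address. `cookie_header` and `build_request` are hypothetical helper names, the cookie values are made up, and the `https://rbt.ru/` URL form is an assumption. The input has the shape returned by Selenium's `driver.get_cookies()` (a list of dicts with `name` and `value` keys).

```python
import urllib.request


def cookie_header(cookies):
    """Join Selenium-style cookie dicts into a Cookie header value."""
    return "; ".join(f"{c['name']}={c['value']}" for c in cookies)


def build_request(url, cookies, user_agent):
    """Prepare a plain HTTP request that reuses the browser's cookies."""
    return urllib.request.Request(
        url,
        headers={"Cookie": cookie_header(cookies), "User-Agent": user_agent},
    )


# Made-up cookie values for illustration; in practice pass
# driver.get_cookies() and the same User-Agent the browser used.
sample = [{"name": "a", "value": "1"}, {"name": "b", "value": "2"}]
req = build_request("https://rbt.ru/", sample, "Mozilla/5.0")
print(req.get_header("Cookie"))
```

The prepared request can then be sent with `urllib.request.urlopen(req)`. If the site starts redirecting to the check again, the cookies have most likely expired and the browser step has to be repeated.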
I agree with my colleagues and will add a piece of software in case you don't feel like doing it by hand:
sbfactory.ru/?p=600