Sergey Sokolov, 2021-09-22 21:09:37
CAPTCHA

How to show page with hCaptcha in Jupyter Notebook?

I'm collecting data from a site for a small research project. The volume is slightly more than ordinary browsing, but well short of aggressive scraping. Even so, a captcha pops up.

I work in Google Colab: I fetch pages with requests, then parse them with BeautifulSoup.

At some point, instead of the requested page, I get this:

<html>
<head>
<title>hCaptcha solve page</title>
<script async="" defer="" src="https://www.hcaptcha.com/1/api.js"></script>
</head>
<body>
<form action="/jobs?as_and=UX&amp;as_any=&amp;jt=all&amp;radius=25&amp;l=London&amp;fromage=all&amp;as_ttl=product%20manager&amp;limit=50&amp;psf=advsrch&amp;from=advancedsearch" method="POST">
<div class="h-captcha" data-sitekey="eb27f525-f936-43b4-91e2-95a426d4a8bd"></div>
<br/>
<input type="submit" value="Submit"/>
</form>
</body>
</html>
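
For context, here is a minimal sketch of how the scraper could at least notice that it received this challenge page rather than real results. It only detects the page; it does not solve or bypass the captcha. The class and function names (HCaptchaDetector, find_hcaptcha) are hypothetical, and the sketch uses the standard-library html.parser so it runs without dependencies. With BeautifulSoup the same check would be roughly soup.find("div", class_="h-captcha").

```python
from html.parser import HTMLParser


class HCaptchaDetector(HTMLParser):
    """Records the data-sitekey of the first element with class "h-captcha"."""

    def __init__(self):
        super().__init__()
        self.found = False
        self.sitekey = None

    def handle_starttag(self, tag, attrs):
        if self.found:
            return
        attr_map = dict(attrs)
        # Attribute values can be None for valueless attributes (e.g. async).
        classes = (attr_map.get("class") or "").split()
        if "h-captcha" in classes:
            self.found = True
            self.sitekey = attr_map.get("data-sitekey")


def find_hcaptcha(html):
    """Return (is_challenge_page, sitekey) for an HTML string."""
    detector = HCaptchaDetector()
    detector.feed(html)
    return detector.found, detector.sitekey


# Shortened version of the challenge page shown above.
sample = (
    '<html><body><form method="POST">'
    '<div class="h-captcha" '
    'data-sitekey="eb27f525-f936-43b4-91e2-95a426d4a8bd"></div>'
    '<br/><input type="submit" value="Submit"/>'
    "</form></body></html>"
)

is_challenge, sitekey = find_hcaptcha(sample)
print(is_challenge, sitekey)
```

In the scraping loop, this check could run on response.text before parsing, so the script can pause instead of treating the captcha page as data.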


Is it possible to display this page in the notebook somehow, so that I can solve the captcha manually and move on?
Or will it not work outside its assigned domain, making all of this pointless, so that I have to scrape with Selenium and a paid captcha-solving service bolted on?
