Answer the question
In order to leave comments, you need to log in
How to parse raw html pages?
Good day, you need to parse data from the site, read a bunch of docks, all of them are used to sending a request and processing response data.
For example, I found this one:
html = urllib2.urlopen( "http://www.google.com" ).read()
soup = BeautifulSoup( html )
from mechanize import Browser
from bs4 import BeautifulSoup
mech = Browser()
mech.set_handle_robots(False)
url = 'example.com'
page1 = mech.open(url)
html1 = page1.read()
soup1 = BeautifulSoup(html1)
Answer the question
In order to leave comments, you need to log in
This is just processed data, and raw comes to you with urlib.
Use selenium (browser window can be hidden using PhantomJS).
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question