PHP
Pantene742, 2017-01-04 14:09:08

Is PHP suitable for web scraping, and is it enough to satisfy most freelance clients?

- Is it possible to scrape in PHP with authorization and cookies?
- Dynamic pages (sites) seem to be scraped simply by sending requests to their REST API. Do I understand correctly? Using Selenium for this strikes me as not bad, but a crutch.
- As I understand it, Python with parallelism and the Pandas library is the only adequate tool for analyzing a large amount of scraped data. But the next question: what kinds of orders are more common in freelancing? Do most need the power of Python, or will PHP suffice?

5 answer(s)
T_y_l_e_r, 2017-01-04
@Pantene742

It fits very well.
cURL also supports parallel requests,
as well as cookies and everything else.
You need to run it as a console application.
Among the tools there are all kinds of browser emulators,
or the DOM extension will help you parse DOM elements.
screen on Linux lets you run several parsers at once.
In general, study the
The VK city database was downloaded in a week in 100 threads, with error checking during download;
uptime was not 100%.
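
The approach described above (cookies, many parallel downloads, error checking) can be sketched roughly as follows. The sketch is in Python rather than PHP, purely for illustration; the names `make_session`, `fetch` and `fetch_all` are hypothetical, and the worker count of 100 only mirrors the figure mentioned in this answer. It is a minimal sketch, not a description of how the VK download was actually done.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from http.cookiejar import CookieJar
from urllib.request import HTTPCookieProcessor, build_opener


def make_session():
    """Return an opener that stores cookies and sends them back on later requests."""
    return build_opener(HTTPCookieProcessor(CookieJar()))


def fetch(opener, url, timeout=10):
    """Download one page through the cookie-aware opener."""
    with opener.open(url, timeout=timeout) as response:
        return response.read()


def fetch_all(urls, fetch_one, workers=100):
    """Call fetch_one(url) in a thread pool; keep results and errors apart."""
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {pool.submit(fetch_one, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:
                # Record the failure so the URL can be retried later.
                errors[url] = exc
    return results, errors
```

A possible use would be `session = make_session()` followed by `fetch_all(urls, lambda u: fetch(session, u))`; anything left in `errors` can be fed into a second pass.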

malbaron, 2017-01-04
@malbaron

Any general-purpose language is suitable.
But some of them have ready-made, extremely convenient libraries, for example Scrapy for Python.
Perhaps there is something similar for PHP.
Not every language has one.
Nowadays many sites rely on JavaScript, and they cannot be scraped with bare PHP.
You need to emulate a browser with something like PhantomJS.
And these tools are usually controlled from JavaScript.

Saboteur, 2017-01-04
@saboteur_kiev Curator of the Python tag

PHP and Perl are usually more than enough.
Python is useful if you know neither PHP nor Perl but do know Python.
In general, any language you can write a parser in is enough for scraping.
And for serious speed you should use an API rather than scrape.
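
To illustrate the last point: when a site exposes a JSON API, there is no HTML to parse at all, only a URL to build and a response body to decode. The following is a minimal sketch in Python; the endpoint `https://example.com/api/cities`, the `items` key and both function names are assumptions made up for the example, not a real service.

```python
import json
from urllib.parse import urlencode


def build_api_url(base, **params):
    """Compose a query URL for a JSON endpoint from keyword parameters."""
    return base + "?" + urlencode(params)


def extract_items(payload, key="items"):
    """Decode a JSON response body and return the list stored under `key`."""
    data = json.loads(payload)
    return data.get(key, [])
```

For example, `build_api_url("https://example.com/api/cities", page=2, limit=50)` gives the URL to request, and `extract_items(body)` turns the returned text into a list of records.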

АртемЪ, 2017-01-04
@Jump

"Is PHP suitable for web scraping?"
Yes.
"Is it enough to satisfy most customers?"
Not always.
"Using Selenium for this strikes me as not bad, but a crutch."
Are there any options for executing JS without crutches?

Leonid, 2017-01-12
@caballero

What's the difference?
Take phpQuery and parse to your heart's content.
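
Whatever the library, the core task is the same: walk the markup and pull out the elements you need. As a rough illustration, here is the idea in Python using only the standard library; `LinkCollector` and `extract_links` are hypothetical names, and this does not reproduce phpQuery's own API.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag encountered."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def extract_links(html):
    """Return all link targets found in an HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling `extract_links(page_source)` on a downloaded page returns the list of link targets, which can then be queued for the next round of requests.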
