Answer the question
In order to leave comments, you need to log in
PHP parsing in 2018/2019?
In JavaScript, for parsing tasks, in most cases exactly two tools are used:
1) Puppeteer - where it is necessary to emulate browser actions
2) cheerio - where it is enough to simply parse an HTML document
As a person who is quite far from trends in the PHP world , the question arose - what tools are commonly used today for such tasks? I would be very grateful if you could tell me about alternatives for both Puppeteer and cheerio
Answer the question
In order to leave comments, you need to log in
Use Python) After all, using Python, you can parse in several threads and without problems (hidden advertising of python, because python is cool)
Well, if we talk about parsing tools in PHP, then SimpleDomHTML (banal parsing of html pages) is enough for me personally, and all my colleagues sit on PhantomJS (additional JavaScript emulation)
Do it asynchronously.
ReactPHP https://sergeyzhuk.me/2018/03/19/fast-webscraping-...
Even better - asynchronously and multi-process using queues (RabbitMQ).
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question