Answer the question
In order to leave comments, you need to log in
What is a convenient library for parsing sites for nodejs?
On python, I used the wonderful grablib library, which has both a parser and a crawler, query caching and other handy things for parsing sites.
Are there any good analogues for nodejs?
Answer the question
In order to leave comments, you need to log in
If parsing requires filling out forms, submitting and moving between pages, I advise slimerjs www.slimerjs.org/. Everything is in order with the documentation. Like phantom, it is a full-fledged browser, only on Gecko + gui browser windows. There is a module for node https://www.npmjs.com/package/node-slimerjs
The module documentation is worse, but you can always refer to the tests for examples.
For parsing static html, cheerio will suffice.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question