A
A
Andrey Fedorov2015-05-03 01:46:20
JavaScript
Andrey Fedorov, 2015-05-03 01:46:20

What is a convenient library for parsing sites for nodejs?

On python, I used the wonderful grablib library, which has both a parser and a crawler, query caching and other handy things for parsing sites.
Are there any good analogues for nodejs?

Answer the question

In order to leave comments, you need to log in

2 answer(s)
D
Dialog, 2015-05-03
@Dialog

habrahabr.ru/post/210166

I
IvanY, 2015-05-03
@IvanY

If parsing requires filling out forms, submitting and moving between pages, I advise slimerjs www.slimerjs.org/. Everything is in order with the documentation. Like phantom, it is a full-fledged browser, only on Gecko + gui browser windows. There is a module for node https://www.npmjs.com/package/node-slimerjs
The module documentation is worse, but you can always refer to the tests for examples.
For parsing static html, cheerio will suffice.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question