JavaScript
Valentine, 2016-01-15 18:30:21

What stack would you recommend for a parser that supports JavaScript and cookies?

I need to write a parser for a complex site that requires authorization and checks cookies, the user-agent, and a captcha. Some of the content is generated by scripts.
From what I have read online, people increasingly use Node.js for this, apparently together with NW.js.
What else would I need?
What other options are there?

8 answer(s)
nozzy, 2016-01-15
@nozzy

Python + Selenium, with PhantomJS as the web driver in Selenium.
I save the cookies with pickle.

Dasha Tsiklauri, 2016-01-15
@dasha_programmist

For tasks like this, two applications are usually written:
1) scripts for PhantomJS
2) an HTTP API (running locally alongside PhantomJS) that returns the parsed information from PhantomJS in structured form. Personally, I recommend .NET for this part (the application takes minutes to write, and you get all the conveniences of LINQ, etc.).
* To scale the first part, you can write a simple parsing process manager (also in .NET) that launches as many instances as you need on the right hosts with the right configuration.

Dmitry Avilov, 2016-01-15
@TheCreator

Yes, this can be done quite easily with ordinary HTTP requests: you can send cookies and whatever headers you like.
You have to look at the site; sometimes it is easier to figure out how the necessary requests are formed. If they are generated by tricky JavaScript, you can always download that JavaScript, run it in a vm, and generate them the same way.
Or, as others have written here, use a headless browser.
And yes, Node is ideal for interacting with third-party sites.
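
To illustrate the "run it in a vm" idea, here is a minimal sketch, assuming that "vm" means Node's built-in `vm` module. The script text, the `makeToken` function and the helper `loadSiteScript` are all hypothetical stand-ins; a real site's script would be downloaded first and will likely expect more browser globals than the stubs shown here. Note that the Node documentation says `vm` is not a security mechanism, so this is only reasonable for scripts you have inspected.

```javascript
const vm = require('vm');

// Hypothetical stand-in for a script downloaded from the target site.
// It defines a function that derives a request token from its inputs.
const downloadedScript = `
  function makeToken(user, ts) {
    var s = user + ':' + ts, h = 0;
    for (var i = 0; i < s.length; i++) {
      h = (h * 31 + s.charCodeAt(i)) % 1000000007;
    }
    return h.toString(16);
  }
`;

// Evaluate the site's script in a separate context and return that context.
// Top-level functions and vars declared by the script become its properties.
function loadSiteScript(source) {
  // Minimal stubs for browser globals the script might touch (assumption).
  const sandbox = { window: {}, document: { cookie: '' } };
  vm.createContext(sandbox);
  vm.runInContext(source, sandbox, { timeout: 1000 });
  return sandbox;
}

const site = loadSiteScript(downloadedScript);

// Call the site's own function to produce the value a request would need.
const token = site.makeToken('valentine', 1452882621);
console.log('token:', token);
```

The resulting value can then be sent in an ordinary HTTP request as a header, cookie or form field, whichever the site expects.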

Evgen, 2018-03-29
@Verz1Lka

All of this can be done in Scrapy.

Inviz Custos, 2019-12-22
@MvcBox

https://github.com/puppeteer/puppeteer

