T
T
theSever2017-01-16 04:31:13
Domain name market
theSever, 2017-01-16 04:31:13

How to parse several million domains?

It is necessary to parse a decent number of sites and collect information from them (only muzzles), the problem is that most software is terribly, terribly slow plows. How can this problem be solved, a ninja-style parser or software with distributed parsing?

Answer the question

In order to leave comments, you need to log in

4 answer(s)
S
sim3x, 2017-01-16
@sim3x

One-liner on bash + parallel + wget/curl
And install dns server locally

Y
Yuri, 2017-01-16
@riky

did stupidly on a node like that.
all *.RU domains (5.5 million) in a few days.

P
Paul Webster, 2017-01-16
@kopceak

Python + Celery + Elasticsearch

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question