Is there an alternative to regular TCP sockets for a multithreaded parser?
Task: collect information about ~100 million pages on different domains as quickly as possible using HTTP HEAD requests.
The parser is currently written in Ruby and uses HTTPClient with multithreading.
Are there faster libraries for this kind of task?
1) Run on JRuby or Rubinius to get real multithreading that spreads threads across cores; on standard Ruby (MRI) the GIL (global interpreter lock) keeps Ruby code from running on more than one core at a time.
2) Run multiple processes.
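A rough sketch of option 2 on MRI, assuming a Unix-like system where `fork` is available (it is not on JRuby or Windows). It uses the standard library's Net::HTTP instead of HTTPClient so it runs without gems. The names `head_status`, `check_slice` and `check_all`, and the process, thread and timeout numbers are made up for illustration, not tuned values.

```ruby
require "net/http"
require "uri"
require "json"

# Send one HEAD request; return the status code as a string,
# or the error class name if the request fails.
def head_status(url, timeout: 5)
  uri = URI.parse(url)
  Net::HTTP.start(uri.host, uri.port, use_ssl: uri.scheme == "https",
                  open_timeout: timeout, read_timeout: timeout) do |http|
    http.head(uri.request_uri).code
  end
rescue StandardError => e
  e.class.name
end

# Check one slice of URLs with a fixed pool of threads fed from a queue.
def check_slice(urls, threads: 50, &checker)
  queue = Queue.new
  urls.each { |u| queue << u }
  queue.close # pop returns nil once the queue is drained
  results = Queue.new
  workers = Array.new(threads) do
    Thread.new do
      while (url = queue.pop)
        results << [url, checker.call(url)]
      end
    end
  end
  workers.each(&:join)
  Array.new(results.size) { results.pop }.to_h
end

# Split the URL list across forked worker processes and merge what they
# report back over pipes. Returns a hash of url => status.
def check_all(urls, processes: 4, threads: 50, &checker)
  return {} if urls.empty?
  checker ||= method(:head_status)
  slice_size = (urls.size / processes.to_f).ceil
  children = urls.each_slice(slice_size).map do |slice|
    reader, writer = IO.pipe
    pid = fork do
      reader.close
      writer.write(JSON.generate(check_slice(slice, threads: threads, &checker)))
      writer.close
    end
    writer.close # parent keeps only the read end
    [pid, reader]
  end
  children.each_with_object({}) do |(pid, reader), merged|
    merged.merge!(JSON.parse(reader.read))
    reader.close
    Process.wait(pid)
  end
end
```

Called as `check_all(urls, processes: 8, threads: 100)`, each process gets its own interpreter and its own GIL, and the threads inside it overlap their network waits. For 100 million URLs you would stream the input and write results to disk rather than hold everything in memory as this sketch does.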