Answer the question
In order to leave comments, you need to log in
How to download all articles of Habr?
Hello!
Tell me a script with which you could download all the articles of Habr to a computer, preferably in pdf format. (There is no Internet at work, but I want to read)
Thank you.
Answer the question
In order to leave comments, you need to log in
A bit not what you want, but still: http://habrahabr.ru/blogs/python/111411/
Something like this “wget -r”
For more details, you need to look at the keys in man.
Habr is a country of contrasts ... More recently, there was a question “How to stop reading habr” (:
Yes, this is the first post that I paid attention to, but it didn’t work out - the text in the xml files, not the python code. Maybe I did something wrong.
You can use something like offline explorer (apparently you'll have to tweak the settings).
alternativeto.net/software/offline-explorer/?profile=windows&platform=windows&license=free
just in case: the administration of the resource is unlikely to be happy if many users immediately want to make a Habr mirror for themselves. It will be a very non-figuring load on iron
Imacros is here to help. Sketch a script that will go through all the article numbers in turn and save them ...
True, I prefer to save articles using SingleFile Beta - then the pictures are saved inside the html file. Only he is not friends with YaMacros, of course.
At one time I downloaded interesting articles 2-3 per day after reading with curl -O .... So they covered this opportunity, wrote them a letter to return the ability to download articles using curl, so they didn’t even answer me. After such an attitude, if I were the authors, I would not write articles on habr. They make money on your articles, they also do it like a pig. Here on the same stackoverflow they posted several parts of the database of 80Gb each, download whoever wants. Apparently Habr is afraid that people will no longer come if they download the article, or is it all about money?
PS For those who will tell me why download articles? And then, that the author can delete them at any time, and many times it happened that even in web.archive.org he could not find them.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question