How do I crawl the Web Archive?

PO6OT, 2016-01-09 20:34:22

I can't make a copy of web.archive.org/web/20150815080935/http://wasm.ru/wault and all of its nested pages.
The licensed copy of Teleport Pro downloads only index.htm and a couple of .js and .css files.
Here's the HTTrack output: WASM.zip
Please help me crawl the WASM archive.

2 answer(s)

Dimonchik, 2016-01-09
@woonem

1) Use wget with the -np (no parent) option.
The command will look something like this:

H:\WGET\wget64.exe -m -HErkp -np -Dweb.archive.org http://web.archive.org/web/20150815080935/http://wasm.ru/wault/ -k -x -P "S:\WASM" -U "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; QQ)" -o S:\WASM\wasm.log.txt -nv -e robots=off

2) wget will rewrite links to relative local ones, but because of the /http://wasm construction in the URL, the link from an article back to the index may not work. Check the log, then fix it in all files by replacing that path.
3) If you only need HTML, list the file types you want with the -A option:
-A htm,html
4) If something fails to download, drop the -np option and experiment with the -I (include directories) option.
P.S. Ditch Teleport.
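
The path replacement from step 2 could be done with a short script. This is only a rough sketch, not part of the wget run itself: the function name replace_in_mirror and the example strings in the usage comment are hypothetical, and the actual broken path has to be taken from your own wasm.log.txt and the downloaded pages. It works on raw bytes, so it should not disturb whatever encoding the pages use.

```python
from pathlib import Path


def replace_in_mirror(root, old, new, suffixes=(".htm", ".html")):
    """Replace every occurrence of `old` with `new` in HTML files under `root`.

    `old` and `new` are plain strings (assumed to be ASCII paths).
    Returns the list of files that were actually changed.
    """
    old_bytes = old.encode("utf-8")
    new_bytes = new.encode("utf-8")
    changed = []
    for path in sorted(Path(root).rglob("*")):
        # Skip directories and anything that is not an HTML page.
        if not path.is_file() or path.suffix.lower() not in suffixes:
            continue
        # Operate on bytes so the page encoding is left untouched.
        data = path.read_bytes()
        updated = data.replace(old_bytes, new_bytes)
        if updated != data:
            path.write_bytes(updated)
            changed.append(path)
    return changed


# Hypothetical usage; substitute the path you actually see in the log:
# changed = replace_in_mirror(r"S:\WASM", "<broken path from the log>", "<working local path>")
# print(len(changed), "files updated")
```

Run it on a copy of the mirror first, since the files are overwritten in place.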

Andriweb, 2019-02-02
@Andriweb

Skype: majordonua. I download sites from the Web Archive, including large volumes. Teleport won't manage this download; it is designed for a different task.