P
P
PO6OT2016-01-09 20:34:22
Software and Internet Services
PO6OT, 2016-01-09 20:34:22

How to scrawl webarchive?

I can't create a copy of web.archive.org/web/20150815080935/http://wasm.ru/wault and all nested pages.
Teleport Pro license copies only index.htm and a couple of .js and .css.
Here's the HTTrack output: WASM.zip
Help scrawl the WASM archive.

Answer the question

In order to leave comments, you need to log in

2 answer(s)
D
Dimonchik, 2016-01-09
@woonem

1) use wget with -np (no parent) option
will be something like

H:\WGET\wget64.exe -m -HErkp -np -Dweb.archive.org 
http://web.archive.org/web/20150815080935/http://wasm.ru/wault/ -k -x -P "S:\WASM" -U "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; QQ)" -o S:\WASM\wasm.log.txt -nv -e robots=off

2) it will rewrite to relative local links, but due to the construction , the /http://wasmtransition from the article to the index may not work - look at the log, and then overwrite it in all by replacing this path
3) if you need only html, add the types of files you need via the -A option,
-A htm,html
4) If something does not download, cancel the -np option and fiddle with the -I (include directories) option
PS Throw out the teleport

A
Andriweb, 2019-02-02
@Andriweb

Skype majordonua, I'm downloading from the webarchive, there are large volumes. The teleport will not let you download. it is designed for a different task

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question