Answer the question
In order to leave comments, you need to log in
Answer the question
In order to leave comments, you need to log in
The Wikipedia extractor is a Python script that takes an XML dump of the Wikipedia database as input and text as output. That is, Python must be installed. To feed the database to this script, it must first be extracted from the BZ2 archive. But the unpacked file will take up a lot of space. Therefore, developers recommend doing unpacking on the fly, without saving data on the hard drive. Linux has the bzip2 utility for this. Under Windows, you can use the console 7-zip. The team will be next
Everything before the '|' is the unpacking command. And after - this is the command to launch Wikipedia Extractor with some parameters.
I haven't checked if this works, since I don't have a Wiki dump.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question