M
M
Mikhail Kacherov2014-10-06 12:35:27
big data
Mikhail Kacherov, 2014-10-06 12:35:27

How to save list from Wikipedia category to file?

There is a specific task. It is necessary to save the list of English surnames from Wikipedia into a text file. Only surnames are needed without links to articles. List at https://en.wikipedia.org/wiki/Category:Surnames .
Lists on Wikipedia are generated automatically from links to articles, so when you enter the article editing mode, the list cannot be copied. The list is very large - 47,617 surnames, while only 200 surnames can be displayed on one Wikipedia page. Hands copy-paste, obviously, for a long time.
Is there any relatively easy way to save such lists? Utilities, tricks, terminal commands - I will be glad for any answer on the topic.

Answer the question

In order to leave comments, you need to log in

2 answer(s)
A
Anatoly Scherbakov, 2014-10-18
@Mihkach

There is a dbpedia.org resource that parses information from Wikipedia and presents it in terms of Semantic Web technologies. So the most correct way out is to write a query in the SPARQL language. But I don't even remember its syntax anymore. However, here is the dbpedia page for the mentioned category: dbpedia.org/describe/?url=http%3A%2F%2Fdbpedia.org...
If this is it. You can search other categories. At the bottom of the page there is an inscription "Raw data" and a link to download CSV. There are a lot of names there.
You will need to select the correct category, perhaps tweak something, and get the result. Or - get to know SPARQL.

A
abutko, 2016-01-16
@abutko

This can be done using the AutoWikiBrowser program. This program can generate a list from a category and save it to a file.
See https://ru.wikipedia.org/wiki/Wikipedia:AutoWikiBrowser (they write about registration, but you only need it for editing, you can use it without registration to create lists)

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question