V
V
vatuma2012-03-22 12:43:10
Database
vatuma, 2012-03-22 12:43:10

Where can I find the base of words for the game?

Task : make a word game that uses different parts of speech in the basic form (i.e. for nouns - the case is singular, for verbs - the infinitive, etc.)
Issues : All the dictionaries that I managed to find (Dal, Ushakov, Lopatin, Efremov, Zaliznyak, etc.) suffer from one or more shortcomings:
1. They are all incomplete. At the same time, not some tricky words are missing, but completely normal ones - muesli (rarely where there is), believe (Lopatin does not have it!) And other completely ordinary words.
2. There is no indication of the part of speech, or it is impossible to single out proper names (Dal, Ushakov, etc.)
3. It is impossible to single out words that exist only in plurals. including (Efremov, Lopatin)
4. It is impossible to single out diminutives. (Only Efremov can somehow do this, but it is very problematic).
5. It is impossible to separate reflexive verbs. Those. it is necessary to leave only "prick", but throw out "prick". At the same time, it cannot be separated by a suffix - for example, “to be afraid” is the basic form of the word.
6. There are completely ridiculous words, like "pereobjective" (morphological dictionary of word forms).
An attempt to combine dictionaries was unsuccessful - either wrong words are necessarily found, or (if limited) completely normal words are cut off.
And another note about Zaliznyak's dictionary. Its incompleteness is not entirely clear. The fact is that in the dictionary available for download (for example, here) - indeed, there are not very many ordinary words, but at the same time, the Zaliznyak classification is indicated for these words on wiktionary.org. An example is the same muesli . Those. somewhere Zaliznyak still has these words. But I couldn't find it.
Question : Where can I find a word base that satisfies the following requirements?
1. Must contain the basic forms of all (conditionally, of course) words, indicating the part of speech
2. Non-basic forms (Diminutive, reflexive verbs, etc.) must be separated
3. It must be possible to cut off proper names

Answer the question

In order to leave comments, you need to log in

3 answer(s)
D
Dmitry, 2012-03-22
@DedalX

Obviously, your game has competitors, look for their games (for example, for Windows), and get into their resources (if everything is inside some kind of files, then try the Dragon Unpacker program, it understands a lot of packers and game archives). There will also be a dictionary they use. And often in such games for Windows, the dictionary is not even hidden, it is in the folder with the .exe of the game, either directly in txt, or with a renamed extension to something like .dat.

S
stg34, 2012-03-22
@stg34

Look in this direction :
aot.ru/phpmorphy.sourceforge.net/dokuwiki/

M
Mikhail Lyalin, 2012-03-22
@mr_jok

if your game has predecessors - then dance
differently from them - make it possible for users to replenish the dictionary

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question