A
A
Andrey_Epifantsev2019-01-07 07:12:13
Search engines
Andrey_Epifantsev, 2019-01-07 07:12:13

What is the easiest way to implement your search engine?

There is a person studying a foreign language. While he is at the initial stage of learning and the vocabulary is small. The entire vocabulary is known and written down as a list of words (including all forms of words: different number, different person, declensions, conjugations, etc.).
I would like for this person to find texts on the Internet, such that he can read these texts without referring to the dictionary on every sentence. That is, the vast majority of words in the text should either be included in the list of known words, or be a name. Only a few unknown words are allowed for the entire text.
How realistic is it to create such a search engine? Can existing search engines be used? It seems like there is Google Custom Search, but its customization doesn't seem to go that far.

Answer the question

In order to leave comments, you need to log in

1 answer(s)
D
Dimonchik, 2019-01-07
@dimonchik2013

the easiest way is to take a wikipedia dump and compare the corpora;
everything else depends on the presence of indexed content, and this is much longer / more difficult
. closer to IRL language

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question