Algorithms
pavel0staratel, 2015-06-11 15:23:21

Which algorithm should I use to match data against a dictionary?

Both the data and the dictionary are sets of strings. The data contains the same strings as the dictionary, but with typos, reordered words, extra or missing words, and so on.
I need to build a mapping: which dictionary line corresponds to each data line.
Perhaps machine learning algorithms are worth looking into. If so, which one?


2 answers
Lesha Kiselev, 2015-06-11
@Yakud

Alternatively, use fuzzy search algorithms:
algolist.manual.ru/search/fsearch
habrahabr.ru/post/123320
habrahabr.ru/post/114997
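
To make the idea concrete, here is a minimal sketch of one common fuzzy search approach: Levenshtein edit distance computed on normalized strings. It is not taken from the links above. The helper names (`levenshtein`, `normalize`, `best_match`) and the sample dictionary are made up for illustration. Sorting the words before comparing is an assumption that handles word permutations; typos are absorbed by the edit distance, while extra or missing words simply increase the distance.

```python
from typing import List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def normalize(s: str) -> str:
    """Lowercase and sort the words so that word order does not matter."""
    return " ".join(sorted(s.lower().split()))


def best_match(query: str, dictionary: List[str]) -> Tuple[str, int]:
    """Return the dictionary entry closest to the query and its distance."""
    q = normalize(query)
    scored = ((entry, levenshtein(q, normalize(entry))) for entry in dictionary)
    return min(scored, key=lambda pair: pair[1])


# Hypothetical sample data, for illustration only.
dictionary = ["red apple pie", "green tea", "black coffee beans"]
match, distance = best_match("aple red pie", dictionary)
```

This compares every data line with every dictionary line, so it is only reasonable for small sets. For large dictionaries you would probably want an index (n-grams, BK-tree) like the ones the articles above describe.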

beduin01, 2015-06-11
@beduin01

The implementation of the algorithm itself: https://neil.fraser.name/software/diff_match_patch...
Diff-Match-Patch (version for D). Versions for other languages are available as well.
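
If you just want to try diff-style similarity without installing anything, here is a rough sketch that uses Python's standard `difflib` module as a stand-in. Note that this is not Diff-Match-Patch itself, and its scoring differs. The function name `match_lines`, the cutoff value, and the sample strings are assumptions for illustration.

```python
import difflib
from typing import Dict, List, Optional


def match_lines(data: List[str], dictionary: List[str],
                cutoff: float = 0.6) -> Dict[str, Optional[str]]:
    """Map each data line to its most similar dictionary line, or None."""
    result: Dict[str, Optional[str]] = {}
    for line in data:
        # get_close_matches ranks candidates by SequenceMatcher.ratio()
        # and drops those scoring below the cutoff.
        candidates = difflib.get_close_matches(line, dictionary, n=1, cutoff=cutoff)
        result[line] = candidates[0] if candidates else None
    return result


# Hypothetical sample data, for illustration only.
dictionary = ["Moscow State University", "Saint Petersburg", "Novosibirsk"]
data = ["Moscow Stat Universty", "Sant Petersburg", "zzzz"]
matches = match_lines(data, dictionary)
```

The comparison is case-sensitive and sensitive to word order, so you may need to lowercase the strings and sort their words first if the data contains permutations.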
