Answer the question
In order to leave comments, you need to log in
How to normalize names with ML or other ways?
Hello.
There was a question how it is possible to normalize the text, to be exact a full name.
For example, at the entrance we have:
1) Sidorov Alexey Alex.
2) Aleksey, A. Sidorrov
The output should be that 1 = 2
What methods can be done?
Answer the question
In order to leave comments, you need to log in
It is necessary to determine the degree of similarity of the strings and select the appropriate coefficient. You can use the Levenshtein distance
If the text is exclusively a full name, then you can try to write a regular expression for all spellings; highlight separately F, I, O; handle name abbreviations; compare full variants according to Levenshtein.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question