M
M
Maxim Melnikov2017-12-05 12:20:36
Python
Maxim Melnikov, 2017-12-05 12:20:36

How to normalize names with ML or other ways?

Hello.
There was a question how it is possible to normalize the text, to be exact a full name.
For example, at the entrance we have:
1) Sidorov Alexey Alex.
2) Aleksey, A. Sidorrov
The output should be that 1 = 2
What methods can be done?

Answer the question

In order to leave comments, you need to log in

2 answer(s)
N
Nikita Dergachov, 2017-12-05
@vanillathunder

It is necessary to determine the degree of similarity of the strings and select the appropriate coefficient. You can use the Levenshtein distance

I
ivodopyanov, 2017-12-06
@ivodopyanov

If the text is exclusively a full name, then you can try to write a regular expression for all spellings; highlight separately F, I, O; handle name abbreviations; compare full variants according to Levenshtein.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question