An algorithm for splitting words that were accidentally "glued together" by a typo (written without a space)?
Hello, everyone!
The question is in the title.
Illustrative examples:
manyroses => many roses
steamboat => steamboat (i.e. leave unchanged)
(and the most difficult) channel => ?
How can I implement a check that corrects such typos?
Is there a ready-made solution for PHP?
Thanks in advance for the information!
Without context this is hopeless. At best you can establish that a given string can be composed of several other words. In most cases, though, it is impossible to tell whether it should actually be split.
For example, the well-known word "multichannel".
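To illustrate the "at best" part, here is a minimal sketch in Python that lists every way a string can be written as a sequence of dictionary words. The `DICTIONARY` set and the `segmentations` function are hypothetical names made up for this example, and the word list is a toy one; a real checker would load a full dictionary for the language in question.

```python
from functools import lru_cache

# Hypothetical toy dictionary, invented for this example.
DICTIONARY = {
    "many", "roses", "steam", "boat", "steamboat",
    "multi", "channel", "multichannel",
}

def segmentations(text, dictionary=DICTIONARY):
    """Return every way to write `text` as a sequence of dictionary words."""

    @lru_cache(maxsize=None)
    def split_from(start):
        # Reaching the end of the string yields one (empty) continuation.
        if start == len(text):
            return ((),)
        results = []
        for end in range(start + 1, len(text) + 1):
            word = text[start:end]
            if word in dictionary:
                # Prepend this word to every valid split of the remainder.
                for rest in split_from(end):
                    results.append((word,) + rest)
        return tuple(results)

    return [list(words) for words in split_from(0)]

print(segmentations("manyroses"))     # [['many', 'roses']]
print(segmentations("steamboat"))     # [['steam', 'boat'], ['steamboat']]
print(segmentations("multichannel"))  # [['multi', 'channel'], ['multichannel']]
```

Note that for "steamboat" and "multichannel" the function returns both the split and the unsplit reading. The dictionary alone cannot say which one the author meant, which is exactly the problem. The same recursion should carry over to PHP arrays without much change.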
It seems to me that even if something ready-made exists, it works poorly. For example, the word
groceries is a tank alley (not to be confused with an alley). Fight_cue, lie down, hot_cue, cinema_theatre, com_pass, bor_wind.
It seems to me that even with the use of machine learning, inaccuracies will arise.
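A rough sketch of why a statistical approach can still get it wrong: below, candidate splits are ranked by the sum of log word probabilities, one of the simplest scoring schemes one could try. The counts in `COUNTS` are invented purely for illustration and are not real corpus data; `score` and `best` are hypothetical helper names.

```python
import math

# Hypothetical word counts, invented for illustration only.
COUNTS = {"steamboat": 50, "steam": 400, "boat": 600, "many": 900, "roses": 120}
TOTAL = sum(COUNTS.values())

def score(words):
    """Sum of log unigram probabilities; a higher value means 'more plausible'."""
    return sum(math.log(COUNTS[w] / TOTAL) for w in words)

def best(candidates):
    """Pick the candidate segmentation with the highest score."""
    return max(candidates, key=score)

# With these counts the scorer prefers the split,
# even though "steamboat" should be left unchanged.
print(best([["steamboat"], ["steam", "boat"]]))  # ['steam', 'boat']
```

With these made-up numbers, two frequent short words outscore one rarer long word, so a correct word gets broken apart. Different counts would flip the result, so the outcome depends entirely on the training data and not on what the author actually meant.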