L
L
litehaus2016-07-17 18:33:40
Pattern recognition
litehaus, 2016-07-17 18:33:40

How to normalize text?

Good day to all. There is a task of normalization of texts on an input the text "Mom washed a frame".
When normalizing individual words, we get [mother] [wash, soap] [frame] the question is how to choose correctly that in this sentence the word "soap" will have a normal form namely "wash" and not "soap". thank you all in advance.

Answer the question

In order to leave comments, you need to log in

1 answer(s)
D
Dimonchik, 2016-07-17
@dimonchik2013

take a stemmer
to distinguish a verb from a noun - there’s no way without a corpus, however, NLP libraries can do something, look
, but search engines don’t need it, but a competent stemmer is needed

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question