Besides plain text, can NLTK vectorize HTML special characters and other symbols?
Good morning!
I know the basics of NLTK and its word_tokenize() function well, but I have run into a problem: NLTK has to turn the source text into vectors when that text contains HTML special characters and other kinds of characters...
For example:
👐 Hi! How are you feeling?
[Region = Samara]
😇 OK, I found some interesting places for you in the area[moscow district = Krylatskoe]
Can word_tokenize() be used for vectorizing text that contains any kind of content, not just plain text?
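To make the question concrete: word_tokenize() only splits a string into tokens, and turning tokens into vectors is a separate step. Below is a minimal sketch of one possible preprocessing pass for text like the example. It uses only the Python standard library rather than NLTK, and the names `preprocess`, `TAG_RE`, and `TOKEN_RE`, as well as the assumed `[key = value]` annotation format, are hypothetical and chosen for illustration only.

```python
import html
import re

# Assumed annotation format: "[key = value]", e.g. "[Region = Samara]".
TAG_RE = re.compile(r"\[\s*([^\]=]+?)\s*=\s*([^\]]+?)\s*\]")
# A token is a run of word characters (any script) or one non-space symbol,
# so each emoji or punctuation mark becomes its own token.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")


def preprocess(text):
    """Decode HTML entities, extract [key = value] tags, tokenize the rest."""
    # Turn entities such as "&amp;" back into the characters they stand for.
    text = html.unescape(text)
    # Collect the bracketed annotations as (key, value) pairs.
    tags = [(key.strip(), value.strip()) for key, value in TAG_RE.findall(text)]
    # Remove the annotations so they do not end up in the token stream.
    plain = TAG_RE.sub(" ", text)
    tokens = TOKEN_RE.findall(plain)
    return tokens, tags
```

The returned token list could then be handed to whatever vectorizer is in use, and the cleaned text could equally be passed to word_tokenize() instead of the regular expression used here. Whether emoji should be kept as tokens or dropped depends on the task.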