T
T
Taras Serevann2017-08-07 10:19:56
Neural networks
Taras Serevann, 2017-08-07 10:19:56

What is text tokenization?

Hello!
Can someone clearly and in simple language explain what text tokenization in machine learning is and how it is applied in practice

Answer the question

In order to leave comments, you need to log in

1 answer(s)
L
longclaps, 2017-08-07
@longclaps

tokenization - splitting text into words (and non-words, those punctuation marks, paragraph boundaries, etc.). Its usefulness in machine learning is a direct message to the grid of the fact that a person (whose actions it needs to be taught to imitate) perceives the text as a stream of words, not a stream of letters.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question