A
A
Asya2017-04-25 22:04:40
data mining
Asya, 2017-04-25 22:04:40

C++ libraries for text processing?

I would like to know if there are any libraries or ready-made solutions for doing text preprocessing (remove stop words, convert words to their initial form, remove punctuation marks, search for keywords, etc.)? It's just that R is full of libraries and ready-made solutions, but in C++ I couldn't find anything suitable...

Answer the question

In order to leave comments, you need to log in

1 answer(s)
A
Alexey Sergey, 2017-04-26
@asyaevloeva

remove stop words, remove punctuation marks, search for keywords

this can definitely be solved with regular expressions.
doc.qt.io/qt-4.8/qregexp.html
en.cppreference.com/w/cpp/regex
but I'm not sure about this
ps: personally, once this article helped me deal with regular expressions, although it was written in the context of javascript.
https://habrahabr.ru/post/242695/
and I can still recommend this resource. It helps a lot in composing and debugging regular expressions.
https://regex101.com

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question