V
V
Vladislav Shchekoldin2013-06-14 13:53:21
Search Engine Optimization
Vladislav Shchekoldin, 2013-06-14 13:53:21

Rails: Are there any out-of-the-box mechanisms for parsing text?

Gentlemen, I am solving the following problem: there is a site under development, there are desirable keywords for it, and there is a set of already written individual texts.
Now I'm making a mechanism to detect intersections of how the current text content matches a set of goals.
With the search for keywords in the text from a given list, the situation is clear:
When adding keywords, we use the ruby ​​implementation of the Stemming mechanism. The

question is to evaluate something else: how the text matches this query, and whether it generates others. That is, I need some mechanism that will analyze the text, find the most frequently repeated words, and give their% content in the text.

The question is, does anyone already know (maybe even described on the Web?) a ready-made solution on rails? For example, according to the TF-IDF algorithm ?
It is clear that everything can be written. But there is such an important resource as time. That's why I'm asking: are there any ready-made solutions to this issue?
Thank you.

Answer the question

In order to leave comments, you need to log in

1 answer(s)
D
dustalov, 2013-06-15
@dustalov

There are no developed competitors to NLTK and OpenNLP, but look at Treat .

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question