Java
pikkvile, 2016-05-23 11:24:58

What libraries are there for evaluating the "similarity" of texts?

Preferably in Java or another JVM language.
I'm new to text analysis, so I'm not sure how to phrase what I need correctly.
For example, here are two product names:
HP 728 300-ml Cyan DesignJet Ink Cartridge
Hp 728 - Cyan - Original - Designjet - Blækpatron
I need a tool that can recognize, with reasonably high probability, that they refer to the same thing.
In general, the task is this: there are two lists of names, worded differently, and I need to find the correspondences between their elements.
I've tried to come up with something myself, but it runs slowly and makes a lot of errors. So I thought, maybe there is a ready-made solution?


1 answer(s)
Vasily, 2016-05-27
@Applez

Look for libraries that implement LSA (latent semantic analysis). For example, here.
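Before reaching for LSA, it may be worth trying a much simpler baseline to see how far it gets on short product names. The sketch below is not LSA and does not come from any library: it splits each name into lowercase tokens and scores a pair by Jaccard overlap (shared tokens divided by all distinct tokens). The class name `NameMatcher`, its methods, and the 0.3 threshold are all made up for illustration, and the second candidate name is invented test data.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Optional;
import java.util.Set;

// Hypothetical helper for illustration only: a token-overlap baseline, not LSA.
public class NameMatcher {

    // Lowercase the name and split it on anything that is not a letter or a digit.
    static Set<String> tokens(String name) {
        Set<String> result = new HashSet<>();
        for (String t : name.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty()) {
                result.add(t);
            }
        }
        return result;
    }

    // Jaccard similarity: shared tokens / all distinct tokens, in the range [0, 1].
    static double similarity(String a, String b) {
        Set<String> ta = tokens(a);
        Set<String> tb = tokens(b);
        if (ta.isEmpty() && tb.isEmpty()) {
            return 0.0;
        }
        Set<String> common = new HashSet<>(ta);
        common.retainAll(tb);
        int union = ta.size() + tb.size() - common.size();
        return (double) common.size() / union;
    }

    // Return the highest-scoring candidate, or empty if none reaches the threshold.
    static Optional<String> bestMatch(String name, List<String> candidates, double threshold) {
        String best = null;
        double bestScore = -1.0;
        for (String c : candidates) {
            double s = similarity(name, c);
            if (s > bestScore) {
                best = c;
                bestScore = s;
            }
        }
        return bestScore >= threshold ? Optional.ofNullable(best) : Optional.empty();
    }

    public static void main(String[] args) {
        String name = "HP 728 300-ml Cyan DesignJet Ink Cartridge";
        List<String> candidates = Arrays.asList(
                "Canon PG-545 Black Ink Cartridge", // invented example entry
                "Hp 728 - Cyan - Original - Designjet - Blækpatron");
        // The threshold 0.3 is an arbitrary assumption; tune it on real data.
        System.out.println(bestMatch(name, candidates, 0.3).orElse("no match"));
    }
}
```

On the two names from the question this gives 4 shared tokens (hp, 728, cyan, designjet) out of 10 distinct ones, so a score of 0.4. Comparing every name in one list against every name in the other is quadratic, so for large lists you would probably want to index tokens first. A measure like this cannot tell that "Blækpatron" and "Ink Cartridge" mean the same thing, which is the kind of gap LSA or a translation dictionary is meant to close.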
