Answer the question
In order to leave comments, you need to log in
Algorithm to cluster documents from multiple parts?
Hey!
I need to cluster documents that have multiple parts, content, title, cities, etc.
Sample document model: Doc(content: String, Title: String, geo: array[String], persons: array[String], ...)
Text fields will be represented as vectors.
It is desirable that each part could be indicated by the weight.
What clustering algorithm can be used? Are there implementations of similar algorithms in Python?
Answer the question
In order to leave comments, you need to log in
fastText
... Facebook announced the open source of the fastText library, which provides tools for text classification using machine learning methods(Note: classification, not clustering)
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question