Solr
strelkovandreyv, 2019-05-17 07:44:49

How to index documents using Apache Solr?

Good afternoon. I have Apache Solr 7 configured to index a database through the DataImportHandler.
Indexing works.
Each database row also contains a link to a file on a network drive. Can that file be fed into the index as well?
As I understand it, Solr has some built-in functionality for this, and there is also the Apache Tika library.
Which direction should I take? (The files are mostly PDF, but there are also doc, docx, xlsx, rtf, etc.)


1 answer(s)
al_gon, 2019-08-09

It's better not to use DataImportHandler at all.
Reasons:
It is hard to extend.
There is no way to scale the indexing process.
Monitoring the indexing process is possible, but very complex and bare-bones.
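
One possible direction, as a rough sketch rather than a definitive setup: a small external script reads the rows itself and sends each linked file to Solr's extracting request handler (Solr Cell, which uses Apache Tika internally). The sketch assumes the handler is enabled in solrconfig.xml at /update/extract; the core name `mycore`, the `title` field and the shape of `rows` are hypothetical and would need to match your schema and database.

```python
import urllib.parse
import urllib.request

# Assumption: a local Solr with a core called "mycore" (hypothetical name).
SOLR_URL = "http://localhost:8983/solr/mycore"


def build_extract_url(solr_url, doc_id, extra_fields=None, commit=False):
    """Build the /update/extract URL; literal.* params carry the database columns."""
    params = [("literal.id", str(doc_id))]
    for name, value in (extra_fields or {}).items():
        params.append(("literal." + name, str(value)))
    if commit:
        params.append(("commit", "true"))
    return solr_url.rstrip("/") + "/update/extract?" + urllib.parse.urlencode(params)


def index_file(solr_url, doc_id, path, extra_fields=None, commit=False):
    """POST the raw file bytes; Solr Cell extracts the text on the server side."""
    url = build_extract_url(solr_url, doc_id, extra_fields, commit)
    with open(path, "rb") as fh:
        body = fh.read()
    request = urllib.request.Request(
        url,
        data=body,
        # Generic content type; the server is expected to detect the real format.
        headers={"Content-Type": "application/octet-stream"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


def index_rows(solr_url, rows):
    """rows: iterable of (id, title, file_path) tuples read from the database."""
    for doc_id, title, path in rows:
        index_file(solr_url, doc_id, path, {"title": title})
```

Calling `index_rows(SOLR_URL, rows)` from several workers, each with its own slice of rows, is one way to get the scaling that DataImportHandler lacks, and the script can log or retry per file. Running Tika outside Solr and posting plain text to /update is an alternative if extraction load on the Solr node becomes a concern.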
