Answer the question
In order to leave comments, you need to log in
How to add html and plain text files to Sphinx index?
The Sphinx documentation says "The data to be indexed can generally come from very different sources: SQL databases, plain text files, HTML files , mailboxes, and so on".
But the Quick Sphinx usage tour only shows you how to set up Sphinx to work with a Mysql database. How to configure it to work with html and plain text?
Answer the question
In order to leave comments, you need to log in
You will need xmlpipe data source:
sphinxsearch.com/docs/1.10/xmlpipe2.html
And you will have to write a script that will make xml (in the format as it is written there) from html or plaintext files (although maybe already something ready wrote, it is necessary to google).
Indexing example from MemcacheDb:
nutrun.com/weblog/distributed-key-value-store-indexing/
Here pdf is even indexed:
www.sphinxsearch.com/forum/view.html?id=338
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question