H
H
Hellcunt2010-12-02 02:23:31
Sphinx
Hellcunt, 2010-12-02 02:23:31

How to add html and plain text files to Sphinx index?

The Sphinx documentation says "The data to be indexed can generally come from very different sources: SQL databases, plain text files, HTML files , mailboxes, and so on".
But the Quick Sphinx usage tour only shows you how to set up Sphinx to work with a Mysql database. How to configure it to work with html and plain text?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
A
Antelle, 2010-12-02
@Hellcunt

You will need xmlpipe data source:
sphinxsearch.com/docs/1.10/xmlpipe2.html
And you will have to write a script that will make xml (in the format as it is written there) from html or plaintext files (although maybe already something ready wrote, it is necessary to google).
Indexing example from MemcacheDb:
nutrun.com/weblog/distributed-key-value-store-indexing/
Here pdf is even indexed:
www.sphinxsearch.com/forum/view.html?id=338

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question