Sphinx
propovednik, 2012-01-16 09:21:13

Question marks instead of Russian letters in Sphinx Search 2.0.2

I have a fully UTF-8 MySQL database, with UTF-8 tables and UTF-8 fields. Even with all the settings listed below, the search results contain question marks instead of Russian characters. If I instead generate XML files and feed the data through xmlpipe2, the search works as it should and shows Russian characters, so most likely the data already arrives from the database with question marks. Could you tell me what else I might have forgotten, and where?

Here is what SHOW VARIABLES reports:

character_set_client utf8
character_set_connection utf8
character_set_database utf8
character_set_results utf8
character_set_server utf8
character_set_system utf8
collation_connection utf8_general_ci
collation_database utf8_general_ci
collation_server utf8_general_ci

In the Sphinx config:

sql_query_pre = SET NAMES utf8
sql_query_pre = SET CHARACTER SET utf8

In the index section of the Sphinx config (charset_table is the default from the manual for charset_type utf-8):

charset_type = utf-8
charset_table = 0..9, A..Z->a..z, _, a..z, U+410..U+42F->U+430..U+44F, U+430..U+44F
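
For readers unsure where question marks like these can come from: they typically appear when text passes through a charset that cannot represent it. The sketch below only mimics that effect in Python. It assumes latin-1 as a stand-in for whatever non-UTF-8 charset might sit between the database and the indexer, and it is not a diagnosis of this particular setup.

```python
# Illustrative sketch: mimics a lossy charset conversion in plain Python.
# It does not talk to MySQL or Sphinx; latin-1 is an assumed stand-in charset.
text = "привет"

# UTF-8 can represent Cyrillic, so a round trip keeps the text intact.
utf8_round_trip = text.encode("utf-8").decode("utf-8")

# latin-1 has no Cyrillic letters; with errors="replace" each one becomes "?".
lossy = text.encode("latin-1", errors="replace").decode("latin-1")

print(utf8_round_trip)  # привет
print(lossy)            # ??????
```

If the stored or fetched text looks like the second line, the damage happened before indexing, which matches the suspicion that the data already arrives from the database with question marks.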


3 answers
Arkady, 2012-01-16
@p0is0n

Is your Sphinx actually returning text data?

Iskander Giniyatullin, 2012-01-16
@rednaxi

Sphinx should return only the ids of the matching documents, and you should build the search results from those ids yourself.
In principle, when you use Sphinx through the console, it can also display some information as part of a query result, but that is meant only for testing and is only available through the console.
Once you start using Sphinx in your project, you will work with document ids, so Sphinx will not return any letters at all, neither Russian nor English.
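
A rough sketch of that workflow, in case it helps: take the ids from the search result, fetch the rows from the database yourself, and keep the order the search engine gave you. Everything here is an assumption for illustration: the `documents` table, the `fetch_documents` helper, the hard-coded ids, and the in-memory SQLite database that stands in for MySQL so the example runs on its own.

```python
import sqlite3

def fetch_documents(conn, ids):
    """Return (id, title) rows for the given ids, in the order the ids were given."""
    if not ids:
        return []
    placeholders = ",".join("?" for _ in ids)
    rows = conn.execute(
        f"SELECT id, title FROM documents WHERE id IN ({placeholders})",
        list(ids),
    ).fetchall()
    by_id = {row[0]: row for row in rows}
    # Keep the ranking order from the search engine; skip ids missing from the DB.
    return [by_id[i] for i in ids if i in by_id]

# Demo: in-memory SQLite stands in for the real MySQL database (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, title TEXT)")
conn.executemany(
    "INSERT INTO documents (id, title) VALUES (?, ?)",
    [(1, "Первый документ"), (2, "Второй документ"), (3, "Third document")],
)

matched_ids = [3, 1]  # hypothetical ids, as if returned by a Sphinx query
results = fetch_documents(conn, matched_ids)
print(results)  # [(3, 'Third document'), (1, 'Первый документ')]
```

With this split, the text shown to the user comes straight from the database connection, so its encoding depends on that connection and not on Sphinx.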

gena09, 2015-06-25
@gena09

What about returning snippets? An id alone is not enough for that.
In the my.ini file (for MySQL), no parameter should specify any encoding other than utf-8.
Also try cutting down the privileges of the database user (there is such a trick).
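
One way to check the my.ini advice is a small script that lists every charset or collation setting whose value does not start with utf8. This is only a sketch: the sample file contents and the `non_utf8_settings` helper are made up for illustration, and the regular expression is a simple heuristic, not a full my.ini parser.

```python
import re

# Hypothetical my.ini contents, used only for this demonstration.
SAMPLE_MY_INI = """\
[client]
default-character-set = utf8

[mysqld]
# character-set-server = latin1 (commented out, ignored by the scan)
character-set-server = cp1251
collation-server = utf8_general_ci
"""

# Matches "key = value" lines whose key mentions character-set or collation.
CHARSET_KEY = re.compile(
    r"^\s*([\w-]*(?:character[-_]set|collation)[\w-]*)\s*=\s*(\S+)",
    re.IGNORECASE,
)

def non_utf8_settings(ini_text):
    """Return (option, value) pairs whose value does not start with 'utf8'."""
    found = []
    for line in ini_text.splitlines():
        match = CHARSET_KEY.match(line)
        if match and not match.group(2).lower().startswith("utf8"):
            found.append((match.group(1), match.group(2)))
    return found

suspicious = non_utf8_settings(SAMPLE_MY_INI)
print(suspicious)  # [('character-set-server', 'cp1251')]
```

To use it on a real file, read the file's text and pass it to `non_utf8_settings` in place of the sample string.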
