Jump to content

Sphinx and unicode search in 2.3.6


Recommended Posts

Posted

Hello

I am using the manual search and not the full text as only manual give results for greek characters even from my latin 1 database but this is slowing down my server.

I was read about sphinx engine and the ipb plugin for 2.3.6 and i am wondering if search for greek characters will work ?

Did anyone test a similar configuration?

Thank you

  • 1 month later...
Posted

I have made a sphinx configuration for Greek, and it's a good bit of work.... ;)

What character set are you using, iso-8869-7 or utf-8?

The biggest problem is you need to write the character mapping for the config file. They have a suggested one here:

http://sphinxsearch.com/wiki/doku.php?id=charset_tables#greek

This is supposed to work for iso-8859-7: http://sphinxsearch.com/forum/view.html?id=364

But for utf-8, I had to write my own (see last post on this page): http://www.sphinxsearch.com/forum/view.html?id=1047

The main problem is some of the characters that aren't used very often that have markings (tonos, etc.) need to be folded into the version of the character that doesn't have the markings, and then on top of that, case folding needed to be done. I'm not even sure if that char mapping is the final version we ended up using, but it should get you started.


Long story short, yes, Sphinx works with greek fine.

Posted

You want to use the utf-8 mapping I believe. Keep in mind, you need to set query_pre or whatever it is in sphinx config to "SET NAMES utf8" if you have a charset configured in IPB's conf_global.php file.

  • 2 months later...
Posted

I have made a sphinx configuration for Greek, and it's a good bit of work.... ;)



...



But for utf-8, I had to write my own (see last post on this page): http://www.sphinxsearch.com/forum/view.html?id=1047



The main problem is some of the characters that aren't used very often that have markings (tonos, etc.) need to be folded into the version of the character that doesn't have the markings, and then on top of that, case folding needed to be done. I'm not even sure if that char mapping is the final version we ended up using, but it should get you started.




Long story short, yes, Sphinx works with greek fine.



Brandon, I would like to do exactly the same (Greek support for sphinx in utf8 forum) and I tried so many different versions/changes in sphinx.conf but none worked. (I can search fine with English characters, but searches with Greek characters return zero results!).

Could you please post (or pm to me) your sphinx.conf file here, so I can make the correct changes?
I try with the latest version (0.9.9). Maybe this is a problem?

Many thanks in advance!
Posted

ΟΚ, it finally worked (at least with 0.9.8.1)!
What I was doing wrong is that I did not put the "sql_query_pre = SET NAMES utf8" query at MULTIPLE places in sphinx.conf file, I thought that putting it at the setup section would suffice.
Brandon your greek char tables seem to work very fine, although I will check them again, as you ask in your post at the sphinx forum above.
However, after so much tests and failures, I don't have the courage to test 0.9.9 once again :) - it clearly seemed to have some problems though..

Archived

This topic is now archived and is closed to further replies.

  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...