Hi!
On Jul 09, Peter Cruickshank wrote:
> Hello
>
> If this is the wrong place to ask this question, I apologise in advance.
>
> I am part of a team working on adapting a MySql-based CMS to work
> with Persian, and the ability to collate content accurately is proving a
> bit of an issue...
>
> My question is: is anyone working on adding Persian (Farsi) UTF-8 collation
> to MySql? If so, is there an estimate of when the work could be done?

Nobody is working on it, as far as I know.
Unfortunately it is not very high on our todo list: all the developers
are busy with higher-priority tasks.

That said, adding it is not very difficult, as MySQL already supports
almost everything that is necessary. In particular, within the UCA
(Unicode Collation Algorithm) framework one can easily define collations
like this one (quoting from ctype-uca.c in the MySQL sources):

static const char czech[]=
    "& C < \\u010D <<< \\u010C "
    "& H < ch <<< Ch <<< CH"
    "& R < \\u0159 <<< \\u0158"
    "& S < \\u0161 <<< \\u0160"
    "& Z < \\u017E <<< \\u017D";

For now, contractions (like "ch" above) are only implemented for the
basic Latin characters 'a'..'z'. It won't be difficult to extend the code
to support contractions for arbitrary Unicode characters, and then a
Persian collation would be possible.
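
To illustrate what a contraction means for comparison, here is a toy
sketch in C. It is NOT the code from ctype-uca.c; the function names and
the weight values are made up for this example, and it handles only the
primary level of the "& H < ch" rule over ASCII letters. The idea is
simply that "ch" is consumed as one unit and gets a weight just after 'h'.

```c
#include <ctype.h>

/* Hypothetical helper: does s start with one of the three spellings
   listed in the rule "& H < ch <<< Ch <<< CH"? */
static int starts_with_ch(const char *s)
{
    return (s[0] == 'c' && s[1] == 'h') ||
           (s[0] == 'C' && (s[1] == 'h' || s[1] == 'H'));
}

/* Return a made-up primary weight for the next collation element and
   advance *p past it. Letters get even weights (a=2, b=4, ...), so the
   odd slot after 'h' is free for the "ch" contraction. Returns 0 at the
   end of the string. */
static int next_weight(const char **p)
{
    const char *s = *p;
    int c;

    if (s[0] == '\0')
        return 0;

    if (starts_with_ch(s)) {
        *p = s + 2;                       /* two characters, one element */
        return 2 * ('h' - 'a' + 1) + 1;   /* sorts right after 'h' */
    }

    *p = s + 1;
    c = tolower((unsigned char)s[0]);
    if (c >= 'a' && c <= 'z')
        return 2 * (c - 'a' + 1);
    return 1000 + (unsigned char)s[0];    /* assumption: others sort last */
}

/* Compare two ASCII strings at the primary level only (case and the
   "<<<" tertiary differences are ignored in this sketch). */
int toy_czech_compare(const char *a, const char *b)
{
    for (;;) {
        int wa = next_weight(&a);
        int wb = next_weight(&b);
        if (wa != wb)
            return wa < wb ? -1 : 1;
        if (wa == 0)
            return 0;
    }
}
```

With this, toy_czech_compare("hrad", "chata") is negative, although a
plain byte comparison would put "chata" first. Supporting Persian would
need the same kind of lookup, but keyed on arbitrary Unicode code points
instead of 'a'..'z'.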
If you're interested in doing it, we would be very glad to help with any
information and guidance that would be necessary!
Regards,
Sergei
--
   __  ___     ___ ____  __
  /  |/  /_ __/ __/ __ \/ /   Sergei Golubchik <serg@stripped>
 / /|_/ / // /\ \/ /_/ / /__  MySQL AB, Senior Software Developer
/_/  /_/\_, /___/\___\_\___/  Osnabrueck, Germany
       <___/  www.mysql.com