List: General Discussion
From: Sergei Golubchik
Date: April 7 2004 6:54am
Subject: Re: REGEXP with UTF-8

Hi!

On Apr 07, Hirofumi Fujiwara wrote:
> 
> From: Sergei Golubchik <serg@stripped>
> > Hi!
> > 
> > On Apr 06, Hirofumi Fujiwara wrote:
> > > 
> > > I am testing the regular expression feature (REGEXP) of
> > > MySQL 4.1.1-alpha with UTF-8 characters (Japanese).  So far it
> > > doesn't seem to work.
> > > 
> > > LIKE handles each Japanese character properly as one character,
> > > but REGEXP doesn't.
> > > 
> > > I'd like to know if REGEXP handles UTF-8 characters.
> > 
> > No, it does not :(
> > 
> > (we need to change the regexp library that we use to get it fixed)
> 
> I wonder if REGEXP doesn't handle the other multi-byte character
> sets (big5, ujis, sjis, euckr, gb2312, gbk, utf8, ucs2) either, which
> would mean it handles only single-byte character sets (Maxlen=1).

Unfortunately, it does not either :(
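
To illustrate the symptom, here is a minimal sketch in Python (not
MySQL code). It assumes the underlying problem is a matcher that works
byte by byte; Python's re module on bytes merely stands in for such a
library, and the sample string is made up for the example.

```python
import re

text = "日本"                  # two Japanese characters
data = text.encode("utf-8")    # the same text as six bytes (three per character)

# Character-aware matching: "." consumes one whole character,
# so two dots cover the two-character string.
char_match = re.fullmatch(r"..", text) is not None

# Byte-oriented matching: "." consumes a single byte,
# so two dots cannot cover six bytes, but six dots can.
byte_match_2 = re.fullmatch(rb"..", data) is not None
byte_match_6 = re.fullmatch(rb"......", data) is not None

print(char_match, byte_match_2, byte_match_6)
```

With a byte-oriented matcher, a pattern such as '^.$' would therefore
not match a single Japanese character stored as UTF-8, while LIKE '_'
(which is character-aware) would.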

Regards,
Sergei

-- 
   __  ___     ___ ____  __
  /  |/  /_ __/ __/ __ \/ /   Sergei Golubchik <serg@stripped>
 / /|_/ / // /\ \/ /_/ / /__  MySQL AB, Senior Software Developer
/_/  /_/\_, /___/\___\_\___/  Osnabrueck, Germany
       <___/  www.mysql.com