List:Internals« Previous MessageNext Message »
From:Thomas Spahni Date:October 6 2005 11:27am
Subject:Re: MyISAM FT-Index and stop words
View as plain text  
Dear Hagen

you may keep your stopwords in a separate file (available as of MySQL
4.0.10.).

In /etc/my.cnf I have

set-variable    = ft_stopword_file=/etc/my.stopwords

The file format is one stopword per line.

Regards,
Thomas Spahni

On Thu, 6 Oct 2005, Hagen Höpfner wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Dear list,
>
> I know that the MyISAM sources include some stop words (words longer
> than 2 characters, that does not have to be indexed) for English texts.
> If I want do index German (or other languages) texts this is not really
> helpful. Is it possible to adapt the stop word list to another language
> without modifying the source code?
>
> Hagen
>
> - --
> - ------------------------------------------------------------
> Dr.-Ing. Hagen Höpfner | hoepfner@stripped | +49 7251 700 239
> - ------------------------------------------------------------
>          http://www.i-u.de/schools/hopfner/index.html
>
>              International University in Germany
>                School of Information Technology
>                   Campus 3  D-76646 Bruchsal
> - ------------------------------------------------------------
> Information about the German textbook  on  "Mobile Databases
> and   Information   Systems"  by   H. Höpfner, C. Türker and
> B. König-Ries are available at http://www.dpunkt.de/.
> - ------------------------------------------------------------
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.1 (MingW32)
> Comment: GnuPT 2.6.0 by EQUIPMENTE.DE
> Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
>
> iD8DBQFDRN+7ataeEpWdQtYRAj0WAJ4rHd3TFXUkkwzAwe1PDjzGHi6HZwCfaoei
> /BCd9HfO6XyONmJVTYTeOBY=
> =PdF1
> -----END PGP SIGNATURE-----
>
>

Thread
MyISAM FT-Index and stop wordsHagen Höpfner6 Oct
  • Re: MyISAM FT-Index and stop wordsThomas Spahni6 Oct