Author: paul
Date: 2007-11-29 14:56:07 +0100 (Thu, 29 Nov 2007)
New Revision: 8979
Log:
r27548@frost: paul | 2007-11-29 07:52:54 -0600
Generalize a _general_ci vs. _unicode_ci collation performance difference to
apply to all Unicode character sets. (Bar, Sinisa)
Modified:
trunk/refman-4.1/internationalization.xml
trunk/refman-5.0/internationalization.xml
trunk/refman-5.1/internationalization.xml
trunk/refman-6.0/internationalization.xml
Property changes on: trunk
___________________________________________________________________
Name: svk:merge
- 4767c598-dc10-0410-bea0-d01b485662eb:/mysqldoc-local/mysqldoc/trunk:33513
7d8d2c4e-af1d-0410-ab9f-b038ce55645b:/mysqldoc-local/mysqldoc:27533
b5ec3a16-e900-0410-9ad2-d183a3acac99:/mysqldoc-local/mysqldoc/trunk:14218
bf112a9c-6c03-0410-a055-ad865cd57414:/mysqldoc-local/mysqldoc/trunk:22282
+ 4767c598-dc10-0410-bea0-d01b485662eb:/mysqldoc-local/mysqldoc/trunk:33513
7d8d2c4e-af1d-0410-ab9f-b038ce55645b:/mysqldoc-local/mysqldoc:27548
b5ec3a16-e900-0410-9ad2-d183a3acac99:/mysqldoc-local/mysqldoc/trunk:14218
bf112a9c-6c03-0410-a055-ad865cd57414:/mysqldoc-local/mysqldoc/trunk:22282
Modified: trunk/refman-4.1/internationalization.xml
===================================================================
--- trunk/refman-4.1/internationalization.xml 2007-11-29 13:46:04 UTC (rev 8978)
+++ trunk/refman-4.1/internationalization.xml 2007-11-29 13:56:07 UTC (rev 8979)
Changed blocks: 1, Lines Added: 18, Lines Deleted: 16; 2584 bytes
@@ -3663,25 +3663,27 @@
</remark>
<para>
- The most significant feature in
- <literal>utf8_unicode_ci</literal> is that it supports
- expansions; that is, when one character compares as equal to
- combinations of other characters. For example, in German and
- some other languages <quote><literal>ß</literal></quote> is
- equal to <quote><literal>ss</literal></quote>.
+ For any Unicode character set, operations performed using the
+ <literal>_general_ci</literal> collation are faster than those
+ for the <literal>_unicode_ci</literal> collation. For example,
+ comparisons for the <literal>utf8_general_ci</literal>
+ collation are faster, but slightly less correct, than
+ comparisons for <literal>utf8_unicode_ci</literal>. The reason
+ for this is that <literal>utf8_unicode_ci</literal> supports
+ mappings such as expansions; that is, when one character
+ compares as equal to combinations of other characters. For
+ example, in German and some other languages
+ <quote><literal>ß</literal></quote> is equal to
+ <quote><literal>ss</literal></quote>.
+ <literal>utf8_unicode_ci</literal> also supports contractions
+ and ignorable characters. <literal>utf8_general_ci</literal>
+ is a legacy collation that does not support expansions,
+ contractions, or ignorable characters. It can make only
+ one-to-one comparisons between characters.
</para>
<para>
- <literal>utf8_general_ci</literal> is a legacy collation that
- does not support expansions. It can make only one-to-one
- comparisons between characters. This means that comparisons
- for the <literal>utf8_general_ci</literal> collation are
- faster, but slightly less correct, than comparisons for
- <literal>utf8_unicode_ci</literal>.
- </para>
-
- <para>
- For example, the following equalities hold in both
+ To further illustrate, the following equalities hold in both
<literal>utf8_general_ci</literal> and
<literal>utf8_unicode_ci</literal> (for the effect this has in
comparisons or when doing searches, see
Modified: trunk/refman-5.0/internationalization.xml
===================================================================
--- trunk/refman-5.0/internationalization.xml 2007-11-29 13:46:04 UTC (rev 8978)
+++ trunk/refman-5.0/internationalization.xml 2007-11-29 13:56:07 UTC (rev 8979)
Changed blocks: 1, Lines Added: 18, Lines Deleted: 16; 2584 bytes
@@ -3564,25 +3564,27 @@
</remark>
<para>
- The most significant feature in
- <literal>utf8_unicode_ci</literal> is that it supports
- expansions; that is, when one character compares as equal to
- combinations of other characters. For example, in German and
- some other languages <quote><literal>ß</literal></quote> is
- equal to <quote><literal>ss</literal></quote>.
+ For any Unicode character set, operations performed using the
+ <literal>_general_ci</literal> collation are faster than those
+ for the <literal>_unicode_ci</literal> collation. For example,
+ comparisons for the <literal>utf8_general_ci</literal>
+ collation are faster, but slightly less correct, than
+ comparisons for <literal>utf8_unicode_ci</literal>. The reason
+ for this is that <literal>utf8_unicode_ci</literal> supports
+ mappings such as expansions; that is, when one character
+ compares as equal to combinations of other characters. For
+ example, in German and some other languages
+ <quote><literal>ß</literal></quote> is equal to
+ <quote><literal>ss</literal></quote>.
+ <literal>utf8_unicode_ci</literal> also supports contractions
+ and ignorable characters. <literal>utf8_general_ci</literal>
+ is a legacy collation that does not support expansions,
+ contractions, or ignorable characters. It can make only
+ one-to-one comparisons between characters.
</para>
<para>
- <literal>utf8_general_ci</literal> is a legacy collation that
- does not support expansions. It can make only one-to-one
- comparisons between characters. This means that comparisons
- for the <literal>utf8_general_ci</literal> collation are
- faster, but slightly less correct, than comparisons for
- <literal>utf8_unicode_ci</literal>.
- </para>
-
- <para>
- For example, the following equalities hold in both
+ To further illustrate, the following equalities hold in both
<literal>utf8_general_ci</literal> and
<literal>utf8_unicode_ci</literal> (for the effect this has in
comparisons or when doing searches, see
Modified: trunk/refman-5.1/internationalization.xml
===================================================================
--- trunk/refman-5.1/internationalization.xml 2007-11-29 13:46:04 UTC (rev 8978)
+++ trunk/refman-5.1/internationalization.xml 2007-11-29 13:56:07 UTC (rev 8979)
Changed blocks: 1, Lines Added: 18, Lines Deleted: 16; 2584 bytes
@@ -3561,25 +3561,27 @@
</remark>
<para>
- The most significant feature in
- <literal>utf8_unicode_ci</literal> is that it supports
- expansions; that is, when one character compares as equal to
- combinations of other characters. For example, in German and
- some other languages <quote><literal>ß</literal></quote> is
- equal to <quote><literal>ss</literal></quote>.
+ For any Unicode character set, operations performed using the
+ <literal>_general_ci</literal> collation are faster than those
+ for the <literal>_unicode_ci</literal> collation. For example,
+ comparisons for the <literal>utf8_general_ci</literal>
+ collation are faster, but slightly less correct, than
+ comparisons for <literal>utf8_unicode_ci</literal>. The reason
+ for this is that <literal>utf8_unicode_ci</literal> supports
+ mappings such as expansions; that is, when one character
+ compares as equal to combinations of other characters. For
+ example, in German and some other languages
+ <quote><literal>ß</literal></quote> is equal to
+ <quote><literal>ss</literal></quote>.
+ <literal>utf8_unicode_ci</literal> also supports contractions
+ and ignorable characters. <literal>utf8_general_ci</literal>
+ is a legacy collation that does not support expansions,
+ contractions, or ignorable characters. It can make only
+ one-to-one comparisons between characters.
</para>
<para>
- <literal>utf8_general_ci</literal> is a legacy collation that
- does not support expansions. It can make only one-to-one
- comparisons between characters. This means that comparisons
- for the <literal>utf8_general_ci</literal> collation are
- faster, but slightly less correct, than comparisons for
- <literal>utf8_unicode_ci</literal>.
- </para>
-
- <para>
- For example, the following equalities hold in both
+ To further illustrate, the following equalities hold in both
<literal>utf8_general_ci</literal> and
<literal>utf8_unicode_ci</literal> (for the effect this has in
comparisons or when doing searches, see
Modified: trunk/refman-6.0/internationalization.xml
===================================================================
--- trunk/refman-6.0/internationalization.xml 2007-11-29 13:46:04 UTC (rev 8978)
+++ trunk/refman-6.0/internationalization.xml 2007-11-29 13:56:07 UTC (rev 8979)
Changed blocks: 1, Lines Added: 18, Lines Deleted: 16; 2584 bytes
@@ -3573,25 +3573,27 @@
</remark>
<para>
- The most significant feature in
- <literal>utf8_unicode_ci</literal> is that it supports
- expansions; that is, when one character compares as equal to
- combinations of other characters. For example, in German and
- some other languages <quote><literal>ß</literal></quote> is
- equal to <quote><literal>ss</literal></quote>.
+ For any Unicode character set, operations performed using the
+ <literal>_general_ci</literal> collation are faster than those
+ for the <literal>_unicode_ci</literal> collation. For example,
+ comparisons for the <literal>utf8_general_ci</literal>
+ collation are faster, but slightly less correct, than
+ comparisons for <literal>utf8_unicode_ci</literal>. The reason
+ for this is that <literal>utf8_unicode_ci</literal> supports
+ mappings such as expansions; that is, when one character
+ compares as equal to combinations of other characters. For
+ example, in German and some other languages
+ <quote><literal>ß</literal></quote> is equal to
+ <quote><literal>ss</literal></quote>.
+ <literal>utf8_unicode_ci</literal> also supports contractions
+ and ignorable characters. <literal>utf8_general_ci</literal>
+ is a legacy collation that does not support expansions,
+ contractions, or ignorable characters. It can make only
+ one-to-one comparisons between characters.
</para>
<para>
- <literal>utf8_general_ci</literal> is a legacy collation that
- does not support expansions. It can make only one-to-one
- comparisons between characters. This means that comparisons
- for the <literal>utf8_general_ci</literal> collation are
- faster, but slightly less correct, than comparisons for
- <literal>utf8_unicode_ci</literal>.
- </para>
-
- <para>
- For example, the following equalities hold in both
+ To further illustrate, the following equalities hold in both
<literal>utf8_general_ci</literal> and
<literal>utf8_unicode_ci</literal> (for the effect this has in
comparisons or when doing searches, see
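
Note on the revised paragraph: the difference between one-to-one comparison and comparison with expansions can be sketched in a few lines of Python. This is only a toy model. The two mapping tables and the function names general_key and unicode_key are invented for illustration and are not MySQL's collation data or code. It assumes that a one-to-one collation treats "ß" as a single "s", while an expansion-aware collation treats it as "ss".

```python
# Toy illustration only: these tables are invented for the example and
# are not MySQL's actual collation weights.

# One-to-one mapping (the _general_ci style): every character maps to
# exactly one weight, so "ß" can only equal a single character.
GENERAL_WEIGHTS = {"ß": "s"}

# Mapping with expansions (the _unicode_ci style): one character may map
# to a sequence of weights, so "ß" can compare equal to "ss".
UNICODE_EXPANSIONS = {"ß": "ss"}


def general_key(text):
    """Build a comparison key by mapping each character to one weight."""
    return "".join(GENERAL_WEIGHTS.get(ch, ch) for ch in text.lower())


def unicode_key(text):
    """Build a comparison key where a character may expand to several weights."""
    return "".join(UNICODE_EXPANSIONS.get(ch, ch) for ch in text.lower())


if __name__ == "__main__":
    # With expansions, "Straße" and "Strasse" produce the same key.
    print(unicode_key("Straße") == unicode_key("Strasse"))
    # With one-to-one weights, the keys differ in length and cannot match.
    print(general_key("Straße") == general_key("Strasse"))
```

The one-to-one key always has one weight per input character, which is what makes that style of comparison cheaper and also what prevents "ß" from ever matching a two-character sequence.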