So...
This is the SHOW CREATE TABLE output:
# Host: localhost
# Database: corpus
# Table: 'kbase'
#
CREATE TABLE `kbase` (
`word` varchar(100) NOT NULL default '',
`changes` varchar(100) NOT NULL default '',
`status` enum('unknown','accept','reject','change') NOT NULL default 'unknown',
PRIMARY KEY (`word`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;
and these are the character codes in C# format (to get an idea
of what the characters look like, take a look at the Character Map
in Windows XP).
Small a with an accent on top of its head: \u00e0 (U+00E0)
Small c with a dot on top of its head: \u010b (U+010B)
Small h with a small line through its head: \u0127 (U+0127)
Small z with a small dot on top of its head: \u017c (U+017C)
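
For readers without the Character Map to hand, the official Unicode names of
these four code points can be looked up with a short Python sketch. Python and
the helper name describe() are chosen here purely for illustration; they are
not part of the setup described in this post.

```python
import unicodedata

# The four code points listed above, written as Python string escapes.
CHARS = ["\u00e0", "\u010b", "\u0127", "\u017c"]

def describe(ch):
    """Return 'U+XXXX <official Unicode name>' for one character."""
    return "U+%04X %s" % (ord(ch), unicodedata.name(ch))

for ch in CHARS:
    print(describe(ch))
```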
-----------------------
Please note the following:
If I insert 'accessibli' with the small dots on the c's
and afterwards I insert 'accessibli' without the dots
on the c's (i.e. normal english 'c'), the second insertion is
not successful (duplicate key).
If I insert 'adezjoni' with the small dot on the z,
and afterwards I insert 'adezjoni' without the dot
on the z, the latter insertion is unsuccessful
(duplicate key).
If I insert 'behsiebna' with the h having a small line
through its head, and, afterwards I insert 'behsiebna'
with a normal 'h' (i.e. as in the quotes), the insertion
IS successful.
If I insert 'appogg' with the dots on the g's and afterwards
I insert 'appogg' with a normal english 'g', the latter insertion
is unsuccessful.
If I insert 'realta' with the accent on the 'a', and then insert
'realta' without an accent on the 'a', the second insertion
is unsuccessful.
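
The pattern in the five cases above lines up with how Unicode itself treats
these letters, which may be a useful hint. The sketch below is an analogy only,
not MySQL's comparison code. It assumes the table uses the default collation
for utf8 and that this collation folds a letter onto its base letter roughly
when Unicode defines a decomposition for it. The helper names strip_marks() and
collides() are hypothetical, and the positions of the accented letters in each
word are assumed from the descriptions above. The g with a dot is U+0121.

```python
import unicodedata

def strip_marks(word):
    """Decompose to NFD, then drop combining marks (accents, dots)."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(c for c in decomposed if not unicodedata.combining(c))

def collides(accented, plain):
    """True if the accented spelling reduces to the plain spelling."""
    return strip_marks(accented) == plain

# (accented spelling, plain spelling); accent positions are assumptions.
PAIRS = [
    ("a\u010b\u010bessibli", "accessibli"),  # c with dot above
    ("ade\u017cjoni", "adezjoni"),           # z with dot above
    ("be\u0127siebna", "behsiebna"),         # h with stroke
    ("appo\u0121\u0121", "appogg"),          # g with dot above
    ("realt\u00e0", "realta"),               # a with grave accent
]

for accented, plain in PAIRS:
    verdict = "collides" if collides(accented, plain) else "distinct"
    print(plain, verdict)
```

Only 'behsiebna' comes out as distinct here, because U+0127 is a letter in its
own right with no decomposition into 'h' plus a mark. That matches the one
insertion that succeeded.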
Duncan