Working with UTF-8 in the kernel
Posted Mar 30, 2019 21:44 UTC (Sat) by mirabilos (subscriber, #84359)
In reply to: Working with UTF-8 in the kernel by foom
Parent article: Working with UTF-8 in the kernel
Another reason why this belongs in userspace.
And no, the Turkish case is not theoretical. Turkish has words that differ only in the dot above the i, and in one case one of the two words is ordinary and the other a rather crass insult. That led to (IIRC) a knife attack (well, some kind of real-life attack on the person), because the sender had no dotless i on their keyboard when texting.
I’ll quote someone else: just because your Latin alphabet has 26 letters doesn’t mean everyone else’s does. Imagine if we *always* (regardless of which word it’s in) made “oo” compare the same as “u”, for example.
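
To make the dotted/dotless i problem concrete, here is a minimal Python sketch. It is not how the kernel patches work; `upper_default` and `upper_turkish` are hypothetical helper names, and the word pair is just an illustrative example of two strings that differ only in the dot. The point is that a locale-independent case mapping makes the two compare equal, while a Turkish-aware mapping keeps them apart.

```python
def upper_default(s: str) -> str:
    # Locale-independent Unicode uppercasing: both "i" (U+0069) and
    # dotless "ı" (U+0131) map to plain "I" (U+0049).
    return s.upper()


def upper_turkish(s: str) -> str:
    # Hypothetical Turkish-aware variant: dotted "i" must become
    # dotted capital "İ" (U+0130); dotless "ı" still becomes "I".
    return s.replace("i", "\u0130").upper()


dotless = "\u0131l\u0131k"  # "ılık", written with dotless i
dotted = "ilik"             # "ilik", written with dotted i

# The default mapping collapses the two distinct strings into one.
print(upper_default(dotless) == upper_default(dotted))

# The Turkish-aware mapping keeps them distinct.
print(upper_turkish(dotless) == upper_turkish(dotted))
```

Which of the two mappings is "right" depends on the language of the text, which a filesystem generally cannot know.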
