Filesystems and case-insensitivity
Filesystems and case-insensitivity
Posted Nov 28, 2018 14:59 UTC (Wed) by gioele (subscriber, #61675)Parent article: Filesystems and case-insensitivity
> Supporting case-insensitive file names requires the encoding-awareness changes in order to define what case folding means for a given character.
"Case" is properly defined only in terms of locale, not of encoding. Knowing the encoding (say, UTF-8+NFD vs UTF-16+NFKC) is necessary, but not sufficient. The user locale is needed as well.
In English "istanBUL" matches case-insensitively "Istanbul", in Turkish it does not. (In Turkish the uppercase version of "i" is "İ".)
What the developers could do is a kind of case-insensitive look-up that also clusters together "similar" letters. Defining which characters are similar opens, however, another can of worms (see `confusables.txt` from Unicode or all the discussions around IDNA and its Nameprep algorithm).
Maybe we should come up with another technical name for these locale-independent imprecise implementations of case insensitiveness?
