Working with UTF-8 in the kernel
Working with UTF-8 in the kernel
Posted Apr 9, 2019 15:35 UTC (Tue) by foom (subscriber, #14868)In reply to: Working with UTF-8 in the kernel by dvdeug
Parent article: Working with UTF-8 in the kernel
NTFS and exFAT only maps a single utf16 code unit to another single utf16 code unit, via a lookup table written to disk during filesystem creation. No unicode normalization, no multicharacter equivalencies, and no folding for any characters above FFFF.
You say that other cases "have to be dealt with"...but we have widely used examples showing that to not actually be the case.
