Rustaceans at the border
Rustaceans at the border
Posted Apr 19, 2022 11:06 UTC (Tue) by ssokolow (guest, #94568)In reply to: Rustaceans at the border by flussence
Parent article: Rustaceans at the border
Wikipedia's commentary on Han Ideographs (Chinese, Japanese Kanji, Korean Hanja, etc.) being only offered precomposed is "However, attempts to do this for character encoding have stumbled over the fact that Chinese characters do not decompose as simply or as regularly as Hangul does."
...and I remember reading that the precomposed Latin stuff is necessary to guarantee that text strings used as opaque lookup keys (eg. filesystem paths) wouldn't get altered when round-tripping between a legacy encoding and Unicode, regardless of the circumstances.
