"Unicode"
Posted Oct 30, 2009 21:59 UTC (Fri) by
spitzak (guest, #4593)
In reply to:
"Unicode" by nix
Parent article:
Proposal: Moratorium on Python language changes
hell, there they'd probably just go back to 7-bit ASCII!
Indeed they have. I have personally encountered software that "fixed" encoding problems by masking the high bit, by removing all bytes with the high bit set, and by replacing all bytes with the high bit set with "\xNN" sequences. So claiming that they would even preserve ISO-8859-1 was perhaps being too kind. In fact, by going ASCII-only we are regressing to before the 1980s.
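For anyone who has not seen these "fixes" in the wild, here is a rough sketch of what each one does to a UTF-8 string. The function names are made up for illustration and are not taken from any particular piece of software:

```python
# Hypothetical reconstructions of the three "fixes"; names are illustrative only.

def mask_high_bit(data: bytes) -> bytes:
    """Clear bit 7 of every byte, turning non-ASCII bytes into unrelated ASCII."""
    return bytes(b & 0x7F for b in data)

def drop_high_bytes(data: bytes) -> bytes:
    """Silently delete every byte that has the high bit set."""
    return bytes(b for b in data if b < 0x80)

def escape_high_bytes(data: bytes) -> str:
    """Replace every high-bit byte with a literal backslash-x-NN sequence."""
    return "".join(chr(b) if b < 0x80 else "\\x%02X" % b for b in data)

sample = "caf\u00e9".encode("utf-8")   # the bytes b'caf\xc3\xa9'

print(mask_high_bit(sample))      # the accented letter becomes two junk ASCII characters
print(drop_high_bytes(sample))    # the accented letter vanishes
print(escape_high_bytes(sample))  # the accented letter becomes eight ASCII characters
```

All three throw away or mangle the original character, and only the last one is even reversible in principle.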
What is happening in the Far East is that Asian text is getting stored in UCS-2 (though they may claim it is UTF-16), or in non-error-throwing encodings such as the older JP multibyte, while all other text is in ISO-8859-1 or ASCII (they may claim it is UTF-8). Thus text is split into two different file types, the exact thing Unicode was supposed to fix!
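A small sketch of why those two claims do not hold up, using only Python's standard codecs. The sample strings are my own choice, not from any real file:

```python
# ISO-8859-1 bytes labelled as UTF-8: a strict UTF-8 decoder rejects them.
latin1_bytes = "caf\u00e9".encode("iso-8859-1")   # b'caf\xe9'
try:
    latin1_bytes.decode("utf-8")
    latin1_is_valid_utf8 = True
except UnicodeDecodeError:
    latin1_is_valid_utf8 = False

# UCS-2 labelled as UTF-16: UCS-2 has exactly one 16-bit unit per character,
# so it cannot represent anything outside the Basic Multilingual Plane.
bmp_char = "\u65e5"          # a CJK character inside the BMP
astral_char = "\U00020000"   # a CJK Extension B character outside the BMP

bmp_units = len(bmp_char.encode("utf-16-le")) // 2        # one 16-bit unit
astral_units = len(astral_char.encode("utf-16-le")) // 2  # a surrogate pair

print(latin1_is_valid_utf8, bmp_units, astral_units)
```

Software that assumes one 16-bit unit per character is really doing UCS-2, whatever the label on the file says.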