Bad understanding of UTF-8
Bad understanding of UTF-8
Posted Mar 28, 2009 3:40 UTC (Sat) by njs (subscriber, #40338)In reply to: Bad understanding of UTF-8 by spitzak
Parent article: Wheeler: Fixing Unix/Linux/POSIX Filenames
An "invalid" UTF-8 string can contain only some extraneous bytes in the range 0x80-0xff. These high-order bytes do not cause any problems with any programs.
