If Unicode had stayed a reasonable project, and not turned into - it appears - a full-fledged programming language capable of hosting exploits - then the difference between bytes and text would be much less of a problem. I *LIKED* the proposal for a simple system which just expanded a character to a 2-byte entity, from a one-byte one. Now, we've got three or four different systems that all have to be checked for, since trusting the encoding information presented to us from outside sources is a fool's mistake. Too many linguistics academics and typographers got into the whole mess with defining the character set.
Copyright © 2017, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds