8 byte characters?
Posted Aug 12, 2005 13:45 UTC (Fri) by
ringerc (subscriber, #3071)
In reply to:
8 byte characters? by smitty_one_each
Parent article:
Our bloat problem
Many apps use UCS-2 internally, because it's *MUCH* faster to work with for many things than UTF-8 . With utf-8, to take the first 6 characters of a buffer you must decode the UTF-8 data (you don't know if each character is one, two, or four bytes long). With UCS-2, you just return the first 12 bytes of the buffer.
That said - it's only double. For text, that's not a big deal, and really doesn't explain the extreme memory footprints we're seeing.
(
Log in to post comments)