
Moving to Python 3

Posted Feb 10, 2011 17:10 UTC (Thu) by marcH (subscriber, #57642)
In reply to: Moving to Python 3 by tialaramex
Parent article: Moving to Python 3

I think a lot of programs support only UCS-2: they would fail in various ways as soon as a supplementary character came along. How many Java programs do you expect to use java.lang.String.codePointCount()?

In this sense, UCS-2 is extremely common in the real world.
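
To make that failure mode concrete, here is a minimal Java sketch. The class name Ucs2Pitfall and its helper methods are made up for illustration and do not come from any real program. It assumes the typical "UCS-2 only" bug is treating one char as one character, and contrasts that with the code-point-aware calls in java.lang.String.

```java
public class Ucs2Pitfall {
    // U+1D11E MUSICAL SYMBOL G CLEF lies outside the BMP,
    // so UTF-16 stores it as a surrogate pair (two chars).
    static final String CLEF = new String(Character.toChars(0x1D11E));

    // "UCS-2 thinking": assume one char is one character.
    static int naiveLength(String s) {
        return s.length();
    }

    // Counts Unicode code points rather than UTF-16 code units.
    static int codePoints(String s) {
        return s.codePointCount(0, s.length());
    }

    // Truncating by char index can cut a surrogate pair in half.
    static String naiveTruncate(String s, int maxChars) {
        return s.substring(0, Math.min(maxChars, s.length()));
    }

    // Truncating by code point index never splits a surrogate pair.
    static String safeTruncate(String s, int maxCodePoints) {
        int n = Math.min(maxCodePoints, codePoints(s));
        return s.substring(0, s.offsetByCodePoints(0, n));
    }

    public static void main(String[] args) {
        String s = "a" + CLEF + "b";
        System.out.println(naiveLength(s)); // 4 UTF-16 code units
        System.out.println(codePoints(s));  // 3 code points

        // The naive cut ends in a lone high surrogate, which is not valid text.
        String cut = naiveTruncate(s, 2);
        System.out.println(Character.isHighSurrogate(cut.charAt(cut.length() - 1)));

        // The code-point-aware cut keeps the clef whole.
        System.out.println(safeTruncate(s, 2).equals("a" + CLEF));
    }
}
```

A program that never indexes into strings itself, and only passes them along, would not hit either problem.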


UTF family

Posted Feb 11, 2011 4:01 UTC (Fri) by tialaramex (subscriber, #21167) [Link]

I expect a lot of Java programs (and other programs) work fine with supplementary characters and myriad other things, so long as they leave anything clever to software written by someone else (or more likely a team of somebody elses) who actually knows lots about text.

What were you imagining they should be using java.lang.String.codePointCount() for? Text is hard, like I said, and a count of Unicode code points is rarely what you need.

Examples of things which are assigned one or more Unicode code points: a harmless, invisible and ignorable marker; an indication that subsequent neutral text is intended to be displayed right-to-left; the cedilla accent on a character; a lowercase x; a vertical tab; an indication that a non-fatal error occurred in some previous processing.
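
A rough Java sketch of why that count is rarely useful follows. The specific code points are my guesses at plausible matches for the list above (for instance U+2060 for the invisible marker and U+FFFD for the error indication); the comment does not name them. The class and method names are hypothetical.

```java
import java.text.BreakIterator;
import java.text.Normalizer;

public class CodePointExamples {
    // One assumed code point per item in the list above.
    static final String SAMPLE = new String(new int[] {
        0x2060, // WORD JOINER: invisible, ignorable marker
        0x200F, // RIGHT-TO-LEFT MARK: directional hint
        0x0327, // COMBINING CEDILLA: an accent, not a letter
        0x0078, // LATIN SMALL LETTER X
        0x000B, // vertical tab (a control character)
        0xFFFD  // REPLACEMENT CHARACTER: earlier decoding error
    }, 0, 6);

    static int codePoints(String s) {
        return s.codePointCount(0, s.length());
    }

    // Counts the boundaries BreakIterator reports as "characters",
    // which is closer to what a user perceives than code points are.
    static int perceivedCharacters(String s) {
        BreakIterator it = BreakIterator.getCharacterInstance();
        it.setText(s);
        int count = 0;
        while (it.next() != BreakIterator.DONE) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        // Six code points, of which only "x" is a letter.
        System.out.println(codePoints(SAMPLE));

        // "c" followed by a combining cedilla: two code points, one visible character.
        String decomposed = "c\u0327";
        System.out.println(codePoints(decomposed));
        System.out.println(perceivedCharacters(decomposed));

        // NFC normalization folds the pair into the single code point U+00E7.
        String composed = Normalizer.normalize(decomposed, Normalizer.Form.NFC);
        System.out.println(codePoints(composed));
    }
}
```

The same visible text can therefore have different code point counts depending on normalization, so the count alone answers neither "how many characters will the user see" nor "how wide is this on screen".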


Copyright © 2025, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds