So long as the programmer never knows, I don't care.
But in my experience it's surprisingly hard to prevent this abstraction from leaking. Text is really tricky; in fact, one of the main lessons from the Unicode project is that text is way trickier than anyone had really thought before.
For example, what happens with canonicalisation in Python?
(You will not be surprised to know that the answer in C is generally "C does not care about canonicalisation, it's all byte strings to us".)
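To make the Python question concrete, here is a minimal sketch using only the standard library's `unicodedata` module. As far as I know, `str` comparison in Python works code point by code point and does no normalisation on its own, so the two spellings of "é" below compare unequal until you normalise them yourself. The helper name `nfc_equal` is mine, purely for illustration.

```python
import unicodedata

# Two canonically equivalent spellings of "é".
composed = "\u00e9"      # single code point: LATIN SMALL LETTER E WITH ACUTE
decomposed = "e\u0301"   # "e" followed by COMBINING ACUTE ACCENT


def nfc_equal(a: str, b: str) -> bool:
    """Compare two strings after normalising both to NFC (hypothetical helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# Plain comparison sees different code point sequences.
print(composed == decomposed)           # False
print(len(composed), len(decomposed))   # 1 2

# After explicit normalisation the two strings compare equal.
print(nfc_equal(composed, decomposed))  # True
```

So the abstraction leaks here too: the programmer has to know that normalisation exists and has to ask for it explicitly, which is exactly the kind of thing they were supposed to never need to know.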