Git v2.24.1 and others
Posted Dec 12, 2019 13:30 UTC (Thu) by epa (subscriber, #39769)
In reply to: Git v2.24.1 and others by nix
Parent article: Git v2.24.1 and others
Another plug for Fixing Unix/Linux/POSIX Filenames, for those who have not yet read it.
Posted Dec 14, 2019 23:37 UTC (Sat)
by adobriyan (subscriber, #30858)
[Link] (10 responses)
David Wheeler's logic is like this:
We've seen this pattern before:
Real programming languages (say, Python) don't have this problem with whitespace (see subprocess.call()).
Maybe it is the Unix shells that should be fixed?
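To illustrate the subprocess point, here is a minimal sketch, assuming a POSIX system where spaces and newlines are legal in filenames. It uses subprocess.run() (the newer sibling of subprocess.call()) with an argument list, so no shell ever parses the name. The helper name count_bytes_via_child and the sample filename are made up for the example, and shlex.split() is only an approximation of what unquoted shell word splitting would do to the same name.

```python
import os
import shlex
import subprocess
import sys
import tempfile


def count_bytes_via_child(path: str) -> int:
    """Ask a child Python process for the size of `path` (hypothetical helper)."""
    # The arguments are passed as a list, so the path reaches the child
    # as a single argv entry and is never parsed by a shell.
    result = subprocess.run(
        [sys.executable, "-c",
         "import os, sys; print(os.path.getsize(sys.argv[1]))",
         path],
        check=True,
        capture_output=True,
        text=True,
    )
    return int(result.stdout.strip())


with tempfile.TemporaryDirectory() as tmp:
    # Assumption: a POSIX filesystem that accepts spaces and newlines in names.
    awkward = os.path.join(tmp, "two words\nand a newline.txt")
    with open(awkward, "wb") as fh:
        fh.write(b"hello")

    size = count_bytes_via_child(awkward)

    # Roughly what unquoted word splitting would make of the same name.
    shell_view = shlex.split(os.path.basename(awkward))
```

The child sees one argument and reports the right size, while the shell-style split turns the same name into several words. That is the whole difference being argued about.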
Even if whitespace and other characters are banned, where will they stop? Unicode is big; there are 8 newlines.
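For what it's worth, one plausible reading of "8 newlines" is the set of newline functions listed in the Unicode Standard's newline guidelines: seven single characters plus the CR LF sequence. That reading is an assumption, not something stated above. A rough sketch of what a "ban newlines" check would then have to cover (the names UNICODE_NEWLINES and newline_chars_in are invented for illustration):

```python
from typing import List

# Seven single-character newline functions; the eighth is the
# two-character sequence CR LF, which the CR and LF entries already cover.
UNICODE_NEWLINES = {
    "\u000A": "LINE FEED (LF)",
    "\u000B": "VERTICAL TAB (VT)",
    "\u000C": "FORM FEED (FF)",
    "\u000D": "CARRIAGE RETURN (CR)",
    "\u0085": "NEXT LINE (NEL)",
    "\u2028": "LINE SEPARATOR (LS)",
    "\u2029": "PARAGRAPH SEPARATOR (PS)",
}


def newline_chars_in(name: str) -> List[str]:
    """Return the code points (as 'U+XXXX') of newline characters in `name`."""
    return ["U+%04X" % ord(ch) for ch in name if ch in UNICODE_NEWLINES]
```

A filter that only rejects "\n" would let a name containing U+2028 straight through, which is the "where will they stop?" problem in miniature.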
Posted Dec 15, 2019 0:11 UTC (Sun)
by Cyberax (✭ supporter ✭, #52523)
[Link] (9 responses)
Getting filenames to be valid UTF-8 would be an awesome improvement over the status quo.
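As a sketch of what "valid UTF-8" would mean in practice, a filename (as raw bytes) can be tested with a strict decode. The function name is_valid_utf8 is hypothetical; the only thing relied on is Python's built-in strict UTF-8 decoder, which rejects stray bytes, overlong encodings and encoded surrogates.

```python
def is_valid_utf8(name: bytes) -> bool:
    """Return True if `name` decodes as strict UTF-8."""
    try:
        name.decode("utf-8")  # default error handler is "strict"
    except UnicodeDecodeError:
        return False
    return True


# A few sample names, chosen for illustration only.
samples = {
    b"caf\xc3\xa9.txt": is_valid_utf8(b"caf\xc3\xa9.txt"),  # UTF-8 encoded e-acute
    b"caf\xe9.txt": is_valid_utf8(b"caf\xe9.txt"),          # Latin-1 byte, not UTF-8
    b"\xc0\xaf": is_valid_utf8(b"\xc0\xaf"),                # overlong encoding of "/"
}
```

A kernel or filesystem enforcing this would be applying the same test at file creation time, just not in Python.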
Posted Dec 15, 2019 11:43 UTC (Sun)
by adobriyan (subscriber, #30858)
[Link] (1 responses)
Posted Dec 15, 2019 11:45 UTC (Sun)
by Cyberax (✭ supporter ✭, #52523)
[Link]
Posted Dec 16, 2019 0:13 UTC (Mon)
by zlynx (guest, #2285)
[Link] (6 responses)
Otherwise Linux / Unix would have ended up like the others in love with "the future" and we'd be stuck with UCS-2 circa 2002. Which is neither big enough for every character, nor space efficient.
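Both halves of the UCS-2 complaint can be checked in a few lines. This is only a sketch: fits_in_ucs2 is a made-up helper, and UTF-16-LE stands in for UCS-2 when measuring size, since the two are byte-identical for characters inside the Basic Multilingual Plane.

```python
def fits_in_ucs2(text: str) -> bool:
    """True if every character is in the Basic Multilingual Plane (<= U+FFFF)."""
    return all(ord(ch) <= 0xFFFF for ch in text)


ascii_name = "Makefile"
emoji_name = "\U0001F600.txt"  # U+1F600 lies outside the BMP

# Not space efficient: plain ASCII doubles in size with 16-bit code units.
utf8_size = len(ascii_name.encode("utf-8"))
utf16_size = len(ascii_name.encode("utf-16-le"))

# Not big enough: anything above U+FFFF has no UCS-2 representation.
ascii_fits = fits_in_ucs2(ascii_name)
emoji_fits = fits_in_ucs2(emoji_name)
```

UTF-16 patches the second problem with surrogate pairs, at the cost of no longer being fixed width.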
Won't it be fun in the year 2100 when users have to create wild WTF-8 hacks to work around the encoding limitations hard-coded into their virtual storage backend?
Posted Dec 17, 2019 22:02 UTC (Tue)
by flussence (guest, #85566)
[Link] (3 responses)
Posted Dec 18, 2019 12:06 UTC (Wed)
by NAR (subscriber, #1313)
[Link] (2 responses)
Posted Dec 20, 2019 0:33 UTC (Fri)
by flussence (guest, #85566)
[Link] (1 responses)
We're actually running out of things to add to Unicode. New emoji proposals are in short supply and most of the recent additions have been ancient scripts and increasingly obscure precomposed CJK glyphs. Maybe of more relevance to people reading this, Unicode 13 is adding characters from ancient computer systems (Spectrum, Teletext, C64 and the like): https://www.unicode.org/charts/PDF/Unicode-13.0/
Posted Dec 23, 2019 15:13 UTC (Mon)
by geert (subscriber, #98403)
[Link]
Posted Dec 17, 2019 22:26 UTC (Tue)
by Cyberax (✭ supporter ✭, #52523)
[Link]
Pretty much the only case where UTF-8 won't be enough is if the Earth joins the Galactic Federation with FTL communications. But in that case I think that migration off UTF-8 would be a good problem to have.
Posted Dec 18, 2019 12:24 UTC (Wed)
by jezuch (subscriber, #52988)
[Link]
It's like date handling: do not *ever* write your own date handling library. Now that even Java has decent support for it in its standard library, you don't have to be stupid like this anymore ;) And one standard is enough. We've got it solved; let's move on already.
Most shell scripts are buggy because shell authors make it easy to make mistakes, despite knowing perfectly well that Unix allows whitespace in filenames; therefore the OS kernel should accommodate shell users.
The shell can't make system calls; therefore the OS kernel should expose a text interface, which is inferior in nearly every way.
Unicode? What Unicode? Unix file names need not be Unicode in any encoding. They can be arbitrary binary garbage.
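This is why Python carries such names around with the "surrogateescape" error handler: undecodable bytes are smuggled through a str and come back out unchanged. A small sketch with an invented byte string standing in for a filename (nothing is created on disk here):

```python
# A byte string that is a legal Unix filename but not valid UTF-8.
raw_name = b"report-\xff\xfe.txt"

# Lossless round trip: invalid bytes become lone surrogates in the str...
as_text = raw_name.decode("utf-8", errors="surrogateescape")
# ...and are turned back into the original bytes on the way out.
round_tripped = as_text.encode("utf-8", errors="surrogateescape")

# A strict encode of the same str fails, because lone surrogates
# are not encodable as UTF-8.
try:
    as_text.encode("utf-8")
    strict_ok = True
except UnicodeEncodeError:
    strict_ok = False
```

os.fsdecode() and os.fsencode() apply the same handler on POSIX systems, which is how os.listdir() can return str names for files like this at all.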
