Transparent versus opaque formats
Transparent versus opaque formats
Posted Feb 26, 2017 11:35 UTC (Sun) by excors (subscriber, #95769)In reply to: Transparent versus opaque formats by epa
Parent article: Linus on Git and SHA-1
A plain text file can contain lots of zero-width Unicode spaces, soft hyphens, RTL marks, etc, which the end user won't see because they're invisible.
If you constrain it to ASCII text files in a viewer that makes all control characters visible and that highlights trailing whitespace, then maybe that could count as transparent. But that sounds too constrained to be a useful distinction in practice.
Posted Feb 27, 2017 12:07 UTC (Mon)
by epa (subscriber, #39769)
[Link] (1 responses)
For non-code text documents, such as a legal agreement, tightening up the allowed whitespace might be a good idea. You'll never eliminate all possible avenues for fiddling the content, but if you can get the number of variations of a document down from trillions to under a million or so, you've greatly reduced the scope for finding a collision.
Posted Feb 27, 2017 12:18 UTC (Mon)
by johill (subscriber, #25196)
[Link]
Posted Mar 4, 2017 1:57 UTC (Sat)
by zslade (subscriber, #72097)
[Link] (2 responses)
Posted Mar 6, 2017 14:28 UTC (Mon)
by epa (subscriber, #39769)
[Link] (1 responses)
Posted Mar 6, 2017 15:29 UTC (Mon)
by excors (subscriber, #95769)
[Link]
(Also it's not true that names are the only exceptions - in some older versions of the kernel I see staging drivers with Big5-encoded Chinese comments.)
Transparent versus opaque formats
Transparent versus opaque formats
Transparent versus opaque formats
Transparent versus opaque formats
Transparent versus opaque formats