Posted Sep 18, 2012 17:33 UTC (Tue) by dskoll (subscriber, #1630)
[Link]
Following up on myself: The '662 patent talks about using hashes to deduplicate data and was filed in December 2003. Monotone was first released in April 2003 and I'm pretty sure it used SHA1 to identify data.
The '662 patent talks about data deduplication, but surely if git infringes then so would Monotone have, and if Monotone does not then surely git does not?
Time to search for prior art
Posted Sep 18, 2012 17:41 UTC (Tue) by Cyberax (✭ supporter ✭, #52523)
[Link]
Posted Sep 19, 2012 0:17 UTC (Wed) by rknight (subscriber, #26792)
[Link]
If that's all the '662 patent is about is using hashes to deduplicate data then it should be invalid. I know I used this approach with MD5 as far back as October of 2001, and it was suggested by a co-worker who claimed to have done something similar 5 years prior.
Time to search for prior art
Posted Sep 19, 2012 3:24 UTC (Wed) by Cyberax (✭ supporter ✭, #52523)
[Link]
"Fossil" fileserver on top of "venti" filesystem did this in Plan9 in 1993. And the idea itself is even more ancient.
Time to search for prior art
Posted Sep 27, 2012 12:26 UTC (Thu) by njs (guest, #40338)
[Link]
The idea was sufficiently well known for Val Henson (now Val Aurora) to publish a paper arguing against it in May 2003; she cites 6 different earlier systems using it: http://valerieaurora.org/review/hash/node2.html
The oldest appears to be rsync, with Tridge's thesis coming out in 1999, and for de-duplication specifically I'd check the paper on a backup system called "Pastiche" that was formally published in 2002...