Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums
Posted Dec 8, 2011 15:34 UTC (Thu) by lopgok (guest, #43164)In reply to: Improving ext4: bigalloc, inline data, and metadata checksums by pr1268
Parent article: Improving ext4: bigalloc, inline data, and metadata checksums
I wrote a trivial python script to generate a checksum file for each directory's files. If you run it, and it finds a checksum file, it checks that the files in the directory match the checksum file, and if they don't it reports that.
I wrote it when I had a serverworks chipset on my motherboard that corrupted IDE hard drives when DMA was enabled. However, the utility lets me know there is no bit rot in my files.
It can be found at http://jdeifik.com/ , look for 'md5sum a directory tree'. It is GPL3 code. It works independently from the files being checksummed and independently of the file system. I have found flaky disks that passed every other test with this utility.
The other thing that can corrupt files is memory errors. Many new computers do not support ECC memory. If you care about data integrity, you should use ECC memory. Intel has this feature for their server chips (xeons) and AMD has this feature for all ofgf their processors (though not all motherboard makers support it).
It is very cheap insurance.
Posted Dec 8, 2011 16:24 UTC (Thu)
by nix (subscriber, #2304)
[Link] (7 responses)
ECCRAM is worthwhile, but it is not at all cheap once you factor all that in.
Posted Dec 8, 2011 17:47 UTC (Thu)
by tytso (subscriber, #9993)
[Link] (6 responses)
It's like people who balk at spending an extra $200 to mirror their data, or to provide a hot spare for their RAID array. How much would you be willing to spend to get back your data after you discover it's been vaporized? What kind of chances are you willing to take against that eventuality happen?
It will vary depending on each person, but traditional people are terrible and figuring out cost/benefit tradeoffs.
Posted Dec 8, 2011 19:10 UTC (Thu)
by nix (subscriber, #2304)
[Link] (5 responses)
(Also, last time I tried you couldn't buy a desktop with ECCRAM for love nor money. Servers, sure, but not desktops. So of course all my work stays on the server with battery-backed hardware RAID and ECCRAM, and I just have to hope the desktop doesn't corrupt it in transit.)
Posted Dec 9, 2011 0:57 UTC (Fri)
by tytso (subscriber, #9993)
[Link] (2 responses)
I really like how quickly I can build kernels on this machine. :-)
I'll grant it's not "cheap" in absolute terms, but I've always believed that skimping on a craftsman's tools is false economy.
Posted Dec 9, 2011 7:41 UTC (Fri)
by quotemstr (subscriber, #45331)
[Link]
I have the same machine. Oddly enough, it only supports 12GB of non-ECC memory, at least according to Dell's manual. How does that happen?
(Also, Intel's processor datasheet claims that several hundred gigabytes of either ECC or non-ECC memory should be supported using the integrated memory controller. I wonder why Dell's system supports less.)
Posted Dec 9, 2011 12:40 UTC (Fri)
by nix (subscriber, #2304)
[Link]
EDAC support for my Nehalem systems landed in mainline a couple of years ago but I'll admit to never having looked into how to get it to tell me what errors may have been corrected, so I have no idea how frequent they might be.
(And if it didn't mean dealing with Dell I might consider one of those machines myself...)
Posted Dec 9, 2011 13:53 UTC (Fri)
by james (subscriber, #1325)
[Link] (1 responses)
Even ECC memory isn't that much more expensive: Crucial do a 2x2GB ECC kit for £27 + VAT ($42 in the US) against £19 ($30).
Posted Dec 9, 2011 15:19 UTC (Fri)
by lopgok (guest, #43164)
[Link]
If you buy assembled computers and can't get ECC support without spending big bucks, it is time to switch vendors.
It is true that ECC memory is more expensive and less available than non-ECC memory, but the price difference is around 20% or so, and Newegg and others sell a wide variety of ECC memory. Mainstream memory manufacturers, including Kingston sell ECC memory.
Of course, virtually all server computers come with ECC memory.
Improving ext4: bigalloc, inline data, and metadata checksums
It is very cheap insurance.
Look at the price differential between the motherboards and CPUs that support ECCRAM and those that do not. Now add in the extra cost of the RAM.
Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums
Improving ext4: bigalloc, inline data, and metadata checksums