LWN.net Logo

The 2010 Linux Storage and Filesystem Summit, day 2

The 2010 Linux Storage and Filesystem Summit, day 2

Posted Aug 11, 2010 21:07 UTC (Wed) by buchanmilne (guest, #42315)
Parent article: The 2010 Linux Storage and Filesystem Summit, day 2

The cluster filesystems are in a bit of a tough place. Neither has a huge group dedicated to its development, and, as Ric Wheeler pointed out, there just isn't much of a hobbyist community equipped with enterprise-level storage arrays out there.

Enterprise-level storage arrays aren't required to run clustered filesystems. In fact, a single machine (laptop in my case) is sufficient, assuming you can run either Xen or KVM.

I'm not a kernel developer, but I test 'cluster' and 'gfs2' quite often on a test cluster that comprises two VMs that were originally installed under Xen, but now run under KVM, sharing one block device which is used as the GFS2 filesystem.

Unfortunately, virtualbox doesn't allow concurrent use of virtual hard disks by multiple VMs, but that's only a problem if you need OSs KVM doesn't boot.


(Log in to post comments)

The 2010 Linux Storage and Filesystem Summit, day 2

Posted Aug 11, 2010 21:24 UTC (Wed) by sniper (subscriber, #13219) [Link]

This has been addressed in the recently released VirtualBox 3.2.8.

http://www.virtualbox.org/wiki/Changelog

VirtualBox 3.2.8 (released 2010-08-06)

Sharing disks: support for attaching one disk to several VMs without external tools and tricks (see here for a short explanation)

http://www.virtualbox.org/manual/ch05.html#hdimagewrites

The 2010 Linux Storage and Filesystem Summit, day 2

Posted Aug 13, 2010 23:29 UTC (Fri) by giraffedata (subscriber, #1954) [Link]

... I test 'cluster' and 'gfs2' quite often on a test cluster that comprises two VMs ...

As a hobby? The point was that hobbyists don't care about cluster filesystems because they don't have enterprise-level storage arrays and cluster filesystems aren't good for anything else.

The 2010 Linux Storage and Filesystem Summit, day 2

Posted Aug 15, 2010 23:33 UTC (Sun) by tytso (subscriber, #9993) [Link]

Enterprise-level storage arrays aren't required to run clustered filesystems. In fact, a single machine (laptop in my case) is sufficient, assuming you can run either Xen or KVM.

Shared-block cluster file systems (which is what Red Hat's GFS and OCFS2 are) don't provide any redundancy in case of failure; they depend on the hardware being utterly reliable. Which generally means enterprise storage arrays which are FC connected, which can be equally accessed by all all of the nodes in the cluster file system. You don't _have_ to run them on an enterprise storage array, but then if any of the hard drives fail, you're really badly out of luck.

There are other possibilities, such as a network block device connected a server which uses multiple disks via an MD software RAID device, but your nbd server becomes a single point of failure. In theory at least an enterprise storage array has many more redundancies and can survive a controller card or hard drive failure, even though the enterprise storage array itself is still a single point of failure.

There are other types of cluster file systems, such as Google's GFS, Hadoopfs (which is basically a copy of GFS as described in the GFS paper), Lustre, Ceph, etc. These generally use an object-based storage paradigm with the objects replicated across multiple object stores, so you can survive a single disk or a single server biting the dust.

But what was being discussed in the original post was really focused on the two cluster file systems currently in the kernel which are supported by enterprise distributions: GFS2 and OCFS2, which are really very much identical in terms of feature set, scalability, and design at the 10,000 foot level. The fact that we have two of them is largely an accident of history, but it's splitting the amount of resources available hack on shared-block cluster file systems, which since they require rather specialized and expensive equipment in order to use them practically, tends to constrain the size of their developer communities.

-- Ted

The 2010 Linux Storage and Filesystem Summit, day 2

Posted Aug 24, 2010 21:06 UTC (Tue) by jpnp (subscriber, #63341) [Link]

There are other possibilities, such as a network block device connected a server which uses multiple disks via an MD software RAID device, but your nbd server becomes a single point of failure. In theory at least an enterprise storage array has many more redundancies and can survive a controller card or hard drive failure, even though the enterprise storage array itself is still a single point of failure.

Probably the most relevant option outside of enterprise SAN devices is DRBD. It's not mainline yet, but is quite widely used by smaller setups, offers no single point of failure, and provides a good substrate to run clustered file systems such as GFS/OCFS2, at least in 2 node systems.

John

The 2010 Linux Storage and Filesystem Summit, day 2

Posted Aug 24, 2010 21:35 UTC (Tue) by dlang (✭ supporter ✭, #313) [Link]

drbd went into the mainline in 2.6.33

Copyright © 2013, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds