That said, raid arrays that fail when plugging in a specific device are awfully suspect.
Signal problems on the port should be a soft failure, not a hard one. That's the whole
spending lots of money on server hardware. If it was OK for a drive to take down the system,
they could have been running on a $900 box from Walmart. All file server boxes really need is
a ton of RAM. Disk bandwidth needn't enter into it. Didn't I see a report a while back that
kernel.org was, in fact, serving everything out of cache anyway?
All of which, really, just goes down as evidence for my long-held opinion that hardware-level
solutions for reliability never work. Reliability can only be achieved at the software level
via full redundancy.