Pluto bad drives issue


Recommended Posts

We're using these:

For pluto anyways but across all machines it could be a set of 15k.6 or 15k.5 as well. We're looking more reliability than speed so it makes no difference to us.

As far as SAS drives I've seen them fail in production as well. I remember 2 in the last year both 15k RPM drives.

Oh and Titan it's going to get swapped out as well it was on the to-do. It's a tough game to play people want the biggest and the best and SAS cost a lot more over SATA disks. Raptors were a great middle ground but it got to a point where for us there was no cost difference if we just switched the entire fleet. A lot of hosts though still use sata disks and some not any raid at all. They're using the enterprise sata disks typically which cost a lot more but not as much as sas disks still. They have close to the same annual failure rate advertised anyways.

Link to comment
Share on other sites

Notice posted:

Basically 15K SAS over 10K SATA drives is the only change.

Good call. I applaud the move.

Well this is pretty amusing we deployed the replacement Pluto only to discover a failed drive! It died literally after deployment. So much for SAS being more reliable the name Pluto is what's cursed!

And a good thing, too, that you have a sense of humor to carry you through Pluto's unbelievably bad luck.

Link to comment
Share on other sites

The drive gods are angry for us questioning them. Marlin two raptor drives failed and replaced without issue:

Saturn SAS drive failed and replaced:

Interesting error came from the saturn one:

saturn Sense data: Hardware error (TRACK FOLLOWING ERROR). Controller 1, channel 0, SCSI device ID 1, LUN 0, cdb [28 00 0d 3d 44 00 00 02 00 00 00 00], data [70 00 04 00 00 00 00 00 00 00 00 00 09 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00]
Physical drive removed: controller 1, connector 0, device 1, S/N (serial here).
Logical device is degraded: controller 1, logical device 0 ("RAID10-A").

So yes it's true SAS drives do fail! Odd number of failures this week though that's 5 in a weeks span.

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.