SUMMARY: Failed drive in Raidshelf - hint needed

From: Christian Wessely (christian.wessely@uni-graz.at)
Date: Fri Aug 29 2003 - 02:44:32 EDT


hello admin wizards,

a big thanks to all who replied, in order of appearance:
Phil Baldwin (after 22 Minutes !!!), Emil Dragic, Pat OŽBrien, Alan
Rollow (thank you once again) and Fred Serino.

Generally, there was an agreement that the controller chooses the spare
drive that fits best to the defined spare policy, and if there is none
available at the same "column" it will take the next best in sequence.
Most of you suggested not to use the drive any further, there were,
however, reports (thank you, Emil) that drives my go offline due to
thermal problems when the air slits are covered with dust.
I decided to clean the drive (it was really necessary), will define it
as single "jbod" and make a stress test with it in the next weeks.
Meanwhile, I have it replaced with a new one (thanks to the great
support of Compaq/HP Austria, namely ACP in Graz!)

As to diag the reason for the error, hints were to check uerf, DECevent,
WEBES/CA. Since I still run 4.0g on that cluster, i will give that a try
asap.

Thanks again all of you !!!
CW



This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:49:34 EDT