Help with Raid Manager, A1000 and failing drive

From: Alan Aldrich (aaldrich@mathesoninc.com)
Date: Thu Jul 29 2004 - 20:17:23 EDT


Hi all,
I need some sage advice.

I have a Sunfire system running Solaris 8 and a A1000 attached for storage

I got some error messages from the Raid Manager as follows yesterday

Jul 28 01:49:24 egsorc002 raid: [ID 702911 user.error] AEN event
Host=egsorc002 Ctrl=1T030153
55 Dev=c1t5d3
Jul 28 01:49:24 egsorc002 raid: [ID 702911 user.error] AEN event
Host=egsorc002 Ctrl=1T030153
55 Dev=c1t5d3
Jul 28 01:49:24 egsorc002 raid: [ID 702911 user.error] ASC=3F ASCQ=80
FRU=23 LUN=03 LUN Sta
t=02
Jul 28 01:49:24 egsorc002 raid: [ID 702911 user.error] ASC=3F ASCQ=80
FRU=23 LUN=03 LUN Sta
t=02
Jul 28 01:49:24 egsorc002 raid: [ID 702911 user.error]
Sense=7000060000000098000000003F8023
000000000000000000000000000000000000000000000805000000000000000000000000000B
05315430333031353
3353520202020202003010200000302000000000000000000000000000000000000000000000
00001000000000000
0000000000000000000000000000000000000000000000000000000001AAB59B303732383034
2F303234383233000
00000000000
Jul 28 01:49:24 egsorc002 raid: [ID 702911 user.error]
Sense=7000060000000098000000003F8023
000000000000000000000000000000000000000000000805000000000000000000000000000B
05315430333031353
3353520202020202003010200000302000000000000000000000000000000000000000000000
00001000000000000
0000000000000000000000000000000000000000000000000000000001AAB59B303732383034
2F303234383233000
00000000000

And one of the drives now has a 'yellow' indicator instead of green
I assume that means it is a problem?

When I do a drivutil on the array I get this though

# /usr/lib/osa/bin/drivutil -l LH_UltraII_001
 Logical Unit Information for LH_UltraII_001

 LUN Group Device RAID Capacity Status
                     Name Level (MB)

  0 1 c1t5d0 1 34692 Optimal
  1 1 c1t5d1 1 34692 Optimal
  2 1 c1t5d2 1 34692 Optimal
  3 2 c1t5d3 1 34692 Optimal
  4 3 c1t5d4 0 34692 Optimal
  5 4 c1t5d5 0 34692 Optimal

drivutil succeeded!
#

Which would seem to indicate that is is 'ok' for now

My question is this.
I have a 'spare' drive as a backup. Should I go ahead and replace it?
If so, since the drive is in the c1t5d3 device in a RAID1 config, can I just
pull the drive with the yellow LED and it will rebuild automatically?
Or is there something else I need to do first, next, last

I would rather replace it now if it is going to ultimately fail, but I have
an Oracle database on that device so don't know whether I
should shut it down or what before replacing the drive.

Any advice appreciated.

Thanks
alan
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:29:10 EDT