metareplace -e

From: Jordi Vidal (jordivi@wtransnet.net)
Date: Wed Jan 21 2004 - 10:20:05 EST


Hi

SunOS xxx 5.9 Generic_112233-04 sun4u sparc SUNW,Sun-Fire-480R:

Yesterday, one disk of a Solaris 9 SVM (SDS in previous releases) mirror
failed:

Jan 20 20:20:44 xxx scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@5/sd@a,0 (sd25):
Jan 20 20:20:44 xxx SCSI transport failed: reason 'reset': retrying command
Jan 20 20:31:13 xxx scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@5/sd@a,0 (sd25):
Jan 20 20:31:13 xxx Unhandled Sense Key 'Vendor Unique'
Jan 20 20:46:17 xxx md_stripe: [ID 641072 kern.warning] WARNING: md: d62: write error on /dev/dsk/c3t10d0s7
Jan 20 20:46:18 xxx md_mirror: [ID 104909 kern.warning] WARNING: md: d62: /dev/dsk/c3t10d0s7 needs maintenance

I mounted the failed disk on /mnt, touched a file, and unmounted it. It
seemed OK.

I ran "metareplace -e d60 c3t10d0s7" to re-enable the submirror and
resync it, to see whether it would fail again. After 5-10 minutes it did:

Jan 21 15:52:50 xxx md_stripe: [ID 641072 kern.warning] WARNING: md: d62: write error on /dev/dsk/c3t10d0s7
Jan 21 15:52:55 xxx md_mirror: [ID 104909 kern.warning] WARNING: md: d62: /dev/dsk/c3t10d0s7 needs maintenance

There are no other errors in /var/adm/messages (bad blocks or the like).
On other occasions when a disk failed, on another server, the messages
file showed bad-block errors, and "metareplace -e" worked for a while
(some days) before the mirror failed again. (I don't have spare disks,
and in the meantime I prefer a bad mirror to no mirror.)
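
As a rough aid for scanning the messages file, the following is a minimal
sketch that tallies kernel warnings by syslog tag (scsi, md_stripe,
md_mirror), using the first log excerpt above as sample input. The function
name tally_warnings and the regular expression are illustrative assumptions,
not part of any Solaris tool, and the counts only summarize the log; they do
not by themselves say whether the disk or the bus is at fault.

```python
import re
from collections import Counter

# Sample lines copied from the first log excerpt in this message.
SAMPLE = """\
Jan 20 20:20:44 xxx scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@5/sd@a,0 (sd25):
Jan 20 20:20:44 xxx SCSI transport failed: reason 'reset': retrying command
Jan 20 20:31:13 xxx scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@5/sd@a,0 (sd25):
Jan 20 20:31:13 xxx Unhandled Sense Key 'Vendor Unique'
Jan 20 20:46:17 xxx md_stripe: [ID 641072 kern.warning] WARNING: md: d62: write error on /dev/dsk/c3t10d0s7
Jan 20 20:46:18 xxx md_mirror: [ID 104909 kern.warning] WARNING: md: d62: /dev/dsk/c3t10d0s7 needs maintenance
"""

# Assumed line shape: "<month> <day> <time> <host> <tag>: [ID ...] WARNING: <text>"
WARNING_RE = re.compile(
    r"^\w{3}\s+\d+ [\d:]+ \S+ (\w+): \[ID [^\]]*\] WARNING: (.*)$"
)


def tally_warnings(text):
    """Count WARNING lines per syslog tag; continuation lines are skipped."""
    counts = Counter()
    for line in text.splitlines():
        match = WARNING_RE.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    for tag, count in sorted(tally_warnings(SAMPLE).items()):
        print(tag, count)
```

To run it against a real log, pass the contents of /var/adm/messages to
tally_warnings instead of SAMPLE.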

How can I check whether it is a disk problem or a SCSI bus problem?

Jordi

http://www.wtransnet.com
Dpto. Técnico


