SCSI disk errors - how worried should I be?

From: Forrest Houston (fhouston@east.isi.edu)
Date: Tue Dec 07 2004 - 11:17:25 EST


I'm in the process of setting up a "new" machine (it's actually an old
E250 w/ Sol9 [new install] and I'm redesignating w/ a new purpose).
Anyway there are 6 HDs in the system and 6 in an external multipak (all
are 36G except one in the multipak which is an 18G [the old boot drive I'm
keeping around for "reference" right now]). I want to use this system as
such:

Int drives Ext drives
boot & mirror
3 in a RAID5 3 in a RAID5 (mirroring Int drives)
1 stand alone 1 stand alone (mirroring Int drive)

The problem I'm having is the final two spots in the multipak (it seems to
be the actual slots not the drives) are giving me the below errors on
reboot. I would like to be able to setup these slots as hotspares for the
mirrors/RAID (eventually swapping out the 18G for another 36G obviously
;)) but I'm a little worried about these error messages, although the
drives seems to be ok once the machine is up? This machine is actually
under service support so I might be able to get Sun to swap out the
multipak for me. Unfortunately I'm already overdue on this project (you
know how it is ;)) so I'd prefer to avoid that delay if at all possible.

So anyone know what is causing these errors (besides faulty hardware, or
is it just that)? Will using these slots for the hotspares potentially
create a problem for me (if so I can cut the RAID to a simple strip set
and use the extra disk in each of those for the hotspare instead)?

Hopefully that covers all the info that is needed.
Thanks for the help
Forrest

36G drive
scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3,1 (glm1):
      Connected command timeout for Target 13.0
genunix: [ID 408822 kern.info] NOTICE: glm1: fault detected in device; service still available
genunix: [ID 611667 kern.info] NOTICE: glm1: Connected command timeout for Target 13.0
glm: [ID 280919 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6017]
scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3,1 (glm1):
     Target 13 reducing sync. transfer rate
glm: [ID 923092 kern.warning] WARNING: ID[SUNWpd.glm.sync_wide_backoff.6014]
scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3,1 (glm1):
      got SCSI bus reset
genunix: [ID 408822 kern.info] NOTICE: glm1: fault detected in device; service still available
genunix: [ID 611667 kern.info] NOTICE: glm1: got SCSI bus reset
scsi: [ID 193665 kern.info] sd27 at glm1: target d lun 0
genunix: [ID 936769 kern.info] sd27 is /pci@1f,4000/scsi@3,1/sd@d,0
scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3,1 (glm1):
     Cmd (0xf50dd0) dump for Target 14 Lun 0:
scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3,1 (glm1):
             cdb=[ 0x12 0x0 0x0 0x0 0x30 0x0 ]
scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3,1 (glm1):
     pkt_flags=0x808 pkt_statistics=0x60 pkt_state=0x7
scsi: [ID 365881 kern.info] /pci@1f,4000/scsi@3,1 (glm1):
     pkt_scbp=0x0 cmd_flags=0x2860

18G "old boot" drive
scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3,1 (glm1):
     Connected command timeout for Target 14.0
genunix: [ID 408822 kern.info] NOTICE: glm1: fault detected in device; service still available
genunix: [ID 611667 kern.info] NOTICE: glm1: Connected command timeout for Target 14.0
glm: [ID 280919 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6017]
scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3,1 (glm1):
     Target 14 reducing sync. transfer rate
glm: [ID 923092 kern.warning] WARNING: ID[SUNWpd.glm.sync_wide_backoff.6014]
scsi: [ID 107833 kern.warning] WARNING: /pci@1f,4000/scsi@3,1 (glm1):
     got SCSI bus reset
genunix: [ID 408822 kern.info] NOTICE: glm1: fault detected in device; service still available
genunix: [ID 611667 kern.info] NOTICE: glm1: got SCSI bus reset
scsi: [ID 193665 kern.info] sd28 at glm1: target e lun 0
genunix: [ID 936769 kern.info] sd28 is /pci@1f,4000/scsi@3,1/sd@e,0
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:29:50 EDT