Disk drive errors - is the drive dead?

From: Franz Fischer (Franz.Fischer@franz-fischer.de)
Date: Sun Mar 13 2005 - 05:59:54 EST


Hi all,

I was trying to move a COMPAQ DGHS18Y 18GB drive from a retired file server
to my Alpha box, but I see repeated hard errors reported by uerf. Sometimes
the system hangs for a while due to SCSI bus timeouts / resets.

Does this indicate the drive is (almost) dead?

The current setup is an AlphaStation 255 running Tru64 UNIX 4.0G with a narrow
internal SCSI bus; the DGHS18Y is connected via a 50-pin to 80-pin SCA adapter.

Uerf report below.

Thanks in advance for your help.

        \franz

----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 4.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Sun Mar 13 11:49:06 2005
OCCURRED ON SYSTEM moco
SYSTEM ID x0006000D CPU TYPE: DEC 7000
SYSTYPE x00000000

----- UNIT INFORMATION -----

CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0000
                              x0010 LUN x0
                                        TARGET x2

----- CAM STRING -----

ROUTINE NAME cdisk_check_sense

----- CAM STRING -----

                                        Device aborted command - parity error?

----- CAM STRING -----

ERROR TYPE Hard Error Detected

----- CAM STRING -----

DEVICE NAME COMPAQ DGHS18Y 01C0

----- CAM STRING -----

                                        Active CCB at time of error

----- CAM STRING -----

                                        CCB request completed with an error

----- ENT_CCB_SCSIIO -----

*MY ADDR x09F9D580
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x00C4 CAM_REQ_CMP_ERR
                                        SIM QFRZN
                                        AUTOSNS_VALID
PATH ID 0.
TARGET ID 2.
TARGET LUN 0.
CAM FLAGS x00000442
                                        CAM_QUEUE_ENABLE
                                        CAM_DIR_IN
                                        CAM_SIM_QFRZDIS
*PDRV_PTR x09F9D228
*NEXT_CCB x00000000
*REQ_MAP x09F74200
VOID (*CAM_CBFCNP)() x00465210
*DATA_PTR x40039800
DXFER_LEN x00002000
*SENSE_PTR x09F9D250
SENSE_LEN x40
CDB_LEN x0A
SGLIST_CNT x0000
CAM_SCSI_STATUS x0002 SCSI_STAT_CHECK_CONDITION
SENSE_RESID x20
RESID x00000000
CAM_CDB_IO x000000100000B02EF5010028
CAM_TIMEOUT x0000003C
MSGB_LEN x0000
VU_FLAGS x4000
TAG_ACTION x20
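
A note on reading the CCB above: the CAM_CDB_IO value can be unpacked to see
which command failed. The sketch below is only an illustration, and it rests
on two assumptions that are not stated in the report itself: that uerf prints
the CDB bytes last-byte-first (suggested by the trailing 28, the READ(10)
opcode) and that the drive uses 512-byte blocks. The function name
decode_read10 is made up for this example.

```python
def decode_read10(uerf_hex, cdb_len=10):
    """Decode a READ(10) CDB from the hex string shown after CAM_CDB_IO.

    Assumption: uerf prints the CDB last-byte-first, so the string is
    reversed bytewise before parsing. cdb_len comes from CDB_LEN (x0A).
    """
    raw = bytes.fromhex(uerf_hex)[::-1][:cdb_len]
    opcode = raw[0]
    if opcode != 0x28:
        raise ValueError("not a READ(10) CDB: opcode 0x%02X" % opcode)
    # READ(10) layout: bytes 2-5 = logical block address (big-endian),
    # bytes 7-8 = transfer length in blocks (big-endian).
    lba = int.from_bytes(raw[2:6], "big")
    blocks = int.from_bytes(raw[7:9], "big")
    return {"opcode": opcode, "lba": lba, "blocks": blocks}


cdb = decode_read10("000000100000B02EF5010028")
print("READ(10) lba=0x%08X blocks=%d" % (cdb["lba"], cdb["blocks"]))
# Assuming 512-byte blocks, this should match DXFER_LEN (x00002000).
print("bytes requested: 0x%08X" % (cdb["blocks"] * 512))
```

Under those assumptions the failed command is a 16-block read, which agrees
with the DXFER_LEN of x00002000 (8192 bytes) in the same CCB.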

----- CAM STRING -----

                                        Error, exception, or abnormal
                                         _condition

----- CAM STRING -----

                                        ABORTED COMMAND - Target aborted
                                         _command

----- ENT_SENSE_DATA -----

ERROR CODE x0070 CODE x70
SEGMENT x00
SENSE KEY x000B ABORTED CMD
INFO BYTE 3 x00
INFO BYTE 2 x00
INFO BYTE 1 x00
INFO BYTE 0 x00
ADDITION LEN x18
CMD SPECIFIC 3 x00
CMD SPECIFIC 2 x00
CMD SPECIFIC 1 x00
CMD SPECIFIC 0 x00
ASC x1B
ASQ x00
FRU x00
SENSE SPECIFIC x000000
ADDITIONAL SENSE
0000: 05010000 00000000 00000000 00000000 *................*
0010: 00000000 00000000 00000000 00000000 *................*
0020: 00000000 00000000 00000000 00000000 *................*
0030: 7E250000 00005E3C 00000000 00000000 *..%~<^..........*
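
A note on reading the sense data above: the sense key and the ASC / ASQ pair
can be looked up in the standard SCSI tables. The sketch below is only an
illustration; the tables are cut down to a handful of entries relevant to this
kind of report (they are not the full SPC lists), and the function name
describe_sense is made up for this example.

```python
# Partial tables only; the SCSI standards define many more codes.
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
    0xB: "ABORTED COMMAND",
}

ASC_ASCQ = {
    (0x11, 0x00): "UNRECOVERED READ ERROR",
    (0x1B, 0x00): "SYNCHRONOUS DATA TRANSFER ERROR",
    (0x47, 0x00): "SCSI PARITY ERROR",
}


def describe_sense(key, asc, ascq):
    """Return (sense key text, additional sense text) for the given codes."""
    key_text = SENSE_KEYS.get(key & 0x0F, "UNKNOWN SENSE KEY")
    asc_text = ASC_ASCQ.get((asc, ascq), "not in this partial table")
    return key_text, asc_text


# Values taken from the ENT_SENSE_DATA section: SENSE KEY x0B, ASC x1B, ASQ x00.
key_text, asc_text = describe_sense(0x0B, 0x1B, 0x00)
print(key_text + " / " + asc_text)
```

For the values in this report the lookup gives ABORTED COMMAND with
SYNCHRONOUS DATA TRANSFER ERROR, rather than MEDIUM ERROR or an unrecovered
read error. That may point at the bus side (cabling, termination, the SCA
adapter, or transfer negotiation) more than at the media, though it does not
rule out a failing drive.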

--
Franz G. Fischer ------------ Franz dot Fischer at franz-fischer dot de

