SUMMARY: Thousands of Peripheral Device Write Fault in HSG80

From: Dušek Martin (Martin.Dusek@pregis.cz)
Date: Mon Mar 10 2003 - 04:08:45 EST


Hi all,

many thanks to all who responded, especially Rob Leadbeater and Jay Vlack. They mentioned two possible known causes of the problem:

1. (Rob - I quote him):
-----------------------
The disk that you have is a BF03665223. These are 15K RPM 36GB Fujitsu disks with which there are known problems - we had a whole bunch of them which Compaq replaced with Seagate drives, after seeing similar problems.

To quote Compaq support:
"A problem within the HDD build process was found to effect a limited
quantity of BF03665223 disk drives built prior to June, 2002. Peripheral
Device Write Faults (01/03/00) are symptomatic of this problem.
BF03665223 drives experiencing PDWF's should be replaced through the
normal logistics process. Logistics supply has been purged of effected
devices. Drives of this model which are returned through logistics will
be screened for this problem."

The part number that you want to be checking for on any replacement disks sent out is BF03664664.
HP Call Reference is 2362033RE.

Our summary:
Please note once more that the problem effected only "a limited quantity of BF03665223 disk drives built prior to June, 2002" !!! We have BF03665223 disk drives with revision number B008 and B014 and HP claims that these revision numbers are OK and there is no reason to change the disks.

2. (Jay):
---------
There were some specific batches of disk drive that had vibration problems in the past.

3. (Others):
------------
Several of you recommended to upgrade ACS to 8.7. HP hasn't done that but has upgraded to 8.6-11.

======================================================
======================================================
Now, we are happy because we have no more thousands of Peripheral Device Write Fault in HSG80. Why? HP changed:

Both of the batteries in HSG (no effect).
Then the IO module, the EMU module and two cables in the shelf (not as much PDWF, but some were there again).
Then (for the third time during three months) the same disk (it finally failed), the new disk is still the BF03665223 rev. B014.
Then both of the new batteries, because both of them failed(!) during two months.

Some of the steps may helped, because at least for two weeks, we have had no PDWF.

With regards,

Martin

__________________________________________________
Martin Dušek martin.dusek@pregis.cz

-----Original Message-----
From: Dušek Martin
Sent: Wednesday, December 04, 2002 11:32 AM
To: tru64-unix-managers@ornl.gov
Subject: Thousands of Peripheral Device Write Fault in HSG80

Hi all,

for at least two months, we have had the same problem in our EMA12000:

Every hour we get hundreds of "199=Tru64 UNIX CAM Event" and "1197=KGPSA Tru64 UNIX Event" messages from the same disk (disk40000) - it is always a "Information Message Detected (recovered)", but it tells "UNIT ATTENTION - Medium changed or target reset" and "Disk Transfer Error Event" or "Peripheral Device Write Fault". Sometimes a "Low Voltage Condition" message comes from ca. See the ca listing and mail attached below.

Our local HP support changed (one at a time) the disk, both HSG80's, the shelf, both of the shelf power supplies, the firmware (to V86F-11) and the result has been always the same - something is generating many error messages. There has't remained too much to "try to change". HP is happy doing this (so far they didn't escalate the problem, now they finally promised to do this) but our customer with 24x7 operation is very unhappy by these repeating many-hours shutdowns while changing various parts.

Even if this would be not really an error, our ES45 is very busy resolving these messages.

Our EMA configuration is:
A pair of HSG80's in multibus failover mode, software V86F-11
Three split bus shelves
36 GB 15 krpm disks in striped mirror-pairs (mirrored across the shelves)

Because we had problems with another disk40000 in another HSG80 pair, I would like to make "a poll":

Has anybody similar strange problems with disk40000 in EMA12000/MA8000 ?
Does anybody know a cause of these problems? Could it be caused by some vibrations, magnetic field and so on? If so, why are the problems with one disk only?

Best Regards,

Martin Dusek
PREGIS, CZ
martin.dusek@pregis.cz

####################################################

For those who would be so kind and try to find more:

Event: 1063
Description: Tru64 UNIX CAM Event at Dec 4, 2002 10:25:32 AM GMT+01:00 from es45 in file /var/adm/binary.errlog
File: /var/adm/binary.errlog
================================================================================

OS_Type 1 -- Tru64 UNIX
Hardware_Arch 4 -- Alpha
CEH_Vendor_ID 3,564 -- Compaq Computer Corp
Hdwr_Sys_Type 38 -- Titan Corelogic
Logging_CPU 1 -- CPU Logging this Event
CPUs_In_Active_Set 4
Entry_Type 199 -- Tru64 UNIX CAM Event
DSR_Msg_Num 1,978 -- Compaq AlphaServer ES45
                                               .... Model 2/2B
                                               .... CPU Slots: 4 (1000 Mhz)
                                               .... PCI Slots: 10
                                               .... MMB Slots: 8 (DIMMs)
Chip_Type 12 -- EV68CB - 21264C
CEH_Device 0
CEH_Device_ID_0 x0000 0002
CEH_Device_ID_1 x0000 0002
CEH_Device_ID_2 x0000 001F
Unique_ID_Count 1,724
Unique_ID_Prefix 20,376
TLV_DSR_String Compaq AlphaServer ES45 Model 2
TLV_OS_Version Compaq Tru64 UNIX V5.1A (Rev. 1885)
TLV_Sys_Serial_Num AY21007793
TLV_Time_as_Local Dec 4, 2002 10:25:32 AM GMT+01:00
TLV_Computer_Name es45
CAM_hdr_type 199
CAM_hdr_class x00 Disk
CAM_hdr_subsystem x0000 0000
CAM_hdr_entries 11
Start CAM SCSI Subpackets START OF SUBPACKETS IN THIS ENTRY
CAM_ent_type 258 Module Name String
Module_Name_Str cdisk_rec_status
CAM_ent_type 256 Generic String
Generic_String Recovery progress event, this is NOT an error
CAM_ent_type 262 Informational Error String
Info_Error_String Information Message Detected (recovered)
CAM_ent_type 256 Generic String
Generic_String Hardware ID = 256
CAM_ent_type 257 Device Name String
Device_Name DEC HSG80 V86F
CAM_ent_type 256 Generic String
Generic_String Active CCB at time of error
CAM_ent_type 256 Generic String
Generic_String CCB request completed with an error
CAM_ent_type 1
cam_ccb_len 192
camfunc_code x01
cam_status x84
cam_path x02 Logical SCSI Bus #
cam_target x02 Logical Host Target #
cam_lun x1F Logical Unit Number (LUN)
cam_flags x0000 54C0
cam_dxfer_len 0
cam_sense_len 255
cam_cdb_len 6
cam_sglist_cnt 0
cam_scsi_status x02 Check Condition
cam_timeout 20
cam_msg_len 0
cam_vu_flags x0000
cam_tag_action x00
cam_sim_priv0 xFFFF FC01 844D 61C8
cam_sim_priv1 x0000 0000 DEC0 0DEC
cam_sim_priv2 x0000 0000 0000 0000
cam_sim_priv3 x0000 0000 0000 0000
cam_sim_priv4 x0000 0000 0000 0000
cam_sim_priv5 x0000 0000 0000 0000
cam_sim_priv6 x0000 0000 0000 0000
CAM_ent_type 256 Generic String
Generic_String Error, exception, or abnormal condition
CAM_ent_type 256 Generic String
Generic_String UNIT ATTENTION - Medium changed or target reset
CAM_ent_type 768 CAM Sense Data
Sense_byte_0 xF0
   Error Code[6:0] x70
   Valid[7] x1

Sense_Key x06
   Sense_Key[3:0] x6 Unit Attention

Info_bytes x011A 90D0 Usually LBA
Additional_Sense_Len 152 Std Length for HSZ/HSGxx
Cmd_spec_info x0000 0000
ASCQ_ASC x810C
   ASC[7:0] xC
   ASCQ[15:8] x81 Vendor Specific ASCQ

FRU_code x00
Sense_Key_Specfic_Byte0x80
Sense_Key_Specfic_Bytes 0
Total_Num_of_Errs 1
Total_Retry_Cnt 0
ASC_ASCQ_Stack0 x0C81
ASC_ASCQ_Stack1 x0000
Dev_Port 4 Physical Port #
Dev_Target 0 Physical Target #
Dev_LUN 0
HS_Instance_Code x0326 450A
Template_Type x51 Disk Transfer Error Event
Template_Flags x20
Command_Opcode x2A
Sense_Data_Qual x80
Original_CDB0 x00 0000 0000
Original_CDB1 x00 0000 0000
Host_ID x00
Ctrl_Serialnum ZG94416851
Ctlr_Firmware_Rev V86F
LUN_Status x00
Dev_Prod_ID BF03665223
Device_Type x00 Direct Access (Disk)
Dev_Sense_byte_0 xF0
Dev_Segment_Num 0
Dev_Sense_Key x01
Dev_Info_bytes x011A 90D0
Dev_Command_spec_info x0000 0000
DEV_ASCQ_ASC x810C
   ASC[7:0] xC
   ASCQ[15:8] x81

Dev_FRU_code x00
Dev_Sense_Key_Specific_Byte0x80
Dev_Sense_Key_Specific_Bytes 0

Event: 1062
Description: Tru64 UNIX CAM Event at Dec 4, 2002 10:25:32 AM GMT+01:00 from es45 in file /var/adm/binary.errlog
File: /var/adm/binary.errlog
================================================================================

OS_Type 1 -- Tru64 UNIX
Hardware_Arch 4 -- Alpha
CEH_Vendor_ID 3,564 -- Compaq Computer Corp
Hdwr_Sys_Type 38 -- Titan Corelogic
Logging_CPU 2 -- CPU Logging this Event
CPUs_In_Active_Set 4
Entry_Type 199 -- Tru64 UNIX CAM Event
DSR_Msg_Num 1,978 -- Compaq AlphaServer ES45
                                               .... Model 2/2B
                                               .... CPU Slots: 4 (1000 Mhz)
                                               .... PCI Slots: 10
                                               .... MMB Slots: 8 (DIMMs)
Chip_Type 12 -- EV68CB - 21264C
CEH_Device 0
CEH_Device_ID_0 x0000 0007
CEH_Device_ID_1 x0000 0004
CEH_Device_ID_2 x0000 001F
Unique_ID_Count 1,723
Unique_ID_Prefix 20,376
TLV_DSR_String Compaq AlphaServer ES45 Model 2
TLV_OS_Version Compaq Tru64 UNIX V5.1A (Rev. 1885)
TLV_Sys_Serial_Num AY21007793
TLV_Time_as_Local Dec 4, 2002 10:25:32 AM GMT+01:00
TLV_Computer_Name es45
CAM_hdr_type 199
CAM_hdr_class x00 Disk
CAM_hdr_subsystem x0000 0000
CAM_hdr_entries 11
Start CAM SCSI Subpackets START OF SUBPACKETS IN THIS ENTRY
CAM_ent_type 258 Module Name String
Module_Name_Str cdisk_rec_status
CAM_ent_type 256 Generic String
Generic_String Recovery progress event, this is NOT an error
CAM_ent_type 262 Informational Error String
Info_Error_String Information Message Detected (recovered)
CAM_ent_type 256 Generic String
Generic_String Hardware ID = 256
CAM_ent_type 257 Device Name String
Device_Name DEC HSG80 V86F
CAM_ent_type 256 Generic String
Generic_String Active CCB at time of error
CAM_ent_type 256 Generic String
Generic_String CCB request completed with an error
CAM_ent_type 1
cam_ccb_len 192
camfunc_code x01
cam_status x84
cam_path x07 Logical SCSI Bus #
cam_target x04 Logical Host Target #
cam_lun x1F Logical Unit Number (LUN)
cam_flags x0000 54C0
cam_dxfer_len 0
cam_sense_len 255
cam_cdb_len 6
cam_sglist_cnt 0
cam_scsi_status x02 Check Condition
cam_timeout 20
cam_msg_len 0
cam_vu_flags x0000
cam_tag_action x00
cam_sim_priv0 xFFFF FC01 844D 61C8
cam_sim_priv1 x0000 0000 DEC0 0DEC
cam_sim_priv2 x0000 0000 0000 0000
cam_sim_priv3 x0000 0000 0000 0000
cam_sim_priv4 x0000 0000 0000 0000
cam_sim_priv5 x0000 0000 0000 0000
cam_sim_priv6 x0000 0000 0000 0000
CAM_ent_type 256 Generic String
Generic_String Error, exception, or abnormal condition
CAM_ent_type 256 Generic String
Generic_String UNIT ATTENTION - Medium changed or target reset
CAM_ent_type 768 CAM Sense Data
Sense_byte_0 xF0
   Error Code[6:0] x70
   Valid[7] x1

Sense_Key x06
   Sense_Key[3:0] x6 Unit Attention

Info_bytes x011A 90D0 Usually LBA
Additional_Sense_Len 152 Std Length for HSZ/HSGxx
Cmd_spec_info x0000 0000
ASCQ_ASC x810C
   ASC[7:0] xC
   ASCQ[15:8] x81 Vendor Specific ASCQ

FRU_code x00
Sense_Key_Specfic_Byte0x80
Sense_Key_Specfic_Bytes 0
Total_Num_of_Errs 1
Total_Retry_Cnt 0
ASC_ASCQ_Stack0 x0C81
ASC_ASCQ_Stack1 x0000
Dev_Port 4 Physical Port #
Dev_Target 0 Physical Target #
Dev_LUN 0
HS_Instance_Code x0326 450A
Template_Type x51 Disk Transfer Error Event
Template_Flags x20
Command_Opcode x2A
Sense_Data_Qual x80
Original_CDB0 x00 0000 0000
Original_CDB1 x00 0000 0000
Host_ID x00
Ctrl_Serialnum ZG94416851
Ctlr_Firmware_Rev V86F
LUN_Status x00
Dev_Prod_ID BF03665223
Device_Type x00 Direct Access (Disk)
Dev_Sense_byte_0 xF0
Dev_Segment_Num 0
Dev_Sense_Key x01
Dev_Info_bytes x011A 90D0
Dev_Command_spec_info x0000 0000
DEV_ASCQ_ASC x810C
   ASC[7:0] xC
   ASCQ[15:8] x81

Dev_FRU_code x00
Dev_Sense_Key_Specific_Byte0x80
Dev_Sense_Key_Specific_Bytes 0

Event: 1064
Description: Tru64 UNIX CAM Event at Dec 4, 2002 10:25:32 AM GMT+01:00 from es45 in file /var/adm/binary.errlog
File: /var/adm/binary.errlog
================================================================================

OS_Type 1 -- Tru64 UNIX
Hardware_Arch 4 -- Alpha
CEH_Vendor_ID 3,564 -- Compaq Computer Corp
Hdwr_Sys_Type 38 -- Titan Corelogic
Logging_CPU 0 -- CPU Logging this Event
CPUs_In_Active_Set 4
Entry_Type 199 -- Tru64 UNIX CAM Event
DSR_Msg_Num 1,978 -- Compaq AlphaServer ES45
                                               .... Model 2/2B
                                               .... CPU Slots: 4 (1000 Mhz)
                                               .... PCI Slots: 10
                                               .... MMB Slots: 8 (DIMMs)
Chip_Type 12 -- EV68CB - 21264C
CEH_Device 0
CEH_Device_ID_0 x0000 0003
CEH_Device_ID_1 x0000 0004
CEH_Device_ID_2 x0000 001F
Unique_ID_Count 1,725
Unique_ID_Prefix 20,376
TLV_DSR_String Compaq AlphaServer ES45 Model 2
TLV_OS_Version Compaq Tru64 UNIX V5.1A (Rev. 1885)
TLV_Sys_Serial_Num AY21007793
TLV_Time_as_Local Dec 4, 2002 10:25:32 AM GMT+01:00
TLV_Computer_Name es45
CAM_hdr_type 199
CAM_hdr_class x00 Disk
CAM_hdr_subsystem x0000 0000
CAM_hdr_entries 11
Start CAM SCSI Subpackets START OF SUBPACKETS IN THIS ENTRY
CAM_ent_type 258 Module Name String
Module_Name_Str cdisk_rec_status
CAM_ent_type 256 Generic String
Generic_String Recovery progress event, this is NOT an error
CAM_ent_type 262 Informational Error String
Info_Error_String Information Message Detected (recovered)
CAM_ent_type 256 Generic String
Generic_String Hardware ID = 256
CAM_ent_type 257 Device Name String
Device_Name DEC HSG80 V86F
CAM_ent_type 256 Generic String
Generic_String Active CCB at time of error
CAM_ent_type 256 Generic String
Generic_String CCB request completed with an error
CAM_ent_type 1
cam_ccb_len 192
camfunc_code x01
cam_status x84
cam_path x03 Logical SCSI Bus #
cam_target x04 Logical Host Target #
cam_lun x1F Logical Unit Number (LUN)
cam_flags x0000 1442
cam_dxfer_len 18
cam_sense_len 255
cam_cdb_len 6
cam_sglist_cnt 0
cam_scsi_status x02 Check Condition
cam_timeout 20
cam_msg_len 0
cam_vu_flags x0000
cam_tag_action x20
cam_sim_priv0 xFFFF FC00 476D 0708
cam_sim_priv1 x0000 0000 DEC0 0DEC
cam_sim_priv2 x0000 0000 0000 0000
cam_sim_priv3 x0000 0000 0000 0000
cam_sim_priv4 x0000 0000 0000 0000
cam_sim_priv5 x0000 0000 0000 0000
cam_sim_priv6 x0000 0000 0000 0000
CAM_ent_type 256 Generic String
Generic_String Error, exception, or abnormal condition
CAM_ent_type 256 Generic String
Generic_String ILLEGAL REQUEST - Illegal request or CDB parameter
CAM_ent_type 768 CAM Sense Data
Sense_byte_0 x70
   Error Code[6:0] x70
   Valid[7] x0

Sense_Key x05
   Sense_Key[3:0] x5 Illegal Request

Info_bytes x0000 0000 Usually LBA
Additional_Sense_Len 10
Cmd_spec_info0 x0000 0000
ASCQ_ASC x0024 Invalid Field In Cdb
   ASC[7:0] x24
   ASCQ[15:8] x0

FRU_code x00
Sense_Key_Specfic_Byte0x00
Sense_Key_Specfic_Bytes 0
Additional_Sense_Bytes Dump starting at offset: x4be
        [x0] x0000000000000000
        [x8] x0000000000000000
        [x10] x0000000000000000
        [x18] x0000000000000000
        [x20] x0000000000000000
        [x28] x0000000000000000
        [x30] x0000000000000000
        [x38] x0000000000000000
        [x40] x0000000000000000
        [x48] x0000000000000000
        [x50] x0000000000000000
        [x58] x0000000000000000
        [x60] x0000000000000000
        [x68] x0000000000000000
        [x70] x0000000000000000
        [x78] x0000000000000000
        [x80] x0000000000000000
        [x88] x0000000000000000
        [x90] x0000000000000000
        [x98] x0000000000000000
        [xa0] x0000000000000000
        [xa8] x0000000000000000
        [xb0] x0000000000000000
        [xb8] x0000000000000000
        [xc0] x0000000000000000
        [xc8] x0000000000000000
        [xd0] x0000000000000000
        [xd8] x0000000000000000
        [xe0] x0000000000000000
        [xe8] x000000

Event: 1057
Description: Tru64 UNIX CAM Event at Dec 4, 2002 10:24:59 AM GMT+01:00 from es45 in file /var/adm/binary.errlog
File: /var/adm/binary.errlog
================================================================================

OS_Type 1 -- Tru64 UNIX
Hardware_Arch 4 -- Alpha
CEH_Vendor_ID 3,564 -- Compaq Computer Corp
Hdwr_Sys_Type 38 -- Titan Corelogic
Logging_CPU 3 -- CPU Logging this Event
CPUs_In_Active_Set 4
Entry_Type 199 -- Tru64 UNIX CAM Event
DSR_Msg_Num 1,978 -- Compaq AlphaServer ES45
                                               .... Model 2/2B
                                               .... CPU Slots: 4 (1000 Mhz)
                                               .... PCI Slots: 10
                                               .... MMB Slots: 8 (DIMMs)
Chip_Type 12 -- EV68CB - 21264C
CEH_Device 0
CEH_Device_ID_0 x0000 0007
CEH_Device_ID_1 x0000 0004
CEH_Device_ID_2 x0000 001F
Unique_ID_Count 1,718
Unique_ID_Prefix 20,376
TLV_DSR_String Compaq AlphaServer ES45 Model 2
TLV_OS_Version Compaq Tru64 UNIX V5.1A (Rev. 1885)
TLV_Sys_Serial_Num AY21007793
TLV_Time_as_Local Dec 4, 2002 10:24:59 AM GMT+01:00
TLV_Computer_Name es45
CAM_hdr_type 199
CAM_hdr_class x00 Disk
CAM_hdr_subsystem x0000 0000
CAM_hdr_entries 11
Start CAM SCSI Subpackets START OF SUBPACKETS IN THIS ENTRY
CAM_ent_type 258 Module Name String
Module_Name_Str cdisk_rec_status
CAM_ent_type 256 Generic String
Generic_String Recovery progress event, this is NOT an error
CAM_ent_type 262 Informational Error String
Info_Error_String Information Message Detected (recovered)
CAM_ent_type 256 Generic String
Generic_String Hardware ID = 256
CAM_ent_type 257 Device Name String
Device_Name DEC HSG80 V86F
CAM_ent_type 256 Generic String
Generic_String Active CCB at time of error
CAM_ent_type 256 Generic String
Generic_String CCB request completed with an error
CAM_ent_type 1
cam_ccb_len 192
camfunc_code x01
cam_status x84
cam_path x07 Logical SCSI Bus #
cam_target x04 Logical Host Target #
cam_lun x1F Logical Unit Number (LUN)
cam_flags x0000 54C0
cam_dxfer_len 0
cam_sense_len 255
cam_cdb_len 6
cam_sglist_cnt 0
cam_scsi_status x02 Check Condition
cam_timeout 20
cam_msg_len 0
cam_vu_flags x0000
cam_tag_action x00
cam_sim_priv0 xFFFF FC01 0CEF C1C8
cam_sim_priv1 x0000 0000 DEC0 0DEC
cam_sim_priv2 x0000 0000 0000 0000
cam_sim_priv3 x0000 0000 0000 0000
cam_sim_priv4 x0000 0000 0000 0000
cam_sim_priv5 x0000 0000 0000 0000
cam_sim_priv6 x0000 0000 0000 0000
CAM_ent_type 256 Generic String
Generic_String Error, exception, or abnormal condition
CAM_ent_type 256 Generic String
Generic_String UNIT ATTENTION - Medium changed or target reset
CAM_ent_type 768 CAM Sense Data
Sense_byte_0 xF0
   Error Code[6:0] x70
   Valid[7] x1

Sense_Key x06
   Sense_Key[3:0] x6 Unit Attention

Info_bytes x0047 F7C0 Usually LBA
Additional_Sense_Len 152 Std Length for HSZ/HSGxx
Cmd_spec_info x0000 0000
ASCQ_ASC x0003 Peripheral Device Write Fault
   ASC[7:0] x3
   ASCQ[15:8] x0

FRU_code x00
Sense_Key_Specfic_Byte0x80
Sense_Key_Specfic_Bytes 61
Total_Num_of_Errs 1
Total_Retry_Cnt 0
ASC_ASCQ_Stack0 x0300
ASC_ASCQ_Stack1 x0000
Dev_Port 4 Physical Port #
Dev_Target 0 Physical Target #
Dev_LUN 0
HS_Instance_Code x0328 450A
Template_Type x51 Disk Transfer Error Event
Template_Flags x20
Command_Opcode x2A
Sense_Data_Qual x80
Original_CDB0 x00 0000 0000
Original_CDB1 x00 0000 0000
Host_ID x00
Ctrl_Serialnum ZG94416851
Ctlr_Firmware_Rev V86F
LUN_Status x00
Dev_Prod_ID BF03665223
Device_Type x00 Direct Access (Disk)
Dev_Sense_byte_0 xF0
Dev_Segment_Num 0
Dev_Sense_Key x01
Dev_Info_bytes x0047 F7C0
Dev_Command_spec_info x0000 0000
DEV_ASCQ_ASC x0003 Peripheral Device Write Fault
   ASC[7:0] x3
   ASCQ[15:8] x0

Dev_FRU_code x00
Dev_Sense_Key_Specific_Byte0x80
Dev_Sense_Key_Specific_Bytes 61

---------- Problem Found: Low Voltage Condition - ASC x03 ASCQ x00 at Dec 2, 2002 11:40:54 AM GMT+01:00 ----------

Managed Entity:
  Computer Name: es45
  HS Type: HSG80 Serial #: ZG94416851

Service Obligation Data:

   Service Obligation: Valid
   Service Obligation Number: invalid
   System Serial Number: invalid
   Service Provider Company Name: Compaq

Brief Description:
Low Voltage Condition - ASC x03 ASCQ x00

Callout ID:
  FF03000100100107

Severity:
2

Reporting Node:
es45

Full Description:
  This is a drive detected condition that relates to the
   o detection of a marginal voltage condition that occurs
     at the time the device is in process id writing
   o or a servo voltage correction signal that is exceeding
     internal drive designed parameters.
  
  This is typically NOT a drive issue. This is an environmental
  or power problem where insufficient voltage is being supplied
  to the drive. It is possible that the enclosure is transmitting
  mechanical motion to the drive during heavy seek loads. This
  can cause the head/arm assembly to have significant problems
  in maintaining tight track following margins that are
  inherent on write operations versus read operations.
  
  Heavy seek activity with multiple drives, or having a tape
  drive operating (lots of +12VDC needed to drive the tape
  servo) in the same enclosure with low drive input power is
  most likely the problem and needs to be
  investigated/addressed.
  
  Drives of Seagate origin, in particular the RZ28M, RZ25L,
  RZ29B, and RZ28B have been known to log these events more
  often in typical applications.
  
  Consult Blitz TD 1730A and TD 2130-A for more information.
  
                            NOTE:
  For FRU codes of 01, 02, 07, 09, 0A and 0B refer to Blitz TD
  1730 as these are Input Power Problem related codes.
  
  For FRU codes of 00 or 11(hex), you might wish to verify or
  order and install vibration dampening material that would be
  put on an SBB by ordering kit part number 70-32877-01
  from Field Service Logistics. This kit contains enough material
  to modify 7 SBBs. Refer to Blitz TD 2130A (or later rev).
  
  The Reported FRU code that is printed in the event log for
  the specific event will decode as follows:
  
  0x01 Format Write Fault (Position Error)
  0x02 Format Voltage Fault
  0x07 Format Servo Write Fault
  0x09 Verify Servo Write Fault
  0x0A Verify Servo Voltage Fault
  0x0B Voltage Write Fault
  
  FRU Code: x00

FRU List:
  Description: Disk Configuration Issue
  Physical ID : HSG80: Serial #: ZG94416851
                        Port: x04 Target: x00 LUN: x00
  SCSI Inquiry String: BF03665223
  Probability: Medium
                        Power Supply
  Probability: Low
                        Disk Drive

Evidence:
  Last Time Stamp: Mon, 2 Dec 2002 11:40:32 +0100
  Storage KRS Rev: V2.20
  Unique ID: Prefix: 56528 Count: 743
  Sense_Key: x1
  ASC: x03
  ASCQ: x00
  Device Revision: B014
 
Summary of errors for this device.
  Date and Time InfoBytes Key/ASC/Q HS_Instance
  _____________ _________ __________ __________
  Mon, 2 Dec 2002 11:40:32 +0100 x008E5E9C x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:40:32 +0100 x008E5E9C x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:40:32 +0100 x008E5E9C x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:40:32 +0100 x008E5E9C x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:39:27 +0100 x008E5E08 x1/x0C/x81 x0326450A
  Mon, 2 Dec 2002 11:39:27 +0100 x008E5E08 x1/x0C/x81 x0326450A
  Mon, 2 Dec 2002 11:39:27 +0100 x008E5E08 x1/x0C/x81 x0326450A
  Mon, 2 Dec 2002 11:39:27 +0100 x008E5E08 x1/x0C/x81 x0326450A
  Mon, 2 Dec 2002 11:37:08 +0100 x00C1DE50 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:37:08 +0100 x00C1DE50 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:37:08 +0100 x00C1DE50 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:37:08 +0100 x00C1DE50 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:35:16 +0100 x00A30AF9 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:35:16 +0100 x00A30AF9 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:35:16 +0100 x00A30AF9 x1/x03/x00 x0328450A
  Mon, 2 Dec 2002 11:35:16 +0100 x00A30AF9 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 14:05:52 +0100 x00A26297 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 14:05:51 +0100 x00A26297 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 14:05:51 +0100 x00A26297 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 14:05:51 +0100 x00A26297 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 06:56:08 +0100 x00A3C711 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 06:56:07 +0100 x00A3C711 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 06:56:07 +0100 x00A3C711 x1/x03/x00 x0328450A
  Tue, 26 Nov 2002 06:56:07 +0100 x00A3C711 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:31:11 +0100 x010903CE x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:31:10 +0100 x010903CE x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:31:10 +0100 x010903CE x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:31:10 +0100 x010903CE x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:24:53 +0100 x00A832A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:24:52 +0100 x00A832A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:24:52 +0100 x00A832A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 18:24:52 +0100 x00A832A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 15:46:21 +0100 x007B13A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 15:46:21 +0100 x007B13A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 15:46:21 +0100 x007B13A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 15:46:21 +0100 x007B13A0 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:23:29 +0100 x00A46DA4 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:23:29 +0100 x00A46DA4 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:23:29 +0100 x00A46DA4 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:23:28 +0100 x00A46DA4 x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:19:59 +0100 x00A3663C x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:19:59 +0100 x00A3663C x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:19:58 +0100 x00A3663C x1/x03/x00 x0328450A
  Mon, 25 Nov 2002 14:19:58 +0100 x00A3663C x1/x03/x00 x0328450A



This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:49:10 EDT