disappearing A5000

From: Steve Camp (steve@aslan.camp.com)
Date: Thu Mar 27 2003 - 14:04:22 EST


I have an E220R running Solaris 9 with an attached A5000 that just up and
"disappears" after a time. Once that happens, I am no longer able to
communicate with the array itself, although the disks still appear to be
usable, for a time anyway. I am getting ready to run STORTOOLs to see if I
can figure out what is wrong, but could sure use the benefit of other
people's experience, knowledge, and wisdom.

The A5000 is direct-attached to a Sun-purchased FC/100P HBA. This
HBA uses the QLC 2100 chipset, so it is NOT fabric aware. Since
the array is direct-attached, no FC switches are involved.

Any help, suggestions etc appreciated.

--
Steve Camp
Camp Technologies, LLC
steve@camp.com
Problem description:
====================
An A5000 array "disappears" after the system has been running for a time.
Immediately after reboot, the A5000 array, "SASHIMI", appears in
'luxadm probe' output:
    # luxadm probe
    Found Enclosure(s):
    SENA               Name:ahi   Node WWN:508002000000de20   
      Logical Path:/dev/es/ses0
      Logical Path:/dev/es/ses1
    SENA               Name:SASHIMI   Node WWN:5080020000024110   
      Logical Path:/dev/es/ses2
      Logical Path:/dev/es/ses3
However, after a time, the array "disappears" and is no longer listed in
'luxadm probe' output.  When this occurs, 'luxadm display SASHIMI' no
longer works either.
Does anyone have any idea what may be wrong?  I suspect hardware, but
am unsure whether it is the Photon I/B board or the FC/100P HBA in the E220R.
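To narrow down when SASHIMI drops out, the check could be scripted. The following is only a minimal sketch and was not run on this system: the function name `enclosure_present`, the 300-second polling interval, and the log path are illustrative assumptions. It relies only on the `Name:<enclosure>` field visible in the 'luxadm probe' output above, and it reads that output on stdin so it can also be pointed at saved copies.

```shell
#!/bin/sh
# Sketch: succeed if the named enclosure appears in 'luxadm probe' output
# supplied on stdin.  The helper name is hypothetical.
enclosure_present() {
    # $1 = enclosure name, e.g. SASHIMI.  The trailing space keeps
    # "Name:SASHIMI" from matching a longer name such as "Name:SASHIMI2".
    grep "Name:$1 " >/dev/null 2>&1
}

# Hypothetical usage (left commented out so the sketch has no side effects):
# poll every 5 minutes, then record when the enclosure first goes missing.
#
#   while luxadm probe | enclosure_present SASHIMI; do sleep 300; done
#   echo "SASHIMI missing from luxadm probe at `date`" >> /var/tmp/sashimi.log
```

Comparing the logged timestamp with entries in /var/adm/messages around the same time may help separate an HBA-side event from an enclosure-side one, though that is a guess rather than a known diagnostic path.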
Additional Details:
===================
E220R running Solaris 9.  The latest Solaris 9 Recommended Patch Cluster
was installed a few weeks ago, when this problem first manifested itself.
Applying the patches did not appear to solve the problem.
[ prtdiag -v output ]
    # prtdiag -v
    System Configuration:  Sun Microsystems  sun4u Sun Enterprise 220R (2 X UltraSPARC-II 360MHz)
    System clock frequency: 120 MHz
    Memory size: 768 Megabytes
    ========================= CPUs =========================
                        Run   Ecache   CPU    CPU
    Brd  CPU   Module   MHz     MB    Impl.   Mask
    ---  ---  -------  -----  ------  ------  ----
     0     0     0      360     4.0   US-II    2.0
     0     2     2      360     4.0   US-II    2.0
    ========================= IO Cards =========================
         Bus   Freq
    Brd  Type  MHz   Slot        Name                          Model
    ---  ----  ----  ----------  ----------------------------  --------------------
     0   PCI    33     On-Board  network-SUNW,hme                                 
     0   PCI    33     On-Board  scsi-glm/disk (block)         Symbios,53C875     
     0   PCI    33     On-Board  scsi-glm/disk (block)         Symbios,53C875     
     0   PCI    33        PCI 2  SUNW,ifp-pci1077,2100/ssd (b+                    
     0   PCI    33        PCI 3  scsi-glm/disk (block)         Symbios,53C875     
     0   PCI    33        PCI 3  scsi-glm/disk (block)         Symbios,53C875     
     0   PCI    33        PCI 4  pci-pci1011,25.4/pci108e,100+ pci-bridge         
     0   PCI    33        PCI 4  SUNW,qfe-pci108e,1001         SUNW,pci-qfe       
     0   PCI    33        PCI 4  SUNW,qfe-pci108e,1001         SUNW,pci-qfe       
     0   PCI    33        PCI 4  SUNW,qfe-pci108e,1001         SUNW,pci-qfe       
     0   PCI    33        PCI 4  SUNW,qfe-pci108e,1001         SUNW,pci-qfe       
     0   PCI    33      PCI66 1  SUNW,ifp-pci1077,2100.4/ssd +                    
    No failures found in System
    ===========================
    ========================= HW Revisions =========================
    ASIC Revisions:
    ---------------
    PCI: pci Rev 4
    PCI: pci Rev 4
    Cheerio: ebus Rev 1
    System PROM revisions:
    ----------------------
      OBP 3.31.0 2001/07/25 20:31   POST 2.0.3 2000/07/31 15:28
[ 'luxadm probe' output when problem is occurring ]
    # luxadm probe
    Found Enclosure(s):
    SENA               Name:ahi   Node WWN:508002000000de20   
      Logical Path:/dev/es/ses0
      Physical Path:/devices/pci@1f,2000/SUNW,ifp@1/ses@w508002000000de21,0:0
      Logical Path:/dev/es/ses1
      Physical Path:/devices/pci@1f,2000/SUNW,ifp@1/ses@w508002000000de22,0:0
    Found Fibre Channel device(s):
      Node WWN:20000020371383be  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t32d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w21000020371383be,0:c,raw
      Node WWN:2000002037138109  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t33d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w2100002037138109,0:c,raw
      Node WWN:2000002037137ae6  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t34d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w2100002037137ae6,0:c,raw
      Node WWN:20000020371382dc  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t35d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w21000020371382dc,0:c,raw
      Node WWN:200000203713780f  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t36d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w210000203713780f,0:c,raw
      Node WWN:2000002037138105  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t37d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w2100002037138105,0:c,raw
      Node WWN:20000020371380df  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t38d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w21000020371380df,0:c,raw
      Node WWN:20000020371395e9  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t48d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w21000020371395e9,0:c,raw
      Node WWN:200000203713a5dd  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t49d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w210000203713a5dd,0:c,raw
      Node WWN:2000002037134cae  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t50d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w2100002037134cae,0:c,raw
      Node WWN:2000002037139b21  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t51d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w2100002037139b21,0:c,raw
      Node WWN:200000203713a06a  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t52d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w210000203713a06a,0:c,raw
      Node WWN:200000203713aa20  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t53d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w210000203713aa20,0:c,raw
      Node WWN:2000002037138359  Device Type:Disk device
	Logical Path:/dev/rdsk/c5t54d0s2
	Physical Path:
	 /devices/pci@1f,4000/SUNW,ifp@2/ssd@w2100002037138359,0:c,raw
[ FC/100P Firmware ]
    # luxadm qlgc_s_download
      Found Path to 2 FC100/P, ISP2200, ISP23xx Devices
      Opening Device: /devices/pci@1f,4000/SUNW,ifp@2:devctl
****  Detected FCode Version:       No version available for this FCode
                                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                                    ???????????????????????????????????
      Opening Device: /devices/pci@1f,2000/SUNW,ifp@1:devctl
      Detected FCode Version:       FC100/P FC-AL Host Adapter Driver: 1.9 00/03/10
      Complete
Attempts to update the FCode appear to fail; the adapter at pci@1f,4000
still reports no version afterwards:
    # cd /var/tmp/patches/109399-03
    # ls
    README.109399-03   flash-upgrade      ifp2100-1.9R.prom  patchinfo
    # ls -l
    total 142
    -r--r--r--   1 root     sys         4184 Mar 16  2001 README.109399-03
    -r-xr-xr-x   1 root     sys          132 Mar 12  2001 flash-upgrade
    -r--r--r--   1 root     sys        65396 Mar 12  2001 ifp2100-1.9R.prom
    -rw-r--r--   1 root     sys          392 Mar 12  2001 patchinfo
    # ./flash-upgrade
    Warning: Cannot read boot device link, check /etc/mnttab.
    Do not upgrade FCode on adapters controlling the boot device.
      Found Path to 2 FC100/P, ISP2200, ISP23xx Devices
      Opening Device: /devices/pci@1f,4000/SUNW,ifp@2:devctl
      Detected FCode Version:       No version available for this FCode
      New FCode Version:            FC100/P FC-AL Host Adapter Driver: 1.9 00/03/10
    Warning: Installed FCode has a blank or unrecognized version banner.
      Opening Device: /devices/pci@1f,2000/SUNW,ifp@1:devctl
      Detected FCode Version:       FC100/P FC-AL Host Adapter Driver: 1.9 00/03/10
      New FCode Version:            FC100/P FC-AL Host Adapter Driver: 1.9 00/03/10
    WARNING!! This program will update the FCode in this FC100/PCI, ISP2200/PCI device.
    This may take a few (5) minutes. Please be patient.
    Do you wish to continue ? (y/n) y
      Loading FCode: ifp2100-1.9R.prom
      Successful FCode download: /devices/pci@1f,2000/SUNW,ifp@1:devctl
      Complete
      Found Path to 2 FC100/P, ISP2200, ISP23xx Devices
      Opening Device: /devices/pci@1f,4000/SUNW,ifp@2:devctl
***   Detected FCode Version:       No version available for this FCode
      Opening Device: /devices/pci@1f,2000/SUNW,ifp@1:devctl
      Detected FCode Version:       FC100/P FC-AL Host Adapter Driver: 1.9 00/03/10
      Complete
[ Notes regarding 'format' output ]
    'format' does not show the disks consistently.  Sometimes, when "SASHIMI"
    disappears from 'luxadm probe', the Photon disks in SASHIMI still show up
    normally in 'format'; other times they are listed, but only as
    "<drive not available>".  When a disk is listed as "<drive not available>",
    even 'prtvtoc' fails on it.  When a disk shows up in 'format' as a
    "<SUN9.0G cyl 4924 alt 2 hd 27 sec 133>" disk, 'prtvtoc' works on it,
    even if SASHIMI has disappeared from 'luxadm probe'.
[ Patch Notes ]
    According to InfoDoc 43212, "CPRE-NWS A5x00, T3/T3+, E3500 & SSA 
    FCAL Disk Firmware/Patch Matrix Summary Rev. 3.15", the following patches
    are needed for Solaris 9:
        Solaris 9 patches
        Brief Description of Patch    Patch No.          Status
        ----------------------------------------------------------------------
        fctl, fc & fp (leadville)     113040-03          INSTALLED
        fcip (leadville)              113041-02          not installed
        Qlc 22xx (leadville)          113042-03          no QLC 22xx installed
        T3 disc & H/W F/W             109115-12 (c,t)    no T3
        T3+ disc & H/W F/W            112276-06 (c)      no T3
        SE3300/3310 F/W               113722-01          no SE33xx
        luxadm                        113043-02          INSTALLED
        Disc F/W (A5x00)              111535-03(v) (t)   CURRENT
        Multi-Path (STMS)             113039-02          INSTALLED
        cfgadm fp                     113044-02          not installed
        FC100/p FCode(PCI)            109399-03 (c)      unsure, see above
        FC100/s FCode(Sbus)           109400-03 (c)      not S-Bus
        Qlc 2200 Fcode                111853-01 (c,f)    no QLC 22xx
        SVM                           113026-02          * don't believe
                                      113069-04          * this is
                                      113276-01          * an SVM
                                      113282-01          * problem
					
        (c) The patches 103346, 109115, 109399, 109400, 109962 and 111853 do not
            appear in showrev -p.  To see the disc firmware revision levels, you
            may use "luxadm inquiry /dev/rdsk/c[#-#]t*s2 | grep Rev"...
            The FCode revision level for the SOC+ I/O System Board and the
            SOC+HA Sbus cards can be checked by "prtconf -vp | grep FCode"...
            or you can look at the output of the explorer script 2.2.3 or later,
            for both...

        (f) Check the README files for dependencies on other patches... MPxIO/STMS
            requires Solaris 8, 04/01 or above.....and no, it's not planned
            to be integrated into a future Solaris 8 update...The product MPxIO
            has been renamed STMS, StorEdge Traffic Management System... Also,
            patch 111293 (latest) may be required for Solaris 8 and 9.

        (t) This patch should still apply to the Solaris 9 OS even though the
            Readme does not show such support.
        (u) Patches 108528-18 and 111293 (latest) show up as Solaris 8 only
            patches. They may also be required for Solaris 9.
	(v) Patch 111535-03 issued Sept 2002 incorporates patch 109962-07.
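
The "Status" column for the software patches can be re-checked mechanically. What follows is a rough sketch under stated assumptions: the helper name `patch_at_least` is made up for illustration, it assumes 'showrev -p' prints one `Patch: NNNNNN-RR ...` line per installed patch, and it needs an awk that accepts `-v` (nawk or /usr/xpg4/bin/awk on Solaris, not the old /usr/bin/awk). Per note (c), the firmware and FCode patches never show up in 'showrev -p', so this only makes sense for the others.

```shell
#!/bin/sh
# Sketch: succeed if 'showrev -p' output on stdin lists patch BASE at
# revision MINREV or later.  The helper name is hypothetical.
patch_at_least() {
    # $1 = patch base id (e.g. 113040), $2 = minimum revision (e.g. 03)
    awk -v base="$1" -v min="$2" '
        $1 == "Patch:" {
            split($2, p, "-")                 # "113040-03" -> p[1], p[2]
            if (p[1] == base && p[2] + 0 >= min + 0) found = 1
        }
        END { exit (found ? 0 : 1) }
    '
}

# Hypothetical usage (commented out; requires a Solaris host):
#
#   showrev -p > /var/tmp/showrev.out
#   for p in 113040-03 113043-02 113039-02 113041-02 113044-02; do
#       base=`echo $p | cut -d- -f1`; rev=`echo $p | cut -d- -f2`
#       if patch_at_least $base $rev < /var/tmp/showrev.out
#       then echo "$p ok"; else echo "$p missing or older"; fi
#   done
```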
[ DiskInfo from 'explorer' output ]
# more diskinfo
AVAILABLE SCSI DEVICES:
   Location     Vendor          Product         Rev  Serial #
    c0t0d0      FUJITSU    MAG3091L SUN9.0G     1111 0003492729
    c0t1d0      FUJITSU    MAG3091L SUN9.0G     1111 9941447569
    c4t0d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9745M78662
    c4t1d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9911Y52167
    c4t3d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9745M92896
    c4t4d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9745M42737
    c4t5d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9847X75328
   c4t18d0      SEAGATE    ST19171FCSUN9.0G     7F78 9737J82589
   c4t19d0      SEAGATE    ST19171FCSUN9.0G     7F78 9737J82854
   c4t20d0      SEAGATE    ST19171FCSUN9.0G     7F78 9737J81240
   c4t21d0      SEAGATE    ST19171FCSUN9.0G     7F78 9737J67985
   c4t22d0      SEAGATE    ST19171FCSUN9.0G     7F78 9737J81692
   c5t32d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W72557
   c5t33d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W72475
   c5t34d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W70142
   c5t35d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W67282
   c5t36d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W69442
   c5t37d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W72342
   c5t38d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W72105
   c5t48d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W79101
   c5t49d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9829W83388
   c5t50d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W48352
   c5t51d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W80143
   c5t52d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W81876
   c5t53d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9829W84420
   c5t54d0      SEAGATE    ST19171FCSUN9.0G     7F7E 9828W73062
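
The listing above shows two different Seagate firmware revision strings (7F7E and 7F78). Whether that matters here is unknown, but a quick tally makes the split easy to see. This is only a sketch: the function name `rev_tally` is an illustrative assumption, and it assumes the explorer 'diskinfo' layout shown above, where Rev is the second-to-last field (the Product column may or may not contain a space). It needs a POSIX awk (nawk or /usr/xpg4/bin/awk on Solaris).

```shell
#!/bin/sh
# Sketch: count disks per vendor and firmware revision from explorer
# 'diskinfo' output on stdin.  The helper name is hypothetical.
rev_tally() {
    awk '
        # Only device rows (c#t#d#); this skips the header lines.
        $1 ~ /^c[0-9]+t[0-9]+d[0-9]+$/ {
            # Rev is taken from the end of the line because the Product
            # column is one field for the Seagates and two for the Fujitsus.
            n[$2 " " $(NF - 1)]++
        }
        END { for (k in n) print n[k], k }
    ' | sort
}

# Hypothetical usage (commented out): rev_tally < diskinfo
```

Each output line has the form `<count> <vendor> <rev>`.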

