SAN - HDS - JNI HBA - SCSI tran_err

From: ron.gulls@talk21.com
Date: Tue Nov 12 2002 - 13:00:56 EST


Hello,

We have 2 HDS 7700Es connected to our SAN via Brocade switches. Recently,
whenever we perform a zone
reconfiguration a number of servers (about 11) tend to loose the LUNS (disks)
from the HDS. The only way to recover is to reboot the servers. After the last
two attempts of rezoning, I traced the servers and realise all of them are
getting the LUNS from HDS Port 2N. Currently HDS are analysing the dumps from
the HDS 7700e to find the
cause of this.

The servers are running Solaris 2.6/7, various versions of VM/VXFS, JNI FC64
1063 SBUS card / JNI FCE 6410 PCI card.

During the last zone reconfiguration even after rebooting the servers they
didn't see the LUNS, Only way to
overcome this problem was to load the lun manager and save the configuration
without changing anything.

Has anybody else come across this type of problems? If yes what was the
resolution?

I heard SANs connected to EMC also show these type of connectivity issues time
to time where LUNS disappear.

===============================

from the messages file:

Nov 7 10:09:59 unix: fcaw0: Target 0: Port 021900 (WWN 500060e801c35e1c)
offline.
Nov 7 10:10:01 unix: fcaw0: Target 0: Port 021900 (WWN 500060e801c35e1c)
online.
Nov 7 10:10:01 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,41 (sd15):
Nov 7 10:10:01 SCSI transport failed: reason 'tran_err': retrying command
Nov 7 10:12:06 unix: fcaw0: Target 0 Lun 65: Resetting...
Nov 7 10:12:09 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,41 (sd15):
Nov 7 10:12:09 SCSI transport failed: reason 'timeout': retrying command
Nov 7 10:12:09 unix: fcaw0: Target 0 Lun 65: Resetting...
Nov 7 10:12:33 last message repeated 8 times
Nov 7 10:12:36 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,41 (sd15):
Nov 7 10:12:36 SCSI transport failed: reason 'reset': retrying command
Nov 7 10:13:41 unix: fcaw0: Target 0 Lun 65: Resetting...
Nov 7 10:13:44 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,41 (sd15):
Nov 7 10:13:44 SCSI transport failed: reason 'timeout': retrying command
Nov 7 10:13:44 unix: fcaw0: Target 0 Lun 65: Resetting...
Nov 7 10:14:42 last message repeated 19 times
Nov 7 10:14:45 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,42 (sd16):
Nov 7 10:14:45 SCSI transport failed: reason 'reset': retrying command
Nov 7 10:14:45 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,41 (sd15):
Nov 7 10:14:45 SCSI transport failed: reason 'reset': retrying command
Nov 7 10:15:48 unix: fcaw0: Target 0 Lun 65: Resetting...
Nov 7 10:15:51 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,41 (sd15):
Nov 7 10:15:51 SCSI transport failed: reason 'timeout': retrying command
Nov 7 10:15:51 unix: fcaw0: Target 0 Lun 65: Resetting...
Nov 7 10:19:58 last message repeated 81 times
Nov 7 10:20:02 unix: fcaw0: Target 0 Lun 66: Resetting...
Nov 7 10:20:05 unix: WARNING: /sbus@69,0/fcaw@1,0/sd@0,42 (sd16):

--------------------
talk21 your FREE portable and private address on the net at
http://www.talk21.com
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:25:16 EDT