FCAL Link Down

From: DAUBIGNE Sebastien - BOR (SDaubigne@bordeaux-bersol.sema.slb.com)
Date: Mon May 05 2003 - 08:08:36 EDT


Our server (E6500) randomly (less than once a month) catches link errors on
one of its FCAL links (JNI FC64-1063) to an IBM array (ESS 2105-800). The link
goes offline for about one second, then resets and comes back up.
This does not cause any downtime, as the link is multipathed with Veritas
DMP, so the errors are transparent.

But I'm curious whether this type of error is usual for FCAL or whether it is
a consequence of a bug, misconfiguration, or malfunction.

Here is the syslog trace. I will summarise.

May 1 03:56:58 iris unix: fcaw5: Watchdog: Hang Detected (OCQ). Resetting...
May 1 03:56:58 iris unix: fcaw5: LINK DOWN
May 1 03:56:58 iris unix: fcaw5: Target 0: Port 0000ef (5005076300c09a1e:5005076300c49a1e) offline.
May 1 03:56:58 iris unix: WARNING: /sbus@a,0/fcaw@1,0/sd@0,8 (sd1261):
May 1 03:56:58 iris unix: SCSI transport failed: reason 'tran_err': retrying command
May 1 03:56:58 iris unix: WARNING: /sbus@a,0/fcaw@1,0/sd@0,b (sd1222):
May 1 03:56:58 iris unix: SCSI transport failed: reason 'tran_err': retrying command
May 1 03:56:58 iris unix: NOTICE: fcaw5 LOOP Initialization Complete, AL_PA=01
May 1 03:56:58 iris unix: fcaw5: LINK UP (98000200)
May 1 03:56:58 iris unix: fcaw5: Host: Port 000001 (100000e06940747a:200000e0694074c9)
May 1 03:56:59 iris unix: fcaw5: Target 0: Port 0000ef (5005076300c09a1e:5005076300c49a1e) online.
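
To check how long each of these events really lasts, and how often they occur,
the trace can be scanned with a small script. The following is only a rough
sketch, not a tested tool: the function name link_outages, the regular
expression, and the year parameter are my own assumptions (syslog lines carry
no year), and it expects each message on a single unwrapped line, as it would
appear in the log file itself rather than in this mail. It measures the time
from LINK DOWN until a target is reported online again on the same adapter.

```python
import re
from datetime import datetime

# Matches driver lines such as "May 1 03:56:58 iris unix: fcaw5: LINK DOWN".
# The pattern is an assumption based only on the trace in this message.
LINE_RE = re.compile(
    r"^(?P<ts>[A-Z][a-z]{2}\s+\d{1,2} \d{2}:\d{2}:\d{2}) \S+ unix: "
    r"(?:NOTICE: )?(?P<dev>fcaw\d+):? (?P<msg>.*)$"
)

def link_outages(lines, year=2003):
    """Return (device, down_time, online_time, seconds) per outage.

    An outage starts at "LINK DOWN" and ends at the next message on the
    same device that ends with "online.".
    """
    down_at = {}   # device -> timestamp of the pending LINK DOWN
    outages = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # e.g. the sd WARNING lines, which name no fcaw device
        # Syslog timestamps have no year, so one is supplied by the caller.
        ts = datetime.strptime(f"{year} {m['ts']}", "%Y %b %d %H:%M:%S")
        dev, msg = m["dev"], m["msg"]
        if msg.startswith("LINK DOWN"):
            down_at.setdefault(dev, ts)
        elif msg.endswith("online.") and dev in down_at:
            start = down_at.pop(dev)
            outages.append((dev, start, ts, (ts - start).total_seconds()))
    return outages
```

Fed the lines of a messages file (for example via open(path) on whatever file
syslog writes to on the host), it returns one tuple per outage. Syslog
timestamps only have one-second resolution, so the reported durations are
approximate.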

---
Sebastien DAUBIGNE 
sdaubigne@bordeaux-bersol.sema.slb.com - (+33)5.57.26.56.36
SchlumbergerSema - SGS/DWH/Pessac
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers

