SCSI Bus Transition

From: Pablo Jejcic (pablo.jejcic@gmail.com)
Date: Sun Dec 02 2007 - 15:07:02 EST


Hi Gurus,
I need some confirmation... I'm troubleshooting a remote server:

SunFire V240 + D2 JB.

We have 2 RAID 5 configured on the server.

Everything was working fine until we moved the box to the server room into a
controlled environment... now 2-3 times a week, we get the following set of
errors:

WARNING: /pci@1d,700000/pci@1/scsi@4 (qus0):
        SCSI Bus Transition
WARNING: /pci@1d,700000/pci@1/scsi@4 (qus0):
        Received unexpected SCSI Reset

Then a few hours after we get the warnings, we loose the disks, the RAIDs,
everything on the external array....

My guesses here:
1- Problem with the termination of the SCSI chain - the D2 have automatic
terminators, but I'm guessing some problem with them can be causing this.
2- Problems with the SCSI cables, some of the pins, or something is wrong,
and they might be a bit loose, with the vibration from the storoage array
the pop out, and we start getting the issues.
3- SCSI controller issues... but I don't understand how this could be the
cause as the errors should be more frequent, or they should be there all the
time.
4- a couple fo the disks on one of the pictures I got of the array look
that they don't have the "cover" on (the tray to slot them into the
array)... so they might be moving... but why all the other ones go off?

The server is in a very humid environment, but we moved it into the data
centre because we thought that the A/C will help to reduce the problem...
but it just made it worse.

Any comments, ideas, suggestions are very welcome

Thanks a lot in advance,

Pablo.-.
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:42:34 EDT