E4500 mystery crash

From: JESSE CARROLL (jesse-carroll@usa.net)
Date: Thu Apr 08 2004 - 11:14:14 EDT


Several times in the past six or so weeks, one of our E4500s has either hung,
requiring a power off/on, or suddenly rebooted. The only indication we get is
the following console messages:

TL=0000.0000.0000.0005 TT=0000.0000.0000.0068
   TPC=0000.0000.f000.3014
TL=0000.0000.0000.0004 TT=0000.0000.0000.0034
   TPC=0000.0000.1000.8574
TL=0000.0000.0000.0003 TT=0000.0000.0000.0068
   TPC=0000.0000.f000.3014
TL=0000.0000.0000.0002 TT=0000.0000.0000.0030
   TPC=0000.0000.1000.8574
TL=0000.0000.0000.0001 TT=0000.0000.0000.0068
   TPC=0000.0000.60fe.44d0

Software Power ON
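
For anyone comparing dumps across several crashes, the trap-stack lines above
can be turned into numbers with a few lines of Python. This is only a sketch:
parse_trap_dump and TRAP_NAMES are hypothetical names, and the trap-type names
in the table are assumptions recalled from the SPARC V9 / UltraSPARC manuals
that should be checked against the manual for the CPU in question.

```python
import re

# ASSUMPTION: trap-type names recalled from SPARC V9 / UltraSPARC manuals.
# Verify against the manual for the installed CPU before relying on them.
TRAP_NAMES = {
    0x030: "data_access_exception",
    0x034: "mem_address_not_aligned",
    0x068: "fast_data_access_MMU_miss",
}

# Matches fields such as TL=0000.0000.0000.0005 or TPC=0000.0000.f000.3014
FIELD = re.compile(r"\b(TL|TT|TPC)=([0-9a-fA-F.]+)")


def parse_trap_dump(text):
    """Return one dict per trap level with TL, TT and TPC as integers."""
    entries = []
    current = {}
    for name, value in FIELD.findall(text):
        # Drop the dots that group the 64-bit value, then parse as hex.
        current[name] = int(value.replace(".", ""), 16)
        if name == "TPC":  # TPC is the last field of each entry in this dump
            entries.append(current)
            current = {}
    return entries


DUMP = """\
TL=0000.0000.0000.0005 TT=0000.0000.0000.0068
   TPC=0000.0000.f000.3014
TL=0000.0000.0000.0004 TT=0000.0000.0000.0034
   TPC=0000.0000.1000.8574
TL=0000.0000.0000.0003 TT=0000.0000.0000.0068
   TPC=0000.0000.f000.3014
TL=0000.0000.0000.0002 TT=0000.0000.0000.0030
   TPC=0000.0000.1000.8574
TL=0000.0000.0000.0001 TT=0000.0000.0000.0068
   TPC=0000.0000.60fe.44d0
"""

for entry in parse_trap_dump(DUMP):
    name = TRAP_NAMES.get(entry["TT"], "unknown")
    print(f"TL={entry['TL']} TT={entry['TT']:#05x} ({name}) TPC={entry['TPC']:#x}")
```

Keeping the parsed entries from each crash makes it easy to see whether the
same TPC values recur, which is the kind of detail a support engineer is
likely to ask for.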

The OS is Solaris 2.6 with Sun Cluster 2.2 and Resource Manager. The other
node in the cluster is running just fine, so I'm inclined to say it is a
hardware issue rather than software, but I'm not completely ruling that out
either. Our suspicion is a CPU, but without any further information Sun won't
venture a real guess. As this is a mission-critical (actual dollars earned)
24x7 system, we cannot afford an extensive outage for hardware testing for
another 1 = weeks. We are implementing a contingency in case the beast dies
again, but I'd prefer to fix the current system. Has anyone seen similar
symptoms? If so, was there a viable solution?

JC