Sun E420R CPU Error

From: Piszcz, Justin (jpiszcz@servervault.com)
Date: Tue Dec 20 2005 - 10:27:12 EST


Hello,

I have a Sun E420R and I got this in my dmesg this morning:

Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 368043 kern.info] NOTICE:
[AFT2] errID 0x0006aade.83bf1e40 CBB event on CPU1
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 231182 kern.info] [AFT2]
errID 0x0006aade.83bf1e40 PA=0x00000000.003237c0
Dec 20 03:10:32 box E$tag 0x00000000.0e400006 E$State: Shared
E$parity 0x07
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x00): 0x000018af.00010000
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2]
E$Data (0x08): 0x00000000.08000000 *Bad* PSYND=0x0008
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x10): 0x0000f1aa.000000b4
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x18): 0x00000000.012bdc8f
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x20): 0xffffffff.00000000
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x28): 0x00000000.ffffffff
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x30): 0x000041c0.00000000
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x38): 0x4254237b.43151997
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 812997 kern.info] NOTICE:
[AFT2] Orphan CP event on CPU1, errID 0x0006aade.88db4893
Dec 20 03:10:32 box AFSR 0x00000000.00000000 AFAR
0xffffffff.ffffffff
Dec 20 03:10:32 box AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Dec 20 03:10:32 box UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND 0x00
Dec 20 03:10:32 box SUNW,UltraSPARC-II: [ID 770101 kern.info] NOTICE:
[AFT2] No error found in ecache (No fault PA available CPU1, errID
0x0006aade.88db4893
Dec 20 03:10:32 box AFSR 0x00000000.00000000 AFAR
0xffffffff.ffffffff
Dec 20 03:10:32 box AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Dec 20 03:10:32 box UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND 0x00
Anyone have any clue if this is serious? The box currently has 4 CPUs
and this is occurring on one of them.

                    Run Ecache CPU CPU
Brd CPU Module MHz MB Impl. Mask
--- --- ------- ----- ------ ------ ----
 0 0 0 450 4.0 US-II 10.0
 0 1 1 450 4.0 US-II 10.0
 0 2 2 450 4.0 US-II 10.0
 0 3 3 450 4.0 US-II 10.0

Should I wait to see if the errors will persist or should I replace the
CPU as soon as possible?

Justin.
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:37:56 EDT