ecache and memory errors on ultra60

From: RJ45 (rj45@slacknet.com)
Date: Sat Jun 21 2003 - 12:48:40 EDT


Hello,
I got two errors, occurring four months apart.
From the log file they do not look correlated: one is related to the cache and
one to memory. I report the errors here and ask for confirmation
that they are not correlated with each other:

Jun 21 07:17:16 venus SUNW,UltraSPARC-II: [ID 637375 kern.info] [AFT0]
Corrected Memory Error detected by CPU0, errID 0x00116acb.a0c96945
Jun 21 07:17:16 venus AFSR 0x00000000.00100000<CE> AFAR
0x00000000.2e924788
Jun 21 07:17:16 venus AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x1024f2c
Jun 21 07:17:16 venus UDBL Syndrome 0x8f Memory Module U0802
Jun 21 07:17:16 venus SUNW,UltraSPARC-II: [ID 598210 kern.info] [AFT0]
errID 0x00116acb.a0c96945 Corrected Memory Error on U0802 is Persistent
Jun 21 07:17:16 venus SUNW,UltraSPARC-II: [ID 756058 kern.info] [AFT0]
errID 0x00116acb.a0c96945 ECC Data Bit 43 was in error and corrected
Jun 21 10:04:10 venus hme: [ID 786680 kern.notice] SUNW,hme0 : No response
from Ethernet network : Link down -- cable problem?
Jun 21 10:09:30 venus last message repeated 5 times
Jun 21 10:10:34 venus hme: [ID 786680 kern.notice] SUNW,hme0 : No response
from Ethernet network : Link down -- cable problem?
Jun 21 10:16:58 venus last message repeated 22 times

This is the first error.
I hope it was a cosmic ray; it is the first time I have seen this kind of error
in one year with the Ultra 60.
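
For anyone reading the register dumps, here is a minimal Python sketch that
turns the "high.low" hex values printed for AFSR and AFAR into plain integers
and lists which bits are set. It assumes the two halves around the dot are the
upper and lower 32 bits of one 64-bit value, which is only inferred from how
the log prints them. The function names are made up for illustration. The
meaning of any bit is not decoded here; the only hint used is the <CE> tag the
kernel itself printed next to the AFSR value.

```python
def parse_split_hex(text):
    """Turn '0x00000000.00100000' (high word, dot, low word) into one integer.

    Assumption: the part before the dot is the upper 32 bits.
    """
    high, low = text.split(".")
    return (int(high, 16) << 32) | int(low, 16)


def set_bits(value):
    """Return the positions of the bits that are 1, lowest first."""
    return [bit for bit in range(64) if (value >> bit) & 1]


if __name__ == "__main__":
    afsr = parse_split_hex("0x00000000.00100000")  # AFSR from the Jun 21 event
    afar = parse_split_hex("0x00000000.2e924788")  # AFAR from the Jun 21 event
    print("AFSR bits set:", set_bits(afsr))
    print("AFAR as one address:", hex(afar))
```

For the Jun 21 event a single AFSR bit is set, the one the kernel labels <CE>.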

The second error follows. It happened four months ago, but it looks like
completely different stuff to me, and it has never repeated since it
first occurred. To test the CPUs heavily I have been running a
CPU-intensive distributed.net client on both CPUs for the past four months,
and the error has not occurred again.

Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 160129 kern.info]
NOTICE: [AFT2] errID 0x0001c1fe.1385ab88 CBI event on CPU0
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 172281 kern.info] [AFT2]
errID 0x0001c1fe.1385ab88 PA=0x00000000.0020c500
Feb 21 12:11:29 venus E$tag 0x00000000.0c400004 E$State: Shared
E$parity 0x06
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2]
E$Data (0x00): 0x00000000.00008000 *Bad* PSYND=0x0200
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x08): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x10): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x18): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x20): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x28): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x30): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data (0x38): 0x00000000.00000000
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 968459 kern.info]
NOTICE: [AFT2] Orphan CP event on CPU0, errID 0x0001c1fe.18aab0f1
Feb 21 12:11:29 venus AFSR 0x00000000.00000000 AFAR
0xffffffff.ffffffff
Feb 21 12:11:29 venus AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Feb 21 12:11:29 venus UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND 0x00
Feb 21 12:11:29 venus SUNW,UltraSPARC-II: [ID 925563 kern.info]
NOTICE: [AFT2] No error found in ecache (No fault PA available CPU0, errID 0x0001c1fe.18aab0f1
Feb 21 12:11:29 venus AFSR 0x00000000.00000000 AFAR
0xffffffff.ffffffff
Feb 21 12:11:29 venus AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Feb 21 12:11:29 venus UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND 0x00
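
To compare the two incidents mechanically, a small Python sketch like the one
below could group the log records by errID, so it is easy to see which lines
belong to the same event. It is only a rough helper under stated assumptions:
a record is assumed to start with a syslog timestamp such as "Jun 21 07:17:16",
any other line is treated as a wrapped continuation, and continuations are
glued on without a space so that an errID split across lines is restored. The
function names are hypothetical.

```python
import re
from collections import defaultdict

# Assumption: every real record begins with a timestamp like "Jun 21 07:17:16 ".
RECORD_START = re.compile(r"^[A-Z][a-z]{2} [ \d]\d \d{2}:\d{2}:\d{2} ")
ERR_ID = re.compile(r"errID (0x[0-9a-f]{8}\.[0-9a-f]{8})")


def join_wrapped(lines):
    """Glue continuation lines back onto the record they belong to."""
    records = []
    for line in lines:
        line = line.rstrip("\n")
        if RECORD_START.match(line) or not records:
            records.append(line)
        else:
            # No space is inserted, so tokens split mid-word are restored.
            records[-1] += line
    return records


def group_by_errid(lines):
    """Map each errID to the timestamps of the records that mention it."""
    groups = defaultdict(list)
    for record in join_wrapped(lines):
        match = ERR_ID.search(record)
        if match:
            groups[match.group(1)].append(record[:15])
    return dict(groups)


if __name__ == "__main__":
    import sys
    # Usage: python group_errid.py < /var/adm/messages
    for err_id, stamps in group_by_errid(sys.stdin).items():
        print(err_id, len(stamps), "record(s), first at", stamps[0])
```

Records without an errID, such as the hme link messages, are simply skipped.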

Thank you very much,

Rick
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers


