4500 Crash - CPU or Memory

From: Rebstock, Roland (Roland.Rebstock@usi.net)
Date: Thu Dec 12 2002 - 06:14:35 EST


Can someone tell me what happened here? This is an E4500, Solaris 7 Rev
18, 8 GB RAM, 8 x 400 MHz CPUs with 8 MB cache each.

Dec 12 03:19:48 msuawi11 unix: WARNING: [AFT1] WP event on CPU5, errID
0x0007a79f.9d1d57b0

Dec 12 03:19:48 msuawi11 AFSR 0x00000000.00800080<WP> AFAR
0x00000187.dbff77d0

Dec 12 03:19:48 msuawi11 AFSR.PSYND 0x0080(Score 95) AFSR.ETS 0x00
Fault_PC 0xff31b1c0

Dec 12 03:19:48 msuawi11 UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND 0x00

Dec 12 03:20:03 msuawi11 unix: WARNING: [AFT1] Uncorrectable Memory
Error on CPU4 Data access at TL=0, errID 0x0007a7a3.145c3f81

Dec 12 03:20:03 msuawi11 AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000001.833131f8

Dec 12 03:20:03 msuawi11 AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x100276ac

Dec 12 03:20:03 msuawi11 UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE>
UDBL.ESYND 0x03

Dec 12 03:20:03 msuawi11 UDBL Syndrome 0x3 Memory Module Board 6 J3101
J3201 J3301 J3401 J3501 J3601 J3701 J3801

Dec 12 03:20:03 msuawi11 unix: WARNING: [AFT1] errID 0x0007a7a3.145c3f81
Syndrome 0x3 indicates that this may not be a memory module problem

Dec 12 03:20:03 msuawi11 unix: [AFT2] errID 0x0007a7a3.145c3f81
PA=0x00000001.833131f8

Dec 12 03:20:03 msuawi11 E$tag 0x00000000.1ec03066 E$State: Exclusive
E$parity 0x0f

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x00): 0x2e2e6b65.66696c65

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x08): 0x65786563.6c2e616c

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x10): 0x2e2e7269.70747361

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x18): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x20): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x28): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x30): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x38): 0x10000000.00000000
*Bad* PSYND=0x00ff

Dec 12 03:20:03 msuawi11 unix: NOTICE: Scheduling clearing of error on
page 0x00000001.83312000

Dec 12 03:20:03 msuawi11 unix: [AFT3] errID 0x0007a7a3.145c3f81 Above
Error detected by protected Kernel code

Dec 12 03:20:03 msuawi11 that will try to clear error from system

Dec 12 03:20:03 msuawi11 unix: WARNING: [AFT1] Uncorrectable Memory
Error on CPU4 Data access at TL=0, errID 0x0007a7a3.175c6352

Dec 12 03:20:03 msuawi11 AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000001.833131f8

Dec 12 03:20:03 msuawi11 AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x100276ac

Dec 12 03:20:03 msuawi11 UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE>
UDBL.ESYND 0x03

Dec 12 03:20:03 msuawi11 UDBL Syndrome 0x3 Memory Module Board 6 J3101
J3201 J3301 J3401 J3501 J3601 J3701 J3801

Dec 12 03:20:03 msuawi11 unix: WARNING: [AFT1] errID 0x0007a7a3.175c6352
Syndrome 0x3 indicates that this may not be a memory module problem

Dec 12 03:20:03 msuawi11 unix: [AFT2] errID 0x0007a7a3.175c6352
PA=0x00000001.833131f8

Dec 12 03:20:03 msuawi11 E$tag 0x00000000.1ec03066 E$State: Exclusive
E$parity 0x0f

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x00): 0x2e2e6b65.66696c65

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x08): 0x65786563.6c2e616c

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x10): 0x2e2e7269.70747361

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x18): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x20): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x28): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x30): 0x00000000.00000000

Dec 12 03:20:03 msuawi11 unix: [AFT2] E$Data (0x38): 0x10000000.00000000
*Bad* PSYND=0x00ff

Dec 12 03:20:03 msuawi11 unix: NOTICE: Scheduling clearing of error on
page 0x00000001.83312000

Dec 12 03:20:03 msuawi11 unix: [AFT3] errID 0x0007a7a3.175c6352 Above
Error detected by protected Kernel code

Dec 12 03:20:03 msuawi11 that will try to clear error from system

Dec 12 03:20:26 msuawi11 unix: WARNING: [AFT1] Uncorrectable Memory
Error on CPU12 Data access at TL=0, errID 0x0007a7a8.464761c5

Dec 12 03:20:26 msuawi11 AFSR 0x00000000.80200000<PRIV,UE> AFAR
0x00000001.833131f8

Dec 12 03:20:26 msuawi11 AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x10035db0

Dec 12 03:20:26 msuawi11 UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203<UE>
UDBL.ESYND 0x03

Dec 12 03:20:26 msuawi11 UDBL Syndrome 0x3 Memory Module Board 6 J3101
J3201 J3301 J3401 J3501 J3601 J3701 J3801

Dec 12 03:20:26 msuawi11 unix: WARNING: [AFT1] errID 0x0007a7a8.464761c5
Syndrome 0x3 indicates that this may not be a memory module problem

Dec 12 03:20:26 msuawi11 unix: [AFT2] errID 0x0007a7a8.464761c5
PA=0x00000001.833131f8

Dec 12 03:20:26 msuawi11 E$tag 0x00000000.0fc03066 E$State: Modified
E$parity 0x07

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x00): 0x2e2e6b65.66696c65

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x08): 0x65786563.6c2e616c

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x10): 0x2e2e7269.70747361

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x18): 0x00000000.00000000

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x20): 0x000002a1.004ddd60

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x28): 0x00000000.00000000

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x30): 0x00000000.00000000

Dec 12 03:20:26 msuawi11 unix: [AFT2] E$Data (0x38): 0x10000000.00000000
*Bad* PSYND=0x00ff

Dec 12 03:20:26 msuawi11 unix: panic[cpu12]/thread=2a1004ddd60:

Dec 12 03:20:26 msuawi11 unix: [AFT1] errID 0x0007a7a8.464761c5 UE
Error(s)

Dec 12 03:20:26 msuawi11 See previous message(s) for details

Dec 12 03:20:26 msuawi11 unix:

Dec 12 03:20:26 msuawi11 unix: syncing file systems...

Dec 12 03:20:46 msuawi11 unix: panic[cpu12]/thread=2a1000abd60:

Dec 12 03:20:46 msuawi11 unix: panic sync timeout

Dec 12 03:20:46 msuawi11 unix:
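
For anyone triaging a similar log, here is a minimal Python sketch that lists the [AFT1] events above by type, CPU and errID. It is only an illustration: the names SAMPLE, EVENT and summarize are hypothetical, and the regular expression covers only the two message forms seen in this log (WP event, Uncorrectable Memory Error). It rejoins lines first, since the mail client wrapped each syslog record.

```python
import re
from collections import Counter

# A few [AFT1] records copied from the log above, still wrapped as posted.
SAMPLE = """\
Dec 12 03:19:48 msuawi11 unix: WARNING: [AFT1] WP event on CPU5, errID
0x0007a79f.9d1d57b0
Dec 12 03:20:03 msuawi11 unix: WARNING: [AFT1] Uncorrectable Memory
Error on CPU4 Data access at TL=0, errID 0x0007a7a3.145c3f81
Dec 12 03:20:03 msuawi11 unix: WARNING: [AFT1] Uncorrectable Memory
Error on CPU4 Data access at TL=0, errID 0x0007a7a3.175c6352
Dec 12 03:20:26 msuawi11 unix: WARNING: [AFT1] Uncorrectable Memory
Error on CPU12 Data access at TL=0, errID 0x0007a7a8.464761c5
"""

# Assumption: only these two event names are matched; other AFT1
# message types would need their own alternatives added here.
EVENT = re.compile(
    r"\[AFT1\] (WP event|Uncorrectable Memory Error) on CPU(\d+)\b"
    r".*?errID (0x[0-9a-f]+\.[0-9a-f]+)"
)


def summarize(log_text):
    """Return (event type, CPU number, errID) for each matched event."""
    # Collapse all whitespace so wrapped records become one long line.
    flat = " ".join(log_text.split())
    return [(kind, int(cpu), err_id) for kind, cpu, err_id in EVENT.findall(flat)]


if __name__ == "__main__":
    events = summarize(SAMPLE)
    for kind, cpu, err_id in events:
        print(f"CPU{cpu:<3} {kind:<27} {err_id}")
    # Count events per CPU to see whether one processor dominates.
    print(Counter(cpu for _, cpu, _ in events))
```

A per-CPU count like this only shows which processors reported the errors; it does not by itself say whether a CPU module or Board 6 memory is at fault.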