memory problems

From: Robert Helmer (robert@roberthelmer.com)
Date: Mon Dec 29 2003 - 20:20:19 EST


Hello,

I am testing some RAM with known problems in an E420R. The RAM is
causing intermittent system crashes, and /var/adm/messages
periodically shows errors indicating that memory is the problem.

However, it is not clear to me from the error messages exactly which
sticks of RAM are bad. Here is an example:

Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 361612 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU1 Data access at TL>0, errID 0x00027562.6f9cad81
Dec 9 02:06:33 dev2 AFSR 0x00000001<ME>.80300000<PRIV,UE,CE> AFAR 0x00000000.fd039aa0
Dec 9 02:06:33 dev2 AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x100accac
Dec 9 02:06:33 dev2 UDBH 0x01a4<CE> UDBH.ESYND 0xa4 UDBL 0x020a<UE> UDBL.ESYND 0x0a
Dec 9 02:06:33 dev2 UDBL Syndrome 0xa Memory Module U1304 U0304 U1303 U0303
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 318453 kern.info] [AFT2] errID 0x00027562.6f9cad81 PA=0x00000000.fd039aa0
Dec 9 02:06:33 dev2 E$tag 0x00000000.0a401fa0 E$State: Shared E$parity 0x05
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x00): 0x00000000.000c001c
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2] E$Data (0x08): 0x80000000.107c07b6 *Bad* PSYND=0x00ff
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x10): 0x00000000.000c001c
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2] E$Data (0x18): 0x81000000.1dbc27b6 *Bad* PSYND=0x00ff
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x20): 0x00000000.000c001c
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2] E$Data (0x28): 0x82000000.5a7c47b6 *Bad* PSYND=0x00ff
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x30): 0x00000000.000c001c
Dec 9 02:06:33 dev2 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x38): 0x83000000.a77c67b6
Dec 9 02:06:33 dev2 unix: [ID 836849 kern.notice]
Dec 9 02:06:33 dev2 panic[cpu1]/thread=300088e1820:
Dec 9 02:06:33 dev2 unix: [ID 676562 kern.notice] [AFT1] errID 0x00027562.6f9cad81 UE Error(s)
Dec 9 02:06:33 dev2 See previous message(s) for details
Dec 9 02:06:33 dev2 unix: [ID 100000 kern.notice]
Dec 9 02:06:33 dev2
Dec 9 02:06:33 dev2 genunix: [ID 723222 kern.notice] 000002a10064b3a0 SUNW,UltraSPARC-II:cpu_aflt_log+4e0 (2a10064b45e, 1, 10140858, 2a10064b5e8, 2a10064b4ab, 10140880)

It prints U1304 U0304 U1303 U0303. Does this mean that between two and
four sticks are bad?
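
To check whether the same module labels show up across repeated
errors, the "Memory Module" lines can be pulled out of the log and
tallied. The following is only a minimal sketch, assuming every such
line has the same layout as the one above; the function name
tally_modules and the regular expression are my own illustration, not
part of any Solaris tool.

```python
import re
from collections import Counter

# Assumed layout, taken from the log excerpt above:
#   "... UDBL Syndrome 0xa Memory Module U1304 U0304 U1303 U0303"
MODULE_LINE = re.compile(
    r"Syndrome\s+(0x[0-9a-fA-F]+)\s+Memory Module\s+((?:U\d+\s*)+)"
)

def tally_modules(lines):
    """Count how often each module label appears on 'Memory Module' lines."""
    counts = Counter()
    for line in lines:
        match = MODULE_LINE.search(line)
        if match:
            # group(2) holds the space-separated labels, e.g. "U1304 U0304"
            counts.update(match.group(2).split())
    return counts

sample = [
    "Dec 9 02:06:33 dev2 UDBL Syndrome 0xa Memory Module U1304 U0304 U1303 U0303",
    "Dec 9 02:06:33 dev2 AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x100accac",
]
print(tally_modules(sample))
```

Passing an open file handle for /var/adm/messages in place of sample
would tally the whole log. The counts only show which labels are
reported together; they do not by themselves say which stick is bad.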

Thanks,
Rob Helmer


