machine freeze

From: Galen Johnson (gjohnson@trantor.org)
Date: Wed Oct 29 2003 - 14:16:30 EST


Hi folks...

Just last night I had a machine lock up (which I found out the hard way
this morning) I tracked down the following in my messages file:

Oct 28 23:13:14 localhost Fault_PC 0x1043218
Oct 28 23:13:14 localhost SUNW,UltraSPARC-III+: [ID 537287 kern.info]
[AFT2] No error found in D$
Oct 28 23:15:17 localhost SUNW,UltraSPARC-III+: [ID 232771 kern.info]
NOTICE: [AFT0] DPE Event on CPU2 in Privileged mode at TL=0, errID
0x00042dd5.64
3ba8b0
Oct 28 23:15:17 localhost Fault_PC 0x1043218
Oct 28 23:15:17 localhost SUNW,UltraSPARC-III+: [ID 537287 kern.info]
[AFT2] No error found in D$
Oct 28 23:15:28 localhost SUNW,UltraSPARC-III+: [ID 834183 kern.info]
NOTICE: [AFT0] DPE Event on CPU2 in Privileged mode at TL=0, errID
0x00042dd8.0c
ff7178
Oct 28 23:15:28 localhost Fault_PC 0x10431e0
Oct 28 23:15:28 localhost SUNW,UltraSPARC-III+: [ID 537287 kern.info]
[AFT2] No error found in D$
Oct 28 23:16:52 localhost SUNW,UltraSPARC-III+: [ID 504187 kern.info]
NOTICE: [AFT0] DPE Event on CPU2 in Privileged mode at TL=0, errID
0x00042deb.9c
258b54
Oct 28 23:16:52 localhost Fault_PC 0x10431e0
Oct 28 23:16:52 localhost SUNW,UltraSPARC-III+: [ID 537287 kern.info]
[AFT2] No error found in D$
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 771256 kern.info]
NOTICE: [AFT0] DDSPE Event on CPU2 in Privileged mode at TL=0, errID
0x00042def.
de7d1e50
Oct 28 23:17:11 localhost Fault_PC 0x1048c48
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 838760 kern.info]
[AFT2] D$Parity (0x0:3:0x10) 0x05
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 810393 kern.info]
[AFT2] D$Data (0x00) 0x00000003.0000f000 0x001b0000.0000f000
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 810393 kern.info]
[AFT2] D$Data (0x10) 0x000082a1.00010d40 *Bad* 0x000002a1.00010d40
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 441589 kern.info]
[AFT2] D$Tag (0x0:3) 0x4b0fe800 D$state Invalid D$utag 0x0500 D$snp
0x4b0fe800
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 550578 kern.info]
[AFT2] PAtag 0x0b0.fe800000 PAsnp 0x0b0.fe800000 VAutag 0x000000
Oct 28 23:17:11 localhost SUNW,UltraSPARC-III+: [ID 304053 kern.info]
[AFT2] Parity errors found = 509

To me this looks like a bad processor...but the parity errors make me
think it's also a memory problem. I don't recognize the [AFT?] message
designation which is why I'm a bit confused. This is a v880 running
solaris 9. I'm going to try to get it to repeat. Any thoughts? (I'm
thinking of reseating all the hardware [which I should have done when we
unpacked the system])

=G=
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:27:22 EDT