Request information to understan which bit is defective in the memory word

From: Asiye Yiğit (Asiye.Yigit@gantek.com)
Date: Wed Nov 16 2005 - 11:50:38 EST


Hi Gurus,
I have some "correctable memory error" on my system. I have found the
Following from the some documents:

"whenever solaris os reports at least 4 Ces, two from one bit position
With unique address, and two from another bit position also with
Unique address, and the lower 6 bits of all the addresses are the same,
(AFARs show the same modulo 64 checkword address). Replace the DIMM
containing
The addresses"

How can I identify that the above rule is provided in messages?

Are there any document to identify that?

Nov 16 00:14:44 bgwkibris SUNW,UltraSPARC-II: [ID 141506 kern.info]
[AFT0] Corrected Memory Error detected by CPU0, errID
0x004e6912.520ffda7
Nov 16 00:14:44 bgwkibris AFSR 0x00000000.00100000<CE> AFAR
0x00000000.27319b48
Nov 16 00:14:44 bgwkibris AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x100335e4
Nov 16 00:14:44 bgwkibris UDBL Syndrome 0xe9 Memory Module U0702
Nov 16 00:14:44 bgwkibris SUNW,UltraSPARC-II: [ID 323042 kern.info]
[AFT0] errID 0x004e6912.520ffda7 Corrected Memory Error on U0702 is
Intermittent
Nov 16 00:14:44 bgwkibris SUNW,UltraSPARC-II: [ID 618474 kern.info]
[AFT0] errID 0x004e6912.520ffda7 ECC Data Bit 28 was in error and
corrected

Nov 16 03:58:17 bgwkibris pcipsy: [ID 854591 kern.info] NOTICE:
correctable error detected by pci0 (upa mid 1f) during
Nov 16 03:58:17 bgwkibris DVMA read transaction
Nov 16 03:58:17 bgwkibris pcipsy: [ID 750218 kern.info]
AFSR=40e90000.3f800000 AFAR=00000000.27319b48,
Nov 16 03:58:17 bgwkibris double word offset=1, Memory Module
U0702 id 31.
Nov 16 03:58:17 bgwkibris pcipsy: [ID 916270 kern.info] syndrome bits e9
Nov 16 03:58:17 bgwkibris SUNW,UltraSPARC-II: [ID 782184 kern.info]
[AFT0] errID 0x004e7545.1ef7b4c6 Corrected Memory Error on U0702 is
Intermittent
Nov 16 03:58:17 bgwkibris SUNW,UltraSPARC-II: [ID 590082 kern.info]
[AFT0] errID 0x004e7545.1ef7b4c6 ECC Data Bit 28 was in error and
corrected
Nov 16 03:58:17 bgwkibris SUNW,UltraSPARC-II: [ID 431227 kern.info]
[AFT0] Corrected Memory Error detected by CPU0, errID
0x004e7545.1fef582a
Nov 16 03:58:17 bgwkibris AFSR 0x00000000.00100000<CE> AFAR
0x00000000.27319b48
Nov 16 03:58:17 bgwkibris AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00
Fault_PC 0x101369a8
Nov 16 03:58:17 bgwkibris UDBL Syndrome 0xe9 Memory Module U0702
Nov 16 03:58:17 bgwkibris SUNW,UltraSPARC-II: [ID 474645 kern.info]
[AFT0] errID 0x004e7545.1fef582a Corrected Memory Error on U0702 is
Intermittent
Nov 16 03:58:17 bgwkibris SUNW,UltraSPARC-II: [ID 565294 kern.info]
[AFT0] errID 0x004e7545.1fef582a ECC Data Bit 28 was in error and
corrected

Nov 16 07:44:16 bgwkibris pcipsy: [ID 854591 kern.info] NOTICE:
correctable error detected by pci0 (upa mid 1f) during
Nov 16 07:44:16 bgwkibris DVMA read transaction
Nov 16 07:44:16 bgwkibris pcipsy: [ID 750218 kern.info]
AFSR=40e90000.3f800000 AFAR=00000000.27319b48,
Nov 16 07:44:16 bgwkibris double word offset=1, Memory Module
U0702 id 31.
Nov 16 07:44:16 bgwkibris pcipsy: [ID 916270 kern.info] syndrome bits e9
Nov 16 07:44:16 bgwkibris SUNW,UltraSPARC-II: [ID 741292 kern.info]
[AFT0] errID 0x004e819a.33a6a443 Corrected Memory Error on U0702 is
Intermittent
Nov 16 07:44:16 bgwkibris SUNW,UltraSPARC-II: [ID 827599 kern.info]
[AFT0] errID 0x004e819a.33a6a443 ECC Data Bit 28 was in error and
corrected
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:36:47 EDT