ES450 Reboot

From: stv (stvsmth@gmail.com)
Date: Tue Oct 24 2006 - 12:14:45 EDT


Over the weekend our server rebooted itself. There is a nightly Oracle
export that creates a named pipe & pipes the oracle export into the
pipe & then into gzip ... apparently gzip caused the error.

I had some memory errors reported earlier in the month, but they
seemed to recover (along the lines of ( [ID 796456 kern.info] [AFT0]
errID 0x00009798.b171c261 ECC Data Bit 17 was in error and corrected)

So is this a memory issue? I'm unclear as to the meaning of

Syndrome 0x3 indicates that
    this may not be a memory module problem

SUNW,UltraSPARC-II: [ID 394992 kern.warning]
    WARNING: [AFT1] Uncorrectable Memory Error on CPU2
    Data access at TL=0, errID 0x0008c91f.8ff19baf
AFSR 0x00000000.00200000<UE> AFAR 0x00000000.394d5990
AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x15b40
UDBH 0x0203<UE> UDBH.ESYND 0x03 UDBL 0x0000 UDBL.ESYND 0x00
UDBH Syndrome 0x3 Memory Module 170x
SUNW,UltraSPARC-II: [ID 682345 kern.warning] WARNING: [AFT1]
    errID 0x0008c91f.8ff19baf Syndrome 0x3 indicates that
    this may not be a memory module problem
SUNW,UltraSPARC-II: [ID 432663 kern.info] [AFT2]
    errID 0x0008c91f.8ff19baf PA=0x00000000.394d5990
    E$tag 0x00000000.08400729 E$State: Shared E$parity 0x04
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x00): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x08): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 989652 kern.info]
    [AFT2] E$Data (0x10): 0x00000000.10000000 *Bad* PSYND=0xff00
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x18): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x20): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x28): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x30): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x38): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 587988 kern.warning] WARNING: [AFT1]
    AFAR was derived from UE report, CP event on CPU0 (
    caused Data access error on CPU2), errID 0x0008c91f.8ff19baf
    AFSR 0x00000000.01000800<CP> AFAR 0x00000000.394d5990
    AFSR.PSYND 0x0800(Score 95) AFSR.ETS 0x00
    UDBH 0x00a1 UDBH.ESYND 0xa1 UDBL 0x0000 UDBL.ESYND 0x00
SUNW,UltraSPARC-II: [ID 432663 kern.info] [AFT2]
    errID 0x0008c91f.8ff19baf PA=0x00000000.394d5990
    E$tag 0x00000000.19400729 E$State: Owner E$parity 0x0c
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x00): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x08): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 989652 kern.info]
    [AFT2] E$Data (0x10): 0x00000000.10000000 *Bad* PSYND=0x0800
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x18): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x20): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
    [AFT2] E$Data (0x28): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
   [AFT2] E$Data (0x30): 0x00000000.00000000
SUNW,UltraSPARC-II: [ID 359263 kern.info]
   [AFT2] E$Data (0x38): 0x00000000.00000000
unix: [ID 321153 kern.notice] NOTICE: Scheduling clearing
    of error on page 0x00000000.394d4000
SUNW,UltraSPARC-II: [ID 620635 kern.info] [AFT3]
    errID 0x0008c91f.8ff19baf Above Error is in User Mode
    and is fatal: will reboot
unix: [ID 855177 kern.warning] WARNING: [AFT1] initiating reboot
    due to above error in pid 29280 (gzip)
pseudo: [ID 129642 kern.info] pseudo-device: tod0
genunix: [ID 936769 kern.info] tod0 is /pseudo/tod@0
syslogd: going down on signal 15
unix: [ID 221039 kern.notice] NOTICE:
    Previously reported error on page 0x00000000.394d4000 cleared
genunix: [ID 672855 kern.notice] syncing file systems...
genunix: [ID 904073 kern.notice] done
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:41:04 EDT