Memory/CPU failure ES40

From: Christian Becker (Christian.Becker@Mathematik.Uni-Dortmund.DE)
Date: Mon Aug 05 2002 - 10:38:59 EDT


Hi folks,

I administrate a Compaq Alpha Server ES40 with four EV67 CPUs 666Mhz
with
8 GByte RAM with the following memory organisation:

    0 2048Mb 0000000000000000 4-Way
    1 2048Mb 0000000080000000 4-Way
    2 2048Mb 0000000100000000 4-Way
    3 2048Mb 0000000180000000 4-Way

After one year of running a few weeks ago the following messages appear
in the syslog:

Jul 22 16:12:15 ragnaroek vmunix: WARNING: too many Processor corrected
errors detected on cpu 2. Reporting suspended.
Jul 22 16:13:27 ragnaroek vmunix: WARNING: too many Processor corrected
errors detected on cpu 1. Reporting suspended.
Jul 22 16:14:18 ragnaroek vmunix: WARNING: too many Processor corrected
errors detected on cpu 3. Reporting suspended.

and a few hours later the machine goes to the console prompt.

During the memory test the following messages appear:

EV6 Correctable Memory Fill ECC Error on CPU 0
C_ADDR: 00000000A8FC5BC0
C_SYNDROME_1: 0000000000000057
C_SYNDROME_0: 0000000000000000

EV6 Correctable Dcache ECC Error on CPU 0

EV6 Correctable Memory Fill ECC Error on CPU 0
C_ADDR: 00000000A8FD2BC0
C_SYNDROME_1: 0000000000000057
C_SYNDROME_0: 0000000000000000

First, I thought, it's an defect DRAM module, located in bank 1 because
of the
C_ADDR information. But after removing bank the error still occurs.

So, my question, it is a memory or CPU problem, and, if it's a memoery
problem,
how can I determine the defect DRAM Chip? I haven't found any suitable
documentation.

Many thanks & Bye,
Christian

-- 
         v          
      ..d8b..       Dipl.inform. Christian Becker 
  ..:::d888b:::..
 :::::d88888b:::::  Institut fuer Angewandte Mathematik & Numerik, LS3
:::::d8888888b::::: Universitaet Dortmund 
::::d888888888b:::: Vogelpothsweg 87, 44227 Dortmund, Germany 
 ::{8888P"::"V8,::  Voicemail: +49 231 755 5934 FAX: +49 231 755 5933 
  :D8P":::::::VD:   mailto:Christian.Becker@mathematik.uni-dortmund.de  
  dP  ```````   Y


This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:48:48 EDT