FOLLOW-UP: ALOM-Problem on a V240, what's debug??

From: Harald Husemann (harald.husemann@materna.de)
Date: Fri Jan 30 2004 - 12:53:02 EST


<Original posting below>

Hi agn,

I just switched the keyswitch to diag mode, pressed the power button,
and viola, the system boots. But, I saw some msg. in the POST output:

=========================/snip/============================================
0>ERROR: TEST = Data Bitwalk on Slave 10>H/W under test = MB/P1/B0/D1,
Motherboard0>Repair Instructions: Replace items in order listed by 'H/W
under test' above
0>MSG = Pin 39 failed on MB/P1/B0/D1, Motherboard
0>END_ERROR
=====================/snap/================================================
(...)
=====================/snip/===============================================1>ERROR:0> POST toplevel status has the following failures:
0> MB/P1/B0/D1, Motherboard0>END_ERROR
0>
0>POST: Return to OBP.SC Alert: Host System has ResetKeyswitch set to
diagnostic position.
====================/snap/=================================================
(...)
======================/snip/===============================================Power On Selftest Failed. CPU: 0 cause: MB/P1/B0/D1, Motherboard CPU: 1 cause: MB/P1/B0/D1, MotherboardERROR: CPU1 has 1024/1024MB of memory disabledERROR: POST failedSC Alert: CPU1 B0/D1 J0602 side 1, CPU Module C1 has been failed by POSTSC Alert:Rebooting with command: bootBoot device: disk File and args:
====================/snap/================================================

I think the motherboard is broken, and I should open a ticket at SUN,
right?? As I said, the system is up and I can login - but, it's a
productive system, so it's dangerous to leave it in this state, :-)

(And, I'm still interested what this "debug:" prompt is, :-))

Have a nice weekend,

Harald

On Fri, 2004-01-30 at 18:01, Harald Husemann wrote:
> Hi gurus,
>
> I have a strange problem here:
> I can't get one of our V240's back to work, the serial console only
> shows
>
> debug:
>
> Hm... I expected something like sc> or ok>, :-)
> Every command I've tried is answered by "Corrected ECC Error", and then
> the "debug:" reappears.
> I tried exit, quit, reset -x, .#, even send a "BREAK" to it, without
> success. The SC saw the "break", it answered
>
> telnet> send brk
>
> SC Alert: SC Request to send Break to host.
> Corrected ECC Error
> debug:
>
> But that was it, no attempt to boot or go back to the OPB- or
> ALOM-prompt.
>
> Searched SUNManager's archive, google and docs.sun.com, but found no
> hint...
>
> Hope someone of you has an idea, since it's a productive machine, and I
> have to get it back to work...
> Could it be there's a hardware failure?? Maybe the memory is damaged??
>
> Thanks, I hope someone has an idea,
>
> Harald
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:27:56 EDT