Server (and I) PANIC

From: Unix4me@aol.com
Date: Fri Aug 09 2002 - 00:09:06 EDT


Hello Gurus,

I have been having a problem with an ACE server. It panics and shuts down
roughly every 8 hours. People authenticate against it all the time, so each
time I go in manually, bring it up in single-user mode, run fsck, and then
boot the server; in some cases it boots up on its own after the panic. The
server had been running well for almost 4 months, but then someone powered it
down ungracefully by mistake :-( and I started seeing this problem.

The errors that I get in /var/adm/messages:

Aug 4 18:34:30 Server-Name prngd[143]: [ID 703563 daemon.notice] prngd 0.9.17 (09 May 2001) started up for user root
Aug 4 18:34:30 Server-Name prngd[143]: [ID 710755 daemon.notice] have 7 out of 256 filedescriptors open
Aug 4 18:34:33 Server-Name ntpdate[184]: [ID 774510 daemon.notice] step time server 140.162.8.3 offset 1.403586 sec
Aug 4 18:34:33 Server-Name savecore: [ID 570001 auth.error] reboot after panic: [AFT1] errID 0x0000264f.d6ec4913 UE Error(s)
Aug 4 18:34:33 Server-Name See previous message(s) for details
Aug 4 18:34:35 Server-Name xntpd[247]: [ID 702911 daemon.notice] xntpd 3-5.93e Mon Sep 20 15:47:11 PDT 1999 (1)
Aug 4 18:34:35 Server-Name xntpd[247]: [ID 301315 daemon.notice] tickadj = 5, tick = 10000, tvu_maxslew = 495, est. hz = 100
Aug 4 18:34:36 Server-Name xntpd[247]: [ID 798731 daemon.notice] using kernel phase-lock loop 0041
Aug 4 18:34:36 Server-Name last message repeated 1 time
Aug 4 18:34:42 Server-Name sshd[301]: [ID 800047 auth.error] error: Bind to port 22 on 0.0.0.0 failed: Address already in use.
Aug 4 18:38:53 Server-Name xntpd[247]: [ID 774427 daemon.notice] time reset (slew) 0.149460 s
Aug 5 06:15:32 Server-Name SUNW,UltraSPARC-IIe: [ID 300015 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU0 at TL=0, errID 0x0000264f.f25786ae
Aug 5 06:15:32 Server-Name AFSR 0x00000000.80300000<PRIV,UE,CE> AFAR 0x00000000.5e000cc0
Aug 5 06:15:32 Server-Name AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x10023eac
Aug 5 06:15:32 Server-Name UDBH 0x0259<UE> UDBH.ESYND 0x59 UDBL 0x0000 UDBL.ESYND 0x00
Aug 5 06:15:32 Server-Name UDBH Syndrome 0x59 Memory Module DIMM2
Aug 5 06:15:33 Server-Name SUNW,UltraSPARC-IIe: [ID 229191 kern.info] [AFT2] errID 0x0000264f.f25786ae E$tag != PA from AFAR; E$line was victimized
Aug 5 06:15:33 Server-Name dumping memory from PA 0x00000000.5e000cc0 instead
Aug 5 06:15:33 Server-Name panic[cpu0]/thread=2a100017d40:
Aug 5 06:15:33 Server-Name unix: [ID 639439 kern.notice] [AFT1] errID 0x0000264f.f25786ae UE Error(s)

Aug 7 17:17:24 Server-Name SUNW,UltraSPARC-IIe: [ID 907841 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU0 at TL=0, errID 0x00002650.42ced824
Aug 7 17:17:24 Server-Name AFSR 0x00000000.80300000<PRIV,UE,CE> AFAR 0x00000000.5e000cc0
Aug 7 17:17:24 Server-Name AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x10023f60
Aug 7 17:17:24 Server-Name UDBH 0x0259<UE> UDBH.ESYND 0x59 UDBL 0x0000 UDBL.ESYND 0x00
Aug 7 17:17:24 Server-Name UDBH Syndrome 0x59 Memory Module DIMM2
Aug 7 17:17:25 Server-Name SUNW,UltraSPARC-IIe: [ID 414620 kern.info] [AFT2] errID 0x00002650.42ced824 E$tag != PA from AFAR; E$line was victimized
Aug 7 17:17:25 Server-Name dumping memory from PA 0x00000000.5e

Aug 7 17:19:43 Server-Name prngd[144]: [ID 703563 daemon.notice] prngd 0.9.17 (09 May 2001) started up for user root
Aug 7 17:19:43 Server-Name prngd[144]: [ID 710755 daemon.notice] have 7 out of 256 filedescriptors open
Aug 7 17:19:46 Server-Name ntpdate[185]: [ID 774510 daemon.notice] step time server 198.82.161.227 offset 0.926157 sec
Aug 7 17:19:46 Server-Name savecore: [ID 570001 auth.error] reboot after panic: [AFT1] errID 0x00002650.42ced824 UE Error(s)
Aug 7 17:19:46 Server-Name See previous message(s) for details
Aug 7 17:19:48 Server-Name xntpd[220]: [ID 702911 daemon.notice] xntpd 3-5.93e Mon Sep 20 15:47:11 PDT 1999 (1)
Aug 7 17:19:49 Server-Name xntpd[220]: [ID 301315 daemon.notice] tickadj = 5, tick = 10000, tvu_maxslew = 495, est. hz = 100
Aug 7 17:19:49 Server-Name xntpd[220]: [ID 798731 daemon.notice] using kernel phase-lock loop 0041
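
To see at a glance which module the UE reports point at, the "Memory Module"
lines in the log can be tallied with a short script. This is only a sketch
under stated assumptions: it assumes each log entry sits on a single
(unwrapped) line, and the names tally_modules and MODULE_RE are made up for
illustration, not part of any Solaris tool.

```python
import re
from collections import Counter

# Matches the tail of an [AFT1] report line such as
# "UDBH Syndrome 0x59 Memory Module DIMM2".
# Assumption: the entry has been unwrapped onto a single line.
MODULE_RE = re.compile(r"Syndrome\s+(0x[0-9a-fA-F]+)\s+Memory Module\s+(\S+)")


def tally_modules(log_text):
    """Count how often each (module, syndrome) pair appears in the log text."""
    return Counter(
        (match.group(2), match.group(1))
        for match in MODULE_RE.finditer(log_text)
    )


# Two entries copied from the messages file, joined onto one line each.
sample = """\
Aug 5 06:15:32 Server-Name UDBH Syndrome 0x59 Memory Module DIMM2
Aug 7 17:17:24 Server-Name UDBH Syndrome 0x59 Memory Module DIMM2
"""

for (module, syndrome), count in tally_modules(sample).items():
    print(module, syndrome, count)
```

On the two entries shown, this counts DIMM2 with syndrome 0x59 twice. To run it
against the real file, read /var/adm/messages into a string and pass that to
tally_modules in place of sample.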
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers


