Init using 25% CPU

From: Dave Lowenstein (dlowenst@mail.sdsu.edu)
Date: Tue Oct 28 2003 - 18:28:52 EST


I'm running an E420 with 4x450MHz SPARC II procs and 4GB of RAM on
Solaris 8, kernel patch level 108528-16. It's had some problems in the
last week or so.

The first time, I logged in because the application (Tomcat) was not
responding. I couldn't kill the Tomcat or Apache processes, even as root.
I also tried typing init 6 to reboot, but nothing happened. I had to
have someone hit Stop-A and type sync to reboot the machine. A bunch of
junk was written to the messages file, including lots of control
characters.

Then a few days later, the server was completely unresponsive and wouldn't
respond to Stop-A. I had to hit the power switch to get it back up. A
computer that was logged in to it running top still had a snapshot of the
last time top ran. It showed syslogd as the top process, which seemed out
of the ordinary.

Right now, init is at the top of the process list, using 27% kernel time.
A truss of the process shows this, over and over again:

setcontext(0xFFBEF320)
    Incurred fault #6, FLTBOUNDS %pc = 0x00031D04
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
    Received signal #11, SIGSEGV [caught]
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
setcontext(0xFFBEF320)
    Incurred fault #6, FLTBOUNDS %pc = 0x00031D04
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
    Received signal #11, SIGSEGV [caught]
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
setcontext(0xFFBEF320)
    Incurred fault #6, FLTBOUNDS %pc = 0x00031D04
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
    Received signal #11, SIGSEGV [caught]
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
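
To check whether a longer truss capture really is the same fault repeating,
the output can be tallied by fault type, program counter, and faulting
address. The following is only a minimal sketch in Python: the function name
summarize_truss and the SAMPLE text are made up for illustration, and the
regular expressions assume the line layout shown above rather than any
documented truss format.

```python
import re
from collections import Counter

# Assumed layout, taken from the capture above:
#     Incurred fault #6, FLTBOUNDS %pc = 0x00031D04
FAULT_RE = re.compile(r"Incurred fault #\d+, (\w+)\s+%pc = (0x[0-9A-Fa-f]+)")
# Assumed layout, taken from the capture above:
#       siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
SIGINFO_RE = re.compile(r"siginfo: (\w+) (\w+) addr=(0x[0-9A-Fa-f]+)")


def summarize_truss(text):
    """Tally (fault, pc) pairs and (signal, code, addr) triples in truss output."""
    faults = Counter()
    siginfos = Counter()
    for line in text.splitlines():
        match = FAULT_RE.search(line)
        if match:
            faults[match.groups()] += 1
            continue
        match = SIGINFO_RE.search(line)
        if match:
            siginfos[match.groups()] += 1
    return faults, siginfos


# One cycle of the loop from the capture above (hypothetical test input).
SAMPLE = """\
setcontext(0xFFBEF320)
    Incurred fault #6, FLTBOUNDS %pc = 0x00031D04
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
    Received signal #11, SIGSEGV [caught]
      siginfo: SIGSEGV SEGV_MAPERR addr=0x0000000C
"""

if __name__ == "__main__":
    faults, siginfos = summarize_truss(SAMPLE)
    for key, count in faults.items():
        print("fault", key, count)
    for key, count in siginfos.items():
        print("siginfo", key, count)
```

If the tally shows a single %pc and a single address, the process appears to
be re-running the same faulting instruction each time its SIGSEGV handler
returns. An address as low as 0x0000000C would be consistent with a read
through a null pointer at a small offset, though that is an inference from
the output and not something truss reports directly.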

Any ideas?

Dave Lowenstein
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers


