System Crash

From: sajid@adnoc.com
Date: Wed Mar 31 2004 - 04:35:06 EST


Hi Experts,

I have an E 3000 machine that crashes frequently. I have been trying to
isolate the issue by disabling all third-party agents and applications, but
with no success. The panic messages from /var/adm/messages are shown below,
after the machine spec.

Machine Spec:

2 CPUs, 2 GB RAM, running Solaris 8
SAN-attached through a Fibre Channel HBA
Gigabit and Fast Ethernet interfaces
Oracle 8 DB
IBM Tivoli Backup Agent

Mar 31 09:13:02 myhost unix: [ID 251936 kern.notice] panic: ptl1 trap reason 0x2
Mar 31 09:13:02 myhost unix: [ID 554257 kern.notice] TL=0x1 TT=0x68 TICK=0x1eeed47e0c08a
Mar 31 09:13:02 myhost unix: [ID 860431 kern.notice] TPC=0x10118a5c TnPC=0x10118a60 TSTATE=0x4480001605
Mar 31 09:13:02 myhost unix: [ID 554257 kern.notice] TL=0x2 TT=0x68 TICK=0x1eeed47e0c01c
Mar 31 09:13:02 myhost unix: [ID 860431 kern.notice] TPC=0x10007098 TnPC=0x1000709c TSTATE=0x9180001506
Mar 31 09:13:02 myhost unix: [ID 836849 kern.notice]
Mar 31 09:13:02 myhost ^Mpanic[cpu15]/thread=2a1001abd20:
Mar 31 09:13:02 myhost unix: [ID 715043 kern.notice] Kernel panic at trap level 2
Mar 31 09:13:02 myhost unix: [ID 100000 kern.notice]
Mar 31 09:13:02 myhost genunix: [ID 723222 kern.notice] 000000001040c1f0 unix:sys_tl1_panic+8 (1044a378, 4, 0, 2000, 0, 2a1001aa95c)
Mar 31 09:13:02 myhost genunix: [ID 179002 kern.notice] %l0-3: 0000000000000005 0000000000001400 0000004480001605 000000001000723c
Mar 31 09:13:02 myhost %l4-7: 00000300074feb10 0000030006738ee0 000000000000000f 000000001040c2a0
Mar 31 09:13:02 myhost genunix: [ID 723222 kern.notice] 000000001040c340 genunix:vmem_xalloc+12c (1044a378, 1044a780, ffffffffffffffff, 0, 0, 0)
Mar 31 09:13:02 myhost genunix: [ID 179002 kern.notice] %l0-3: 000002a1001aaa68 ffffffffffffe000 000000001044a378 0000000000082000
Mar 31 09:13:02 myhost %l4-7: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
Mar 31 09:13:02 myhost genunix: [ID 723222 kern.notice] 000002a1001aa270 genunix:vmem_alloc+34 (1041bd18, 2000900000008, 3a00000000, 0, 0, 0)
Mar 31 09:13:02 myhost genunix: [ID 179002 kern.notice] %l0-3: 00000000104eeb48 000002a1001aa408 0000000000000000 000000001041bd18
Mar 31 09:13:02 myhost %l4-7: 0400000000000200 0000000300000004 0000000010423748 000001ff00030010
Mar 31 09:13:02 myhost unix: [ID 100000 kern.notice]
Mar 31 09:13:02 myhost genunix: [ID 672855 kern.notice] syncing file systems...
Mar 31 09:13:02 myhost jnic: [ID 578556 kern.notice] jnic0: FcScsiTranStart(3): POLLED command
Mar 31 09:13:42 myhost unix: [ID 836849 kern.notice]
Mar 31 09:13:42 myhost ^Mpanic[cpu14]/thread=2a10004bd20:
Mar 31 09:13:42 myhost unix: [ID 715357 kern.notice] panic sync timeout
Mar 31 09:13:42 myhost unix: [ID 100000 kern.notice]
Mar 31 09:13:42 myhost genunix: [ID 353387 kern.notice] dumping to /dev/dsk/c0t0d0s1, offset 1258422272
Mar 31 09:13:42 myhost genunix: [ID 409368 kern.notice] ^M100% done: 30562 pages dumped, compression ratio 2.90,
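
The call stack buried in the messages above can be pulled out with a short script, which makes it easier to compare one panic against the next. This is only a sketch: the function name extract_frames and the regular expression are illustrative assumptions, not part of any Solaris tool, and the pattern is based solely on the "module:function+offset (args)" form seen in this excerpt.

```python
import re

# Matches a stack-frame entry such as "unix:sys_tl1_panic+8 (" or
# "genunix:vmem_xalloc+12c (". The pattern is an assumption derived
# from the excerpt above, not from a documented log format.
FRAME_RE = re.compile(r"\b([A-Za-z_]\w*):([A-Za-z_]\w*)\+([0-9a-f]+) \(")


def extract_frames(log_text):
    """Return (module, function, hex_offset) tuples in order of appearance."""
    # Mail clients hard-wrap long syslog lines, so collapse all
    # whitespace (including line breaks) into single spaces first.
    flat = " ".join(log_text.split())
    return FRAME_RE.findall(flat)


# Three frames copied from the excerpt, still in their wrapped form.
SAMPLE = """\
Mar 31 09:13:02 myhost genunix: [ID 723222 kern.notice] 000000001040c1f0
unix:sys_tl1_panic+8 (1044a378, 4, 0, 2000, 0, 2a1001aa95c)
Mar 31 09:13:02 myhost genunix: [ID 723222 kern.notice] 000000001040c340
genunix:vmem_xalloc+12c (1044a378, 1044a780, ffffffffffffffff, 0, 0, 0)
Mar 31 09:13:02 myhost genunix: [ID 723222 kern.notice] 000002a1001aa270
genunix:vmem_alloc+34 (1041bd18, 2000900000008, 3a00000000, 0, 0, 0)
"""

for module, func, offset in extract_frames(SAMPLE):
    print(f"{module}:{func}+0x{offset}")
```

Running the same function over the messages from several crashes would show whether every panic passes through the same functions or whether the stacks differ each time.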

Does this make sense to any of you? If so, please let me know. Is there a
good tool for analyzing the crash dumps? I have another machine, an E 5500,
that was behaving the same way. There we replaced the RAM, and the crashes
now seem to be less frequent. How can I confirm that RAM is the problem? The
E 5500 now also crashes when a file system is 100% full. Can a file system
filling up cause a system crash?

Sorry, I have asked many questions, but I hope you gurus can help me. I am
not getting good support from my supplier on this issue.

Regards,

Mohammed Sajid


