CPU Panic on E-4500

From: Bhavesh Shah (bshah@citadon.com)
Date: Fri Nov 11 2005 - 14:25:51 EST


Hi Gurus,
Recently my E-4500 Server crashed with CPU Panic. Asp per
/var/adm/messages i found CPU5 Extrenal cache courrpt issue.The Server
was running Oracle DB and after CPU crashed it came back fine after
reboot but for safety purpose i disabled that CPU5. However the ORacle
DB running on that box started giving wiered problem of time offset of 1
sec in Oracle DB sysdate package and caused our application to crash
intermittently because of time offset of 1 sec. When ran date command in
loop time looks perfectly fine without any offset. Finally i ran Oracle
DB on another node in VCS and problem was gone. I just want to
understand how CPU ECache can cause timeoffset of 1 sec spike
intermittently or there may be anohter issue like EPROM or Memroy? Any
Help/Pointer is greatly appreciated
Regards
B

Nov 9 23:46:58 sccpdb01 SUNW,UlraSPARC-II: [ID 212535 kern.warning]
WARNING: [AFT1] EDP event on CPU5 Instruct
on access at TL=0, errID 0x000371a8.8f2aa050
ov 9 23:46:58 sccpdb01 AFSR 0x00000000.80408000<PRIV,EDP> AFAR
0x00000000.
f220640
ov 9 23:46:58 sccpdb01 AFSR.PSYND 0x8000(Score 95) AFSR.ETS 0x00
Fault_PC
x1022062c
ov 9 23:46:58 sccpdb01 UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000
UDBL.ESYND
x00
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 547230 kern.info] [AFT2]
errID
x000371a8.8f2aa050 PA=0x00000000.ff220640
ov 9 23:46:59 sccpdb01 E$tag 0x00000000.0a401fe4 E$State: Shared
E$parity
x05
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 989652 kern.info] [AFT2]
E$Data
(0x00): 0x80114004.c8592000 *Bad* PSYND=0x8000
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x08): 0x80a12000.1268007b
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x10): 0x80a4a000.a13d6000
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x18): 0x913d6000.92100018
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x20): 0x7fff3e16.94102001
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x28): 0xc85ea000.8b2c3007
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x30): 0x913d6000.88010005
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2]
E$Data
(0x38): 0xca012010.8a016001
ov 9 23:46:59 sccpdb01 SUNW,UltraSPARC-II: [ID 541153 kern.info] [AFT2]
errID
x000371a8.8f2aa050 AFAR was derived from E$Tag
ov 9 23:46:59 sccpdb01 unix: [ID 836849 kern.notice]
ov 9 23:46:59 sccpdb01 ^Mpanic[cpu5]/thread=30008ca54e0:
Nov 9 23:46:59 sccpdb01 unix: [ID 877852 kern.notice] [AFT1] errID
0x000371a8.8f2aa050 EDP Error(s)
Nov 9 23:46:59 sccpdb01 See previous message(s) for details
Nov 9 23:46:59 sccpdb01 unix: [ID 100000 kern.notice]
Nov 9 23:46:59 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6440 SUNW,UltraSPARC-II:cpu_aflt_log+568 (2a1011b64fe, 1,
1014df28, 2a10
11b6688, 2a1011b654b, 1014df50)
Nov 9 23:46:59 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000000000000000 0000000000000003 000002a1011b6750 0000000000000010
Nov 9 23:46:59 sccpdb01 %l4-7: 0000000000800000 0000000000400000
0000000010743fc0 00000310028505a0
Nov 9 23:46:59 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6690 SUNW,UltraSPARC-II:cpu_async_error+868 (1, 2a1011b6750,
80408000, 0
, 140000080408000, 2a1011b6910)
Nov 9 23:46:59 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000000000000001 000000000000000a 0000000000000000 0000000000000000
Nov 9 23:46:59 sccpdb01 %l4-7: 0000000000004208 0000000000000000
0000000000000000 0000030004699770
Nov 9 23:46:59 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6860 unix:prom_rtt+0 (40000, 2a1011b6ab8, 104a09e0, 1,
80000, 0)
Nov 9 23:46:59 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000000000000007 0000000000001400 0000004480001606 0000000010145574
Nov 9 23:46:59 sccpdb01 %l4-7: 0000000000000000 000000000000b810
0000000000000000 000002a1011b6910
Nov 9 23:47:00 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b69b0 vxio:volkiostart+268 (30008a377c0, a84a000, 104a09e0,
10493550, 7ff
fffff, 0)
Nov 9 23:47:00 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000000000000005 0000000000000000 0000000000000000 0000030007f46b70
Nov 9 23:47:00 sccpdb01 %l4-7: 000002a1011b6b70 0000000000000005
0000030007f46bc8 0000030007f46bb0
Nov 9 23:47:00 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6ac0 vxio:vxiostrategy+a8 (30007f46b70, 2a1011b7394,
2a1011b6c78, 0, 0,
2000)
Nov 9 23:47:00 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
000002a1011b755c 00000300021eb510 0000000027ca6000 0000000000002000
Nov 9 23:47:00 sccpdb01 %l4-7: 000002a1011b7540 0000000000000001
0000000000002000 0000030007f68c50
Nov 9 23:47:00 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6bc0 genunix:bdev_strategy+94 (1042ecf0, 30007f46b70, 20,
310025235a0, 2
a1011b7540, 2000)
Nov 9 23:47:00 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
00000000102ca430 0000000010442518 0000000027ca7000 0000000000000000
Nov 9 23:47:00 sccpdb01 %l4-7: 0000000000000001 00000000f0f1f8f4
ffffffffffffffe8 0000000000000000
Nov 9 23:47:00 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6c90 vxfs:vx_dev_strategy+a0 (30007f46b70, 30007f46b70, 0,
0, 78265220,
2)
Nov 9 23:47:00 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000000000000003 0000000000000000 0000000000000000 0000000000000000
Nov 9 23:47:00 sccpdb01 %l4-7: 00000000782652c0 0000000000000004
000000001041bfe0 0000000000000000
Nov 9 23:47:01 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6d70 vxfs:vx_io_startnowait+688 (10, 782664b8, 1,
30002c0aa30, 30007f46b
70, 310028505a0)
Nov 9 23:47:01 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000000000080059 00000310028505a0 0000030009f90340 0000000000000000
Nov 9 23:47:01 sccpdb01 %l4-7: 0000000000000a0e 0000000000000000
00000310028505a0 00000300008626d0
Nov 9 23:47:01 sccpdb01 genunix: [ID 723222 kern.notice]
000002a1011b6e30 vxfs:vx_io_start+c (30002c0aa30, 30007f46b70, 80059,
30002c0aa30, 2
000, 0)
Nov 9 23:47:01 sccpdb01 genunix: [ID 179002 kern.notice] %l0-3:
0000030004699770 000000000406e000 0000000027ca8000 000002a7505a0000
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:36:14 EDT