E6800 domain reboot problem (send_mondo_set)

From: Kerekes, Ed (Ed_Kerekes@steris.com)
Date: Tue Dec 03 2002 - 14:37:21 EST


We are having an issue with some of our domains rebooting with messages
similar to:

Dec 3 13:17:34 SUNW,UltraSPARC-III+: [ID 563681 kern.notice] send
mondo
 timeout [735294 NACK 0 BUSY]
Dec 3 13:17:34 IDSR 0x1 aids:
Dec 3 13:17:34 SUNW,UltraSPARC-III+: [ID 823475 kern.notice] c
Dec 3 13:17:34 SUNW,UltraSPARC-III+: [ID 100000 kern.notice]
Dec 3 13:17:34 unix: [ID 836849 kern.notice]
Dec 3 13:17:34 ^Mpanic[cpu13]/thread=3000f676940:
Dec 3 13:17:34 unix: [ID 152620 kern.notice] send_mondo_set: timeout
Dec 3 13:17:34 unix: [ID 100000 kern.notice]
Dec 3 13:17:34 genunix: [ID 723222 kern.notice] 000002a101a44600
SUNW,U
ltraSPARC-III+:send_mondo_set+238 (1, 1, 2a, 3, 5b35173bfd4,
5b35173bfe1)
Many lines follow

This has happened on 4 different domains on 2 different 6800s several
times and we have replaced CPUs and per Sun's recommendation we have
also turned off predictive branching on the cpus. This has us stumped.
We are running oracle 9i on all domains. If anyone has any ideas or has
seen this before it would be greatly appreciated.

Ed Kerekes
STERIS Corporation
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:25:24 EDT