Panic with "module "cl_comm" due to a NULL pointer dereference"

From: Mehmet Soysal (soysal@ira.uka.de)
Date: Thu Dec 08 2005 - 20:11:40 EST


Hello,
i installed few months ago Sun Cluster 3.1 on 2 E250 Solaris 9 (32bit)
and everthing was fine.
This setup was for testing purposes. Today i installed on this Hardware
a new Version of Solaris 9 and Sun Cluster 3.1 but this time i used the
Solaris 9 64bit version. First it didn4t recognized the two additional
Network cards (Skyconnect 9821 and 9843sx). Both Nodes have for the
Interconnect one copper card and one fibre cards. I downloaded a 64 bit
Driver for Sparc fom syskonnect.de and after a boot -r it recognized
these Cards. But now one of this 2 Node Cluster crash with a core dump.
I googled a litte bit but i didn4t found a solution. I saw a site wer it
was suggested to stop BSM but it didn4t helped. I installed the newest
patches but it didn4t help. Here is the output from the console:

----SNIP--------
Configuring /dev and /devices
TSI: gfxp0 is GFX8P @ 1152x900

skge: SysKonnect Gigabit Ethernet Adapter families v8.12.1.3 Wed Aug
3 13:15:54 MEST 2005

skge0: SK-9821 V2.0 Gigabit Ethernet 10/100/1000Base-T Adapter
PreferredPort: A
Dual Net Support: No
RLMT Mode: CLS
Jumbo Frame Support: Off
Copy Threshold: 1500
VLAN Support: No
Interrupt Moderation: On

skge1: SysKonnect SK-NET Gigabit Ethernet Adapter SK-9843 SX
PreferredPort: A
Dual Net Support: No
RLMT Mode: CLS
Jumbo Frame Support: Off
Copy Threshold: 1500
VLAN Support: No
Interrupt Moderation: On

Configuring the /dev directory (compatibility devices)
Re-generating rdriver.conf file ...
Booting as part of a cluster
NOTICE: CMM: Node iramsb (nodeid = 1) with votecount = 1 added.
NOTICE: CMM: Node iramsa (nodeid = 2) with votecount = 0 added.
NOTICE: clcomm: Adapter skge1 constructed
NOTICE: clcomm: Path iramsa:skge1 - iramsb:ge0 being constructed
NOTICE: clcomm: Adapter skge0 constructed
NOTICE: clcomm: Path iramsa:skge0 - iramsb:skge0 being constructed
NOTICE: CMM: Node iramsa: attempting to join cluster.
skge1: Network connection up on port A
    Link Speed: 1000 Mbps
    Autonegotiation: Yes
    Duplex Mode: Full
    Flow Control: Symmetric
NOTICE: clcomm: Path iramsa:skge1 - iramsb:ge0 being initiated
NOTICE: CMM: Node iramsb (nodeid: 1, incarnation #: 1134085143) has
become reachable.
NOTICE: CMM: Cluster has reached quorum.
NOTICE: CMM: Node node1 (nodeid = 1) is up; new incarnation number =
1134085143.
NOTICE: CMM: Node node2 (nodeid = 2) is up; new incarnation number =
1134087516.
NOTICE: CMM: Cluster members: node1 node2.
NOTICE: CMM: node reconfiguration #13 completed.
NOTICE: CMM: Node node2: joined cluster.

panic[cpu0]/thread=2a101d77d40: BAD TRAP: type=31 rp=2a101d77740 addr=30
mmu_fsr=0 occurred in module "cl_comm" due to a NULL pointer dereference

sched: trap type = 0x31
addr=0x30
pid=0, pc=0x782b079c, sp=0x2a101d76fe1, tstate=0x4480001605, context=0x0
g1-g7: 1498800, 8dd6ca, 11bad, 78413c00, 78413c00, 16, 2a101d77d40

000002a101d77460 unix:die+80 (31, 2a101d77740, 30, 0, 100c828, e)
  %l0-3: 0000000000000000 0000000001413848 000002a101d77740 000002a101d77630
  %l4-7: 0000000000000031 000003000001ead8 000003000001eb00 00000300003658c0
000002a101d77540 unix:trap+8a4 (2a101d77740, 0, 10000, 10200, 0, 8)
  %l0-3: 0000000000000001 0000000000000000 0000000001438788 0000000000000031
  %l4-7: 0000000000000005 0000000000000001 0000000000000000 0000000000000000
000002a101d77690 unix:ktl0+48 (8, 6b, 0, 78413c8b, 30002262f39, 0)
  %l0-3: 0000000000000006 0000000000001400 0000004480001605 000000000102d9a8
  %l4-7: 0000000000000014 00000300003658c0 0000000000000000 000002a101d77740
000002a101d777e0
cl_comm:__1cKfp_adapterNget_fp_header6MpCLHC_pnEmsgb__+ec (30002375a58,
3000334dad9, 6, 0, 30002375b28, 30002375b20)
  %l0-3: 0000000000000000 0000000000000000 000003000000bb08 0000000000000000
  %l4-7: 000003000000bcb0 000003000000bcd8 0000000000000002 0000000000000000
000002a101d778a0
cl_comm:__1cJfp_holderVupdate_remote_macaddr6MrnHnetworkJmacinfo_t__v_+f0
(3000334d828, 6, 3000334d840, 3000334d980, 78413500, 3000334d840)
  %l0-3: 000003000334d980 0000000078413500 0000000000000000 000003000334db48
  %l4-7: 000003000334dad9 000003000334db05 0000000000000001 0000000000000000
000002a101d77960
cl_comm:__1cLpernodepathOstart_matching6MnM_ManagedSeq_4nL_NormalSeq_4nHnetworkJmacinfo_t___n0C____v_+150
(3000334d828, 0, 40000, 30003359200, 1, 30002a09790)
  %l0-3: 00000300033591b8 00000000783efae0 0000000078413528 0000000000000000
  %l4-7: 00000000784138a0 0000000000000028 000003000334db48 0000000000000001
000002a101d77a20 cl_comm:__1cGfpconfIfp_ns_if6M_v_+17c (3000334db88,
300032e7090, 30002a09790, 7844cf08, 3000334ddb8, 3000334dbb0)
  %l0-3: 0000000000040000 0000000000000001 000003000334dec0 000003000334dbb8
  %l4-7: 000003000334ddbc 0000000000000001 000002a101d77ad8 000000007844cf20

syncing file systems... 3 done
dumping to /dev/dsk/c0t0d0s3, offset 419823616, content: kernel
-----Snap---------

Maybe i did something wrong during the sencond Setup but i don4t know
what it could be.
Has anybody an Idea ?

MfG
M.Soysal
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:37:35 EDT