NFS mount hangs on just a few clients

From: Mark Scheufele (mark.scheufele@diasemi.com)
Date: Mon Dec 01 2003 - 12:49:42 EST


Dear Sunmanagers,

I'm currently experiencing a weird NFS problem. On some machines in my
network (one Solaris 9 x86 box and at least one Red Hat 7.2 box) the NFS
service hangs from time to time. All the other machines are running fine
during these periods. There are also no errors logged on the server
(E450, Solaris 8, kernel patch 108528-19).

I've done the following tests on the clients that have the problem:

[root@trax:/]#rpcinfo -T udp asterix nfs
program 100003 version 2 ready and waiting
program 100003 version 3 ready and waiting
[root@trax:/]#rpcinfo -T tcp asterix nfs
program 100003 version 2 ready and waiting
program 100003 version 3 ready and waiting
[root@trax:/]#rpcinfo -T udp asterix mountd
program 100005 version 1 ready and waiting
program 100005 version 2 ready and waiting
program 100005 version 3 ready and waiting
[root@trax:/]#rpcinfo -T tcp asterix mountd
program 100005 version 1 ready and waiting
program 100005 version 2 ready and waiting
program 100005 version 3 ready and waiting
[root@trax:/]#rpcinfo -T udp asterix nlockmgr
program 100021 version 1 ready and waiting
program 100021 version 2 ready and waiting
program 100021 version 3 ready and waiting
program 100021 version 4 ready and waiting
[root@trax:/]#rpcinfo -T tcp asterix nlockmgr
program 100021 version 1 ready and waiting
program 100021 version 2 ready and waiting
program 100021 version 3 ready and waiting
program 100021 version 4 ready and waiting
[root@trax:/]#rpcinfo -T udp asterix llockmgr
rpcinfo: RPC: Program not registered

I'm wondering why the llockmgr daemon is not running (it should be,
according to Infodoc 11987 on the SunSolve page). Might this affect the
behaviour? If so, how can I start it (my NFS services are started by
the Veritas Cluster Server software)?

Another thing I noticed is that during the periods when the NFS service
hangs, mounting the shared filesystem using the hostname for the
physical interface (ge0) works. Only the mount over the virtual
interface (ge0:2, plumbed by the cluster software) fails.

ge0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
        inet 10.1.17.108 netmask fffff000 broadcast 10.1.31.255
        ether 8:0:20:b5:23:44
ge0:1: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
        inet 10.1.17.133 netmask fffff000 broadcast 10.1.31.255
ge0:2: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
        inet 10.1.17.106 netmask fffff000 broadcast 10.1.31.255
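
Since the hang seems tied to the virtual address, the rpcinfo checks
could be repeated against both the physical and the virtual hostname
while the problem is occurring. The following is only a rough sketch:
the function name probe_host and the RPCINFO override are made up for
illustration, and the -T option is taken from the rpcinfo calls shown
earlier (an older Linux rpcinfo may only accept -u and -t instead).

```shell
#!/bin/sh
# Sketch: probe the NFS-related RPC programs on each server name given
# on the command line and report which transport/program pairs answer.
# RPCINFO is a hypothetical override; it defaults to the real rpcinfo.
RPCINFO=${RPCINFO:-rpcinfo}

probe_host() {
    host=$1
    failed=0
    for transport in udp tcp; do
        for prog in nfs mountd nlockmgr; do
            # rpcinfo -T <transport> <host> <program> exits non-zero
            # when the program does not answer on that transport.
            if $RPCINFO -T "$transport" "$host" "$prog" >/dev/null 2>&1; then
                echo "$host $transport $prog ok"
            else
                echo "$host $transport $prog FAILED"
                failed=1
            fi
        done
    done
    return $failed
}

# Run against every hostname passed as an argument.
for h in "$@"; do
    probe_host "$h"
done
```

Saved under any name and called with the hostname of the physical
interface and the hostname of the virtual interface as arguments, it
prints one line per host, transport and program, so a difference
between the two names should stand out.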

It would be great if you could point me in the right direction
(rebooting or switching the service group is currently not an option
for me).

Many thanks in advance, and I'll summarize,

mark


