Summary : Re: IP Multi-pathing - problem in auto-mounting

From: Alan Kong (kkkong@ee.cuhk.edu.hk)
Date: Mon Aug 15 2005 - 23:53:57 EDT


Thanks to:
daniel.denes@bankgesellschaft.de
aditya.n1@rediffmail.com
and a few out-of-offices

It turned out that the name of the back-up link was not entered into DNS
although it had been entered into NIS+ host table. Once the name was
added, the server(IPMP) works like a champ.

Regards
Alan

Alan Kong wrote:

> Dear All,
>
> We have a Sun Fire V240 server config. to run multi-pathing with the 2
> on-board bge NICs. The configuration was directly from the document by
> Rodrick Brown. There was no problem in accessing remote file systems
> using auto-mount at start-up.
>
> However, some remote file systems were inaccessible after a few hours
> and the following were observed in syslog file:
>
> Aug 12 11:28:41 esunvx240 nfs: [ID 664466 kern.notice] NFS access
> failed for ser
> ver cuees9: error 7 (RPC: Authentication error)
> Aug 12 11:28:46 esunvx240 last message repeated 3 times
> Aug 12 11:28:58 esunvx240 nfs: [ID 664466 kern.notice] NFS access
> failed for ser
> ver mailhost: error 7 (RPC: Authentication error)
> Aug 12 11:28:58 esunvx240 last message repeated 1 time
> Aug 12 11:30:33 esunvx240 nfs: [ID 664466 kern.notice] NFS lookup
> failed for ser
> ver cuees9: error 7 (RPC: Authentication error)
> Aug 12 11:30:46 esunvx240 last message repeated 131 times
> Aug 12 11:53:47 esunvx240 in.mpathd[103]: [ID 585766 daemon.error]
> Cannot meet r
> equested failure detection time of 10000 ms on (inet bge1) new failure
> detection
> time for group "production" is 48202 ms
> Aug 12 11:54:47 esunvx240 in.mpathd[103]: [ID 302819 daemon.error]
> Improved fail
> ure detection time 24101 ms on (inet bge1) for group "production"
> Aug 12 11:54:47 esunvx240 in.mpathd[103]: [ID 302819 daemon.error]
> Improved fail
> ure detection time 12050 ms on (inet bge0) for group "production"
> Aug 12 11:54:48 esunvx240 in.mpathd[103]: [ID 302819 daemon.error]
> Improved fail
> ure detection time 10000 ms on (inet bge1) for group "production"
> Aug 12 13:53:54 esunvx240 in.mpathd[103]: [ID 585766 daemon.error]
> Cannot meet r
> equested failure detection time of 10000 ms on (inet bge1) new failure
> detection
> time for group "production" is 45892 ms
>
> cuees9 is the application server where ssh and other application
> resides and ssh starts when the v240 start-up.
> mailhost is the mail server where /var/mail hangs from there.
>
> The current platform running on v240 is Solaris 2.9 in a NIS+
> environment.
>
> Pls advise on:
> - whether IP multi-pathing works in NIS+ with auto-mount environment;
> - suggestions on solving the problem;
> - is there any config. varibles I need to to change?
>
> Thank you.
>
> Regards
> Alan
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:31:19 EDT