RE : Continuous in.ipmpd messages

From: Hutin Bertrand (Bertrand.Hutin@fr.Fujitsu.com)
Date: Tue Mar 13 2007 - 17:35:42 EST


in.mpathd checks the network by pinging the default router; if no router is
available, it uses multicast to discover its neighbours and probes those.

Try running snoop on both interfaces (ce1 and ce4) and truss on the in.mpathd
process.
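
A minimal sketch of what that could look like, assuming a root shell on the
affected domain. The function names and the /var/tmp output paths are
hypothetical; the interface names are the ones from the log in this thread.
Restricting the capture to icmp is an assumption that the probes of interest
are the ICMP ones.

```shell
# Capture ICMP traffic on one IPMP interface into a file.
# Usage: snoop_probes ce1   (run a second copy for ce4 in another terminal)
snoop_probes() {
    ifc=$1
    # -r: do not resolve addresses, -d: device to listen on, -o: capture file
    snoop -r -d "$ifc" -o "/var/tmp/snoop.$ifc" icmp
}

# Trace the system calls of the running in.mpathd with relative timestamps.
truss_mpathd() {
    pid=$(pgrep -x in.mpathd) || return 1
    # -d: timestamp each line, -o: write the trace to a file, -p: attach to pid
    truss -d -o /var/tmp/mpathd.truss -p "$pid"
}
```

Leave both running until the next failover is logged, then compare the
timestamps in the capture and the trace with the syslog entry.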

________________________________

From: sunmanagers-bounces@sunmanagers.org on behalf of Bob
Date: Fri 09/03/2007 19:11
To: sunmanagers@sunmanagers.org
Subject: Continuous in.ipmpd messages

I have a 15K domain that's a member of a Sun Cluster, and I'm getting constant
in.mpathd failover messages on it. Here are the messages from just today:

Mar 9 03:58:24 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 03:58:24 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 03:58:27 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 03:58:27 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 05:51:54 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 05:51:54 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 05:51:57 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 05:51:57 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 06:00:04 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 06:00:04 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 06:00:07 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 06:00:07 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 06:12:34 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 06:12:34 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 06:12:37 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 06:12:37 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 06:16:14 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 06:16:14 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 06:16:17 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 06:16:17 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 08:09:33 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 08:09:33 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 08:09:37 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 08:09:37 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 08:30:15 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 08:30:15 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 08:30:17 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 08:30:17 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 08:51:24 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 08:51:24 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 08:51:27 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 08:51:27 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 09:47:24 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 09:47:24 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 09:47:27 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 09:47:27 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 10:42:24 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 10:42:24 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 10:42:27 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 10:42:27 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 11:28:44 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 11:28:44 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 11:28:48 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 11:28:48 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 12:01:24 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 12:01:24 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 12:01:27 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 12:01:27 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1
Mar 9 12:45:05 xxxxxxxx in.mpathd[3694]: [ID 594170 daemon.error] NIC failure detected on ce1 of group xxxxxxxx_ipmp
Mar 9 12:45:05 xxxxxxxx in.mpathd[3694]: [ID 832587 daemon.error] Successfully failed over from NIC ce1 to NIC ce4
Mar 9 12:45:07 xxxxxxxx in.mpathd[3694]: [ID 299542 daemon.error] NIC repair detected on ce1 of group xxxxxxxx_ipmp
Mar 9 12:45:07 xxxxxxxx in.mpathd[3694]: [ID 620804 daemon.error] Successfully failed back to NIC ce1

I have not seen any evidence of hardware failure and there are no errors shown
on the switches. What could be causing these failovers?
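
One way to start narrowing this down is to measure how long each outage lasts;
in the log above every repair follows its failure within 2 to 4 seconds. The
sketch below pairs each "NIC failure" line with the next "NIC repair" line for
the same NIC. The function name mpathd_outages is hypothetical, and the script
assumes one syslog message per line and that an outage does not span midnight.
On Solaris, /usr/bin/awk is the old awk without user-defined functions, so
substitute nawk or /usr/xpg4/bin/awk there.

```shell
# Print the duration of each in.mpathd outage found in the given syslog files
# (or on standard input when no file is given).
mpathd_outages() {
    awk '
    # Convert HH:MM:SS to seconds since midnight.
    function secs(t,  p) { split(t, p, ":"); return p[1]*3600 + p[2]*60 + p[3] }

    # "... NIC failure detected on <nic> of group <group>"
    /in\.mpathd/ && /NIC failure detected on/ {
        nic = $(NF-3)
        down[nic] = secs($3)
        at[nic] = $1 " " $2 " " $3
    }

    # "... NIC repair detected on <nic> of group <group>"
    /in\.mpathd/ && /NIC repair detected on/ {
        nic = $(NF-3)
        if (nic in down) {
            printf "%s %s down for %ds\n", at[nic], nic, secs($3) - down[nic]
            delete down[nic]
        }
    }' "$@"
}
```

For example, `mpathd_outages /var/adm/messages` prints one line per outage,
which makes it easier to line the events up against switch logs or cron jobs.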
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:41:45 EDT