Cluster routing and LAG question

From: lawries@btinternet.com
Date: Mon Nov 03 2003 - 11:56:30 EST


Admins,

I have a three node cluster Tru64 5.1B PK3 and SRM 6.4 on all.
One server has two, two port NICS and the others, two single
port NICS.

All network ports are set up using LAG and the virtual device name in each case is lag0. The LAN cables are split evenly
between two cisco switches. Routing is default RIP.

The are three cluster alias' (in addition to the cluster default)
set up for failover, not load balanceing. i.e selp of 10 on one node and 1 on the other (only two nodes offer each alias)

Today, the lead node crashed and the cluster stopped offering all
alias addresses. Has anyone experienced this?

I am no network expert and would appreciate any advice on where to look and how to dianose this problem.

Below are the routing tables and the cpoious output of evmwatch.

node1:root > netstat -r
Routing tables
Destination Gateway Flags Refs Use Interface

Route Tree for Protocol Family 2:
default xxx.xxx.xxx.254 UGS 3 26797 lag0
xxx.xxx.xxx node1 U 6 2573 lag0
hart7 localhost UH 0 3 lo0
hart8 localhost UH 0 4 lo0
hart9 localhost UH 0 3 lo0
ld-hart-cluster localhost UH 1 142 lo0
node1 node1 UHL 10 1692 lag0
loop localhost UR 0 0 lo0
localhost localhost UHL 3 688 lo0
192.168.0 node1-ics0 U 3 13339 ics0
node1-ics0 node1-ics0 UHL 2 8347 ics0
node1:root > evmwatch -A
NIFF: node node1.org.zone has declared a connectivity alert with network xxx.xxx.xxx.0 via interface lag0
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
NIFF: node node1.org.zone has declared a connectivity alert with network xxx.xxx.xxx.0 via interface lag0
NIFF: node node1.org.zone has declared a connectivity alert with network xxx.xxx.xxx.0 via interface lag0
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
NIFF: node node1.org.zone has declared a connectivity alert with network xxx.xxx.xxx.0 via interface lag0
NIFF: node node1.org.zone has declared a connectivity alert with network xxx.xxx.xxx.0 via interface lag0
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-03
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
vmunix: arp: local IP address xxx.xxx.xxx.7 in use by hardware address AA-01-0A-64-A3-04
NIFF: node node1.org.zone has declared a connectivity alert with network xxx.xxx.xxx.0 via interface lag0

Thanks for any help

Lawrie



This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:49:42 EDT