Re: hdisks missing

From: Green, Simon (Simon.Green@EU.ALTRIA.COM)
Date: Fri Apr 25 2003 - 05:35:40 EDT


I guess from your post that you have a resource group running on each node,
in mutual takeover. So some disks are used by one node, some by the other
when everything's running normally. At the moment, each node is OK - taken
in isolation - so the actual disk drives must be working.

I can't really think of anything which would definitely cause the sort of
problem you're seeing, but here are a few things to check: maybe one of them
will suggest something to you.

What sort of SSA drawer is it? If it's a 7133-020 or D40, how is it
caballed and how are the bypass cards set?

What does SSA Link Verification tell you? (From the diagnostic Service
Aids.) Run "maymap" if you have it. Although you have not made any
deliberate changes to the SSA loop it's possible that the cables were
disconnected in order to gain access to the node for the upgrade. Are you
certain everything got put back in the right place?

Do you still have all of the volume groups defined on both systems? (If
you've been deleting and re-defining disks, you'll probably need to export
and re-import some of these.)

What are the microcode levels of the adapters? Make sure that they're both
the same.

Did you re-boot the two nodes simultaneously? I have had problems -
particularly with old MCA nodes using Enhanced 4-port Adapters - that if two
nodes in the same loop try to configure their SSA devices at the same time
strange things can happen, including devices going missing. Always stagger
a reboot - even if it's only by half a minute or so.

I think I'd want to shutdown both nodes, then reboot just one of them and
examine the SSA devices BEFORE re-starting HACMP. If you have HACMP
starting automatically, disable that temporarily. Once one node is OK, boot
the second one. Only when both nodes' SSA config is OK should you start
HACMP.

Simon Green
Altria ITSC Europe s.a.r.l.

AIX-L Archive at http://marc.theaimsgroup.com/?l=aix-l&r=1&w=2
AIX FAQ at http://www.faqs.org/faqs/aix-faq/

N.B. Unsolicited email from vendors will not be appreciated.

> -----Original Message-----
> From: Klaus Oberle
> Sent: 24 April 2003 12:01
> To: aix-l@Princeton.EDU
> Subject: hdisks missing
>
>
> Hi *,
>
> I have a HACMP-Cluster consisting of two old SP Highnodes
> (AIX4.3.3 - ML
> 08) which shares one SSA-Drawer. Recently they were both
> being upgrated by
> adding additional procs and memory from other obsolete
> Highnodes. After the
> upgrade, both machines came up and the cluster applications runs fine.
> Problem is, "lspv" on both nodes only lists hdisks which
> belongs to the
> active VG of that node - hdisks form the other node are no
> longer there. On
> the other hand, every node can see beside its own pdisks the
> pdisks that
> belongs to the other node. (ok - cabling or something else
> wasn't changed
> during the hardware upgrade).
>
> To get the missed hdisks back (for properly failover), i
> removed it first
> (rmdev -dl hdiskX ..) and ran "cfgmgr" without success. The
> hdisks still
> remain lost. Any hints how to solve this???



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 22:16:46 EDT