DS10 missing devices in OS, but not SRM

From: Adam Preset (preset@isc.upenn.edu)
Date: Mon Nov 15 2004 - 14:56:35 EST


Howdy,

We moved a DS10/466Mhz with two blue external shelves from one data center
to another. Upon boot in the new location, four disks seem to be
unavailable from the OS, although they are visible in the firmware.

>>>show dev
dka0.0.0.2000.0 DKA0 RZ2DD-LS 0306
dka100.1.0.2000.0 DKA100 RZ2DD-LS 0306
dka200.2.0.2000.0 DKA200 RZ2ED-LS 0306
dka300.3.0.2000.0 DKA300 RZ2ED-LS 0306
dkb0.0.0.2001.0 DKB0 RZ2DD-LS 0306
dkb100.1.0.2001.0 DKB100 RZ2DD-LS 0306
dkb200.2.0.2001.0 DKB200 RZ2ED-LS 0306
dkb300.3.0.2001.0 DKB300 RZ2ED-LS 0306
dqa0.0.0.13.0 DQA0 TOSHIBA CD-ROM XM-6302B 1017
dva0.0.0.0.0 DVA0
ewa0.0.0.9.0 EWA0 08-00-2B-86-1E-D0
ewb0.0.0.11.0 EWB0 08-00-2B-86-1E-C6
ewc0.0.0.2002.0 EWC0 00-06-2B-00-17-D5
pka0.7.0.2000.0 PKA0 SCSI Bus ID 7
pkb0.7.0.2001.0 PKB0 SCSI Bus ID 7

Here was the first clue upon boot that something wasn't quite right:

dsfmgr: NOTE: updating kernel basenames for system at /
    scp kevm tty00 tty01 lp0 dsk0 dsk1 dsk4 dsk5 floppy0 cdrom0
starting LSM in boot mode
lsm:vold: WARNING: Disk disk02 in group rootdg: Disk device not found
lsm:vold: WARNING: Disk disk03 in group rootdg: Disk device not found
lsm:vold: WARNING: Disk disk06 in group rootdg: Disk device not found
lsm:vold: WARNING: Disk disk07 in group rootdg: Disk device not found

Our data volume is no longer accessible:

$ bcheckrc
...
exec: /sbin/mount_advfs -F 0x4010 d_data#www_web /usr/local/ftp
Error: /dev/vol/datavol is an invalid device or cannot be opened.

More detail on the LSM volume called datavol, which contains the four
missing devices:

# volprint -h datavol
Disk group: rootdg

TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0
v datavol fsgen DISABLED 71121936 - ACTIVE - -
pl datavol-01 datavol DISABLED 71121936 - NODEVICE - -
sd d2 datavol-01 DISABLED 35560968 0 NODEVICE - -
sd d6 datavol-01 DISABLED 35560968 35560968 NODEVICE - -
pl datavol-02 datavol DISABLED 71121936 - NODEVICE - -
sd d3 datavol-02 DISABLED 35560968 0 NODEVICE - -
sd d7 datavol-02 DISABLED 35560968 35560968 NODEVICE - -
pl datavol-03 datavol DISABLED LOGONLY - ACTIVE - -
sd dsk0d-02 datavol-03 ENABLED 1170 LOG - - -

And at the moment, we can only see four of the disks from the OS. Those
four happen to have LSM mirrored copies of /, /usr, /var, swap, /usr/local,
and /home. As you can see from "show dev" above, the SRM does see eight
devices.

# hwmgr -view devices
 HWID: Device Name Mfg Model Location
 ------------------------------------------------------------------------------
    3: scp (unknown) (unknown)
    4: /dev/kevm
   28: /dev/disk/floppy0c 3.5in floppy fdi0-unit-0
   46: /dev/disk/cdrom0c TOSHIBA CD-ROM XM-6302B bus-0-targ-0-lun-0
   47: /dev/disk/dsk0c DEC RZ2DD-LS (C) DEC bus-2-targ-0-lun-0
   48: /dev/disk/dsk1c DEC RZ2DD-LS (C) DEC bus-2-targ-1-lun-0
   51: /dev/disk/dsk4c DEC RZ2DD-LS (C) DEC bus-3-targ-0-lun-0
   52: /dev/disk/dsk5c DEC RZ2DD-LS (C) DEC bus-3-targ-1-lun-0

# scu show edt

CAM Equipment Device Table (EDT) Information:

    Bus/Target/Lun Device Type ANSI Vendor ID Product ID Revision N/W
    -------------- ----------- ------ --------- ---------------- -------- ---
     0 0 0 CD-ROM SCSI-2 TOSHIBA CD-ROM XM-6302B 1017 N
     2 0 0 Direct SCSI-2 DEC RZ2DD-LS (C) DEC 0306 W
     2 1 0 Direct SCSI-2 DEC RZ2DD-LS (C) DEC 0306 W
     3 0 0 Direct SCSI-2 DEC RZ2DD-LS (C) DEC 0306 W
     3 1 0 Direct SCSI-2 DEC RZ2DD-LS (C) DEC 0306 W

Unfortunately, it looks like four drives are still missing.

Here's what I've tried so far:

        a) Replaced all cables. Twice.
        b) Replaced internal dual-port KZPCM card.
        c) Reseated all drives.
        d) Run "init" from SRM.
        e) Removed device files for devices that were missing and run
        dsfmgr -K. They didn't come back. (Restored from tape.)
        f) Booted generic kernel. Devices not visible.

Would anyone happen to have an idea for how I might clear up this
confusion between SRM and Tru64 about what devices are available?

Many thanks. I will summarize.

Adam

-- 
Adam Preset <preset@isc.upenn.edu>
ISC Networking & Telecommunications
University of Pennsylvania


This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:50:11 EDT