Replacing mirrored Sun system disk

From: Clive Elsum (Clive.Elsum@CSIRO.AU)
Date: Tue Jul 09 2002 - 19:30:21 EDT


Hi,

I am having trouble finding a definitive procedure for replacing a mirrored
system disk on our Sun 420R.

We are running Solaris 8 on a Sun 420R with two 18 GB disks mirrored via
DiskSuite 2.4.1. The second disk (c0t1d0) is showing errors and needs to be
replaced. The problem is that I keep getting conflicting information on the
correct procedure. Sun gave me short shrift, saying only to use metaoffline,
metaonline and metareplace:

1 - Use the command metaoffline <mirror name> to offline the secondary
    submirror.
2 - Shut down, replace the faulty disk, and run devfsadm or drvconfig ; disks
3 - Bring the system up and run the command metaonline <mirror name>
4 - When the disks are synced, run the command metareplace -e
The mirror will then eventually recover.

This does not seem correct to me: I would expect the submirror to be brought
back online at boot anyway, and a boot -r would reconfigure the disks. There
is also no mention of formatting the new disk.

Other material I have looked at says to metadetach, then replace the faulty
disk (some say powered down, others say online), format the new disk to match
the prtvtoc output of the failed disk, then metattach, then metareplace.

I really need a definitive method of attack that will work.

Given the md.tab file is:
# Mirror for /
#
d0 -m d1
d1 1 1 /dev/dsk/c0t0d0s0
d2 1 1 /dev/dsk/c0t1d0s0
#
#
# Mirror for swap
#
d10 -m d11
d11 1 1 /dev/dsk/c0t0d0s1
d12 1 1 /dev/dsk/c0t1d0s1
#
#
# Mirror for /usr/local
#
d20 -m d21
d21 1 1 /dev/dsk/c0t0d0s3
d22 1 1 /dev/dsk/c0t1d0s3
#
#
# Mirror for /it
#
d30 -m d31
d31 1 1 /dev/dsk/c0t0d0s4
d32 1 1 /dev/dsk/c0t1d0s4

Would the correct procedure be:

metaoffline d0 d2
metaoffline d10 d12
metaoffline d20 d22
metaoffline d30 d32
Replace disk "hot swap" NO POWER OFF
Format the disk as per prtvtoc of old disk
metaonline d0 d2
metaonline d10 d12
metaonline d20 d22
metaonline d30 d32
metareplace -e d2 c0t1d0s0
metareplace -e d12 c0t1d0s1
metareplace -e d22 c0t1d0s3
metareplace -e d32 c0t1d0s4

Or do I replace metaoffline with metadetach and metaonline with metattach,
and if so, are there any other steps I am missing?
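
As a cross-check on the submirror names used in the commands above, the
md.tab can be scanned for every submirror that sits on a given disk. The
following is a minimal sketch, not a DiskSuite tool: list_submirrors is a
hypothetical helper name, and it assumes that a submirror not named on a
"-m" line belongs to the most recent mirror line above it (which is how the
md.tab above is laid out).

```shell
#!/bin/sh
# Sketch: list the submirrors in md.tab-style input that sit on one disk.
# list_submirrors is a hypothetical helper, not a DiskSuite command.
# Reads md.tab text on stdin; prints "<mirror> <submirror> <slice>" per match.
# Assumption: awk supports -v (on Solaris 8 this likely means nawk or
# /usr/xpg4/bin/awk rather than /usr/bin/awk).
list_submirrors() {
    # $1 = disk name without slice, e.g. c0t1d0
    awk -v disk="$1" '
        /^[ \t]*#/ || NF == 0 { next }        # skip comments and blank lines
        $2 == "-m" {                          # mirror line, e.g. "d0 -m d1"
            for (i = 3; i <= NF; i++) parent[$i] = $1
            last = $1                         # remember most recent mirror
            next
        }
        NF >= 4 {                             # e.g. "d2 1 1 /dev/dsk/c0t1d0s0"
            slice = $NF
            sub(/^.*\//, "", slice)           # drop the /dev/dsk/ prefix
            if (index(slice, disk "s") == 1) {
                # Assumed fallback: unlisted submirrors belong to the
                # mirror defined most recently above them.
                m = ($1 in parent) ? parent[$1] : last
                print m, $1, slice
            }
        }
    '
}
```

Run as "list_submirrors c0t1d0 < md.tab" against the md.tab above, this should
print d0 d2 c0t1d0s0, d10 d12 c0t1d0s1, d20 d22 c0t1d0s3 and
d30 d32 c0t1d0s4, one per line.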

Many thanks in advance

Clive

Output info shows:

# iostat -E
sd0 Soft Errors: 48 Hard Errors: 0 Transport Errors: 0
Vendor: IBM Product: DDYST1835SUN18G Revision: S96H Serial No: 157444
Size: 18.11GB <18110967808 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 48 Predictive Failure Analysis: 0
sd1 Soft Errors: 48 Hard Errors: 35 Transport Errors: 16
Vendor: IBM Product: DDYST1835SUN18G Revision: S96H Serial No: 10K705
Size: 18.11GB <18110967808 bytes>
Media Error: 30 Device Not Ready: 0 No Device: 5 Recoverable: 0
Illegal Request: 48 Predictive Failure Analysis: 0
sd6 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: TOSHIBA Product: DVD-ROM SD-M1401 Revision: 1007 Serial No: 06/22/00
Size: 18446744073.71GB <-1 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd30 Soft Errors: 1 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: OPENstorage 9176 Revision: 0401 Serial No: 1T03310196
Size: 365.06GB <365061079040 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 1 Predictive Failure Analysis: 0
sd46 Soft Errors: 0 Hard Errors: 1 Transport Errors: 0
Vendor: STK Product: OPENstorage 9176 Revision: 0401 Serial No: 1T02811801
Size: 365.06GB <365061079040 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 1 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd68 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: OPENstorage 9176 Revision: 0401 Serial No: 1T03310196
Size: 220.09GB <220091908096 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd74 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: OPENstorage 9176 Revision: 0401 Serial No: 1T02811801
Size: 220.09GB <220091908096 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd330 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: Universal Xport Revision: 0401 Serial No: 1T03310196
Size: 0.02GB <18874368 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd474 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: Universal Xport Revision: 0401 Serial No: 1T02811801
Size: 0.02GB <18874368 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
st15 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: 9840 Revision: 1.30 Serial No: .109
st16 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: 9840 Revision: 1.30 Serial No: .109
st17 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: 9840 Revision: 1.30 Serial No: .109
st18 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: T9940A Revision: 1.30 Serial No: .210
st19 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: T9940A Revision: 1.30 Serial No: .210
st20 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: STK Product: T9940A Revision: 1.30 Serial No: .210
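
In a listing this long it is easy to miss which devices actually report
trouble. The following is a minimal sketch of a filter, assuming the
"Soft Errors / Hard Errors / Transport Errors" header lines keep the field
layout shown above; report_errors is a hypothetical helper name, not a
Solaris command.

```shell
#!/bin/sh
# Sketch: show only devices with hard or transport errors in `iostat -E` output.
# report_errors is a hypothetical helper; it reads the listing on stdin.
report_errors() {
    awk '
        # Header line per device:
        #   sd1 Soft Errors: 48 Hard Errors: 35 Transport Errors: 16
        $2 == "Soft" && $3 == "Errors:" {
            hard = $7 + 0
            transport = $10 + 0
            if (hard > 0 || transport > 0)
                printf "%s hard=%d transport=%d\n", $1, hard, transport
        }
    '
}
```

Run as "iostat -E | report_errors". Against the output above it should report
sd1 (hard=35 transport=16) and sd46 (hard=1 transport=0).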

# metastat
d0: Mirror
    Submirror 0: d1
      State: Needs maintenance
    Submirror 1: d2
      State: Needs maintenance
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 16779432 blocks

d1: Submirror of d0
    State: Needs maintenance
    Invoke: metareplace d0 c0t0d0s0 <new device>
    Size: 16779432 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t0d0s0                   0     No    Maintenance

d2: Submirror of d0
    State: Needs maintenance
    Invoke: after replacing "Maintenance" components:
                metareplace d0 c0t1d0s0 <new device>
    Size: 16779432 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t1d0s0                   0     No    Last Erred

d10: Mirror
    Submirror 0: d11
      State: Okay
    Submirror 1: d12
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 4198392 blocks

d11: Submirror of d10
    State: Okay
    Size: 4198392 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t0d0s1                   0     No    Okay

d12: Submirror of d10
    State: Okay
    Size: 4198392 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t1d0s1                   0     No    Okay

d20: Mirror
    Submirror 0: d21
      State: Okay
    Submirror 1: d22
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 8392072 blocks

d21: Submirror of d20
    State: Okay
    Size: 8392072 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t0d0s3                   0     No    Okay

d22: Submirror of d20
    State: Okay
    Size: 8392072 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t1d0s3                   0     No    Okay

d30: Mirror
    Submirror 0: d31
      State: Okay
    Submirror 1: d32
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 5955968 blocks

d31: Submirror of d30
    State: Okay
    Size: 5955968 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t0d0s4                   0     No    Okay

d32: Submirror of d30
    State: Okay
    Size: 5955968 blocks
    Stripe 0:
        Device              Start Block  Dbase State        Hot Spare
        c0t1d0s4                   0     No    Okay
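
To see at a glance which components are not healthy, the metastat listing
can be reduced to the component lines whose state is anything other than
Okay. This is a minimal sketch under two assumptions: component names look
like cNtNdNsN, and no hot spares are in use (so everything after the Dbase
column is the state). bad_components is a hypothetical helper name, not a
DiskSuite command.

```shell
#!/bin/sh
# Sketch: list metastat components whose state is not "Okay".
# bad_components is a hypothetical helper; it reads `metastat` output on stdin.
# Prints "<metadevice> <component> <state>" per unhealthy component.
bad_components() {
    awk '
        # Unindented "dN:" lines name the metadevice that follows.
        /^d[0-9]+:/ { md = $1; sub(/:$/, "", md); next }

        # Component rows: Device, Start Block, Dbase, State (one or two words).
        $1 ~ /^c[0-9]+t[0-9]+d[0-9]+s[0-9]+$/ {
            state = $4
            for (i = 5; i <= NF; i++) state = state " " $i
            if (state != "Okay") print md, $1, state
        }
    '
}
```

Run as "metastat | bad_components". For the listing above it should print
d1 c0t0d0s0 Maintenance and d2 c0t1d0s0 Last Erred, that is, one component
on each disk for the d0 mirror.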

# prtvtoc /dev/rdsk/c0t1d0s0
* /dev/rdsk/c0t1d0s0 partition map
*
* Dimensions:
* 512 bytes/sector
* 248 sectors/track
* 19 tracks/cylinder
* 4712 sectors/cylinder
* 7508 cylinders
* 7506 accessible cylinders
*
* Flags:
* 1: unmountable
* 10: read-only
*
*                          First     Sector    Last
* Partition  Tag  Flags    Sector     Count    Sector  Mount Directory
       0      2    00          0  16779432  16779431
       1      3    01   16779432   4198392  20977823
       2      5    00          0  35368272  35368271
       3      4    00   20977824   8392072  29369895
       4      0    00   29369896   5955968  35325863
       7      0    00   35325864     42408  35368271
#
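
Since the plan is to label the replacement disk to match this table, one way
to confirm the result afterwards is to compare two saved prtvtoc listings
while ignoring the comment lines and column spacing. This is a minimal
sketch only; vtoc_rows and same_vtoc are hypothetical helper names, and
old.vtoc and new.vtoc are assumed file names for listings saved before and
after the swap.

```shell
#!/bin/sh
# Sketch: compare two saved `prtvtoc` listings, ignoring comments and spacing.
# vtoc_rows and same_vtoc are hypothetical helpers, not Solaris commands.

# Print only the partition rows, normalised to single-space separation:
# partition, tag, flags, first sector, sector count, last sector.
vtoc_rows() {
    awk '!/^\*/ && NF >= 6 { print $1, $2, $3, $4, $5, $6 }' "$1"
}

# Exit status 0 when both files describe the same slices, non-zero otherwise.
same_vtoc() {
    [ "$(vtoc_rows "$1")" = "$(vtoc_rows "$2")" ]
}

# Example use (file names are assumptions):
#   prtvtoc /dev/rdsk/c0t1d0s0 > old.vtoc    # before the swap
#   prtvtoc /dev/rdsk/c0t1d0s0 > new.vtoc    # after labelling the new disk
#   same_vtoc old.vtoc new.vtoc && echo "labels match"
```

Only the six numeric columns are compared, so a difference in the Mount
Directory column would not be flagged.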

Thanks in advance

Clive
---------------------------------------------------------------------
Clive Elsum BAppSc, RHCE
Systems Engineer - Information Technology Group
CSIRO Atmospheric Research
PMB 1, Aspendale, Victoria, Australia 3195
Phone : (+61 3) 9239 4509
Fax: (+61 3) 9239 4444
E-mail Clive.Elsum@csiro.au
---------------------------------------------------------------------