problems adding a metadb back on a slice (long post)

From: Jose Vicente Nunez Zuleta (josevnz@newbreak.com)
Date: Thu Aug 08 2002 - 11:51:05 EDT


Greetings,

We have a E250 Sparc running Solaris 2.6 (SunOS lenbkx0005 5.7 Generic_106541-19 sun4u sparc SUNW,Ultra-250, latests patches applied) with a Raid1 configuration. On a crontab we have an script that runs every day and checks for errors and today we found the following (from the metadb -i command):

  F M p unknown unknown /dev/dsk/c0t0d0s3
     a m p luo 1050 1034 /dev/dsk/c0t0d0s3
     a p luo 2084 1034 /dev/dsk/c0t0d0s3
     a p luo 16 1034 /dev/dsk/c0t9d0s7
     a p luo 1050 1034 /dev/dsk/c0t9d0s7
     a p luo 2084 1034 /dev/dsk/c0t9d0s7
     a p luo 3118 1034 /dev/dsk/c0t9d0s7
     a p luo 4152 1034 /dev/dsk/c0t9d0s7
     a p luo 5186 1034 /dev/dsk/c0t9d0s7
     a p luo 6220 1034 /dev/dsk/c0t9d0s7
     a p luo 7254 1034 /dev/dsk/c0t9d0s7
     a p luo 8288 1034 /dev/dsk/c0t9d0s7
     a p luo 9322 1034 /dev/dsk/c0t9d0s7
     a p luo 10356 1034 /dev/dsk/c0t9d0s7
     a p luo 11390 1034 /dev/dsk/c0t9d0s7
     a p luo 12424 1034 /dev/dsk/c0t9d0s7
     a p luo 13458 1034 /dev/dsk/c0t9d0s7
     a p luo 14492 1034 /dev/dsk/c0t9d0s7
 o - replica active prior to last mddb configuration change
 u - replica is up to date
 l - locator for this replica was read successfully
 c - replica's location was in /etc/opt/SUNWmd/mddb.cf
 p - replica's location was patched in kernel
 m - replica is master, this is replica selected as input
 W - replica has device write errors
 a - replica is active, commits are occurring to this replica
 M - replica had problem with master blocks
 D - replica had problem with data blocks
 F - replica had format problems
 S - replica is too small to hold current data base
 R - replica had device read errors

The Sun documentation recomends to erase and add any metadabase that has problems in order to fix any corruption problem:

metadb -d /dev/dsk/c0t0d0s3

That took away the corrupted metadb but now i got the following error trying to add the db back to the device:

bash-2.05# metadb -a -f /dev/dsk/c0t0d0s3
metadb: lenbkx0005: c0t0d0s3: is swapped on

Here is the output from the metadb -i command:

bash-2.05# metadb -i
        flags first blk block count
     a p luo 16 1034 /dev/dsk/c0t9d0s7
     a p luo 1050 1034 /dev/dsk/c0t9d0s7
     a p luo 2084 1034 /dev/dsk/c0t9d0s7
     a p luo 3118 1034 /dev/dsk/c0t9d0s7
     a p luo 4152 1034 /dev/dsk/c0t9d0s7
     a p luo 5186 1034 /dev/dsk/c0t9d0s7
     a p luo 6220 1034 /dev/dsk/c0t9d0s7
     a p luo 7254 1034 /dev/dsk/c0t9d0s7
     a p luo 8288 1034 /dev/dsk/c0t9d0s7
     a p luo 9322 1034 /dev/dsk/c0t9d0s7
     a p luo 10356 1034 /dev/dsk/c0t9d0s7
     a p luo 11390 1034 /dev/dsk/c0t9d0s7
     a p luo 12424 1034 /dev/dsk/c0t9d0s7
     a p luo 13458 1034 /dev/dsk/c0t9d0s7
     a p luo 14492 1034 /dev/dsk/c0t9d0s7
 o - replica active prior to last mddb configuration change
 u - replica is up to date
 l - locator for this replica was read successfully
 c - replica's location was in /etc/opt/SUNWmd/mddb.cf
 p - replica's location was patched in kernel
 m - replica is master, this is replica selected as input
 W - replica has device write errors
 a - replica is active, commits are occurring to this replica
 M - replica had problem with master blocks
 D - replica had problem with data blocks
 F - replica had format problems
 S - replica is too small to hold current data base
 R - replica had device read errors

And here is the output for the metastat command:

d13: Mirror
    Submirror 0: d4
      State: Okay
    Submirror 1: d0
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 1027216 blocks

d4: Submirror of d13
    State: Okay
    Size: 1027216 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t0d0s0 0 No Okay

d0: Submirror of d13
    State: Okay
    Size: 1104096 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t9d0s0 0 No Okay

d14: Mirror
    Submirror 0: d3
      State: Okay
    Submirror 1: d2
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 25119672 blocks

d3: Submirror of d14
    State: Okay
    Size: 25119672 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t0d0s7 0 No Okay

d2: Submirror of d14
    State: Okay
    Size: 25170336 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t9d0s5 0 No Okay

d15: Mirror
    Submirror 0: d5
      State: Okay
    Submirror 1: d1
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 6144448 blocks

d5: Submirror of d15
    State: Okay
    Size: 6144448 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t0d0s6 0 No Okay

d1: Submirror of d15
    State: Okay
    Size: 6293856 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t9d0s4 0 No Okay

d16: Mirror
    Submirror 0: d7
      State: Okay
    Submirror 1: d8
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 75501088 blocks

d7: Submirror of d16
    State: Okay
    Size: 75501088 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s0 0 No Okay

d8: Submirror of d16
    State: Okay
    Size: 75501088 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s0 0 No Okay

d17: Mirror
    Submirror 0: d12
      State: Okay
    Submirror 1: d11
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 4195584 blocks

d12: Submirror of d17
    State: Okay
    Size: 4195584 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s3 0 No Okay

d11: Submirror of d17
    State: Okay
    Size: 4195584 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s3 0 No Okay

d18: Mirror
    Submirror 0: d10
      State: Okay
    Submirror 1: d9
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 16777480 blocks

d10: Submirror of d18
    State: Okay
    Size: 16777480 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s1 0 No Okay

d9: Submirror of d18
    State: Okay
    Size: 16777480 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s1 0 No Okay

d21: Mirror
    Submirror 0: d19
      State: Okay
    Submirror 1: d20
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 4195584 blocks

d19: Submirror of d21
    State: Okay
    Size: 4195584 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s4 0 No Okay

d20: Submirror of d21
    State: Okay
    Size: 4195584 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s4 0 No Okay

d24: Mirror
    Submirror 0: d22
      State: Okay
    Submirror 1: d23
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 1024616 blocks

d22: Submirror of d24
    State: Okay
    Size: 1024616 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s5 0 No Okay

d23: Submirror of d24
    State: Okay
    Size: 1024616 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s5 0 No Okay

d27: Mirror
    Submirror 0: d25
      State: Okay
    Submirror 1: d26
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 8391168 blocks

d25: Submirror of d27
    State: Okay
    Size: 8391168 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s6 0 No Okay

d26: Submirror of d27
    State: Okay
    Size: 8391168 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s6 0 No Okay

d30: Mirror
    Submirror 0: d28
      State: Okay
    Submirror 1: d29
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 31462024 blocks

d28: Submirror of d30
    State: Okay
    Size: 31462024 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t8d0s7 0 No Okay

d29: Submirror of d30
    State: Okay
    Size: 31462024 blocks
    Stripe 0:
        Device Start Block Dbase State Hot Spare
        c0t11d0s7 0 No Okay

d6: Concat/Stripe
    Size: 1027216 blocks
    Stripe 0:
        Device Start Block Dbase
        c0t0d0s1 0 No

Also if i try to start the metatool program it get stuck detecting the md devices:

bash-2.05# metatool
Warning: Cannot convert string "lucidasans-bold12" to type FontStruct
Warning: Cannot convert string "lucidasans8" to type FontStruct
Warning: Cannot convert string "lucidasans-typewriter12" to type FontStruct
Warning: Cannot convert string "lucidasans12" to type FontStruct
Initializing metatool... Done.
Discovering drives and slices... ^Cmetatool: Interrupt

I searched for previous posts / the documentation but saw no information about this. So far the system is working fine but i would like to have the level of redundancy i got before.

Any help will be greathly appreciated.

JV
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:24:44 EDT