Addendium: Summary KZPSC-BA Raid Controller Panic

From: Ron Bramblett (bramblet@fuller.com)
Date: Mon Aug 23 2004 - 09:24:33 EDT


Hello,

I had a message in the errorlog.

Aug 20 04:56:12 alfred vmunix: AdvFS I/O error:
Aug 20 04:56:12 alfred vmunix: Volume: /dev/re0c
Aug 20 04:56:12 alfred vmunix: Tag: 0xfffffff7.0000
Aug 20 04:56:12 alfred vmunix: Page: 237
Aug 20 04:56:12 alfred vmunix: Block: 10432
Aug 20 04:56:12 alfred vmunix: Block count: 16
Aug 20 04:56:12 alfred vmunix: Type of operation: Write
Aug 20 04:56:12 alfred vmunix: Error: 5
Aug 20 04:56:12 alfred vmunix: EEI: 0x0
Aug 20 04:56:13 alfred vmunix: AdvFS initiated retries: 0
Aug 20 04:56:13 alfred vmunix: Seconds from first I/O attempt to this
failure: 0
Aug 20 04:56:13 alfred vmunix: Total AdvFS retries on this volume: 0
Aug 20 04:56:13 alfred vmunix:
Aug 20 04:56:13 alfred vmunix: bs_osf_complete: metadata write failed
Aug 20 04:56:13 alfred vmunix: AdvFS Domain Panic; Domain sigma_dmn Id
0x3bbfe8cb.00082940
Aug 20 04:56:13 alfred vmunix: An AdvFS domain panic has occurred due to
either a metadata write error or an internal inconsistency. This domain is
being rendered inaccessible.

Basically I had to do the following to correct it.

I tried to copy the configuration from floppy disk to the raid controller

I had T/S come out and reseat the controller card

Next I tried to copy the config again but it did not like it so the card was
reseated.

I blew away the domain on this raid set and restored the data.

I wasn't completely happy so I went back in swxcrmgr to see if my
configuration was corrupt. When I did that a hard drive said it was bad.

I replaced the hard drive and rebuilt the data and it seems to work.

When I came in this morning I looked at the logs and in the kern.log shows
this

Aug 22 03:03:13 alfred vmunix: xcr0 at pci0 slot 8
Aug 22 03:03:13 alfred vmunix: re0 at xcr0 unit 0 (unit status = CRITICAL,
raid level = 5)
Aug 22 03:03:13 alfred vmunix: (WRITE BACK cache operation SUPPORTED if
battery backup enabled)

What is going on? The computer reboots at 3:00am on Sunday so this was from
the reboot log then.
When I replaced the hard drive it gave me a ONLINE status

-- 
Ron Bramblett
Fuller Brush Company
Systems Adminstrator
Almost 100 years strong and still going ...


This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:50:06 EDT