Nobody can log in to our production server

From: Marques, Virginia (virginia.marques@eds.com)
Date: Thu Jun 26 2003 - 09:35:11 EDT


This morning someone ran rm * in the / partition as root and then logged off
before telling us what he had done. After that, nobody could log in to the
server.
The cause was that nobody had access to ksh any more, because the rm had
removed this link from the / partition:

#pwd
/
#ls -l bin
lrwxrwxrwx 1 root root 9 Jun 26 09:35 bin -> ./usr/bin

and every user in our /etc/passwd, root included, has /bin/ksh as the login
shell.
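
A quick way to see which accounts are affected by a missing shell is to walk
the passwd file and test each login shell. This is only a minimal sketch in
POSIX shell: check_login_shells is a hypothetical helper name, and the optional
second argument (a root directory such as /a when booted from CD-ROM) is an
assumption for illustration.

```shell
# Report accounts whose login shell (7th passwd field) cannot be executed.
# check_login_shells is a hypothetical helper, not a system command.
#   $1 = passwd file (default /etc/passwd)
#   $2 = optional root directory prefix, e.g. /a for a mounted root disk
check_login_shells() {
    passwd_file=${1:-/etc/passwd}
    root_dir=${2:-}
    while IFS=: read -r name _pw _uid _gid _gecos _home shell; do
        # An empty shell field means the system default; skip it here.
        [ -n "$shell" ] || continue
        # -x follows symbolic links, so a removed /bin link is detected too.
        if [ ! -x "$root_dir$shell" ]; then
            echo "$name: $shell is missing or not executable"
        fi
    done < "$passwd_file"
}
```

For example, check_login_shells /a/etc/passwd /a checks the disk mounted on /a,
while check_login_shells with no arguments checks the running system.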

We decided to ask the customer to shut down the server so that we could boot
from CD-ROM, recreate the links and boot again. The main complication is
that we use VxVM with encapsulated boot disk(s), so we had to:

- Shut down (Stop-A)
- ok> boot cdrom -sw
- Mount the / partition:
        # mount -F ufs /dev/dsk/c0t0d0s0 /a
- Recreate the removed links:
        # cd /a
        # ln -s ./usr/bin bin
        # ln -s ./usr/lib lib
- Now comes the VxVM part: we had to modify the /etc/system and /etc/vfstab
files to tell VxVM not to start. In addition, in the directory
/etc/vx/reconfig.d/state.d we had to:

        # rm root-done
        # touch install-db
        # init 0
                ok> boot disk

- Next step: encapsulate the boot disk with vxinstall and leave the other
disks alone.
- After two more reboots the system was up with VxVM. We then had to mount
all the other partitions, and the customers could begin to work again.
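
The link-recreation step in the list above can be sketched as a small function
that is safe to run more than once. It is only an illustration:
restore_root_links is a hypothetical helper name, and the list of links (bin
and lib, as in the commands above) may need extending for other systems.

```shell
# Recreate the /bin and /lib symbolic links under a mounted root file system.
# restore_root_links is a hypothetical helper; $1 is the mount point (e.g. /a).
restore_root_links() {
    root=${1:?usage: restore_root_links ROOT_DIR}
    for name in bin lib; do
        # -h is true for a symbolic link even when its target is missing.
        if [ -h "$root/$name" ] || [ -e "$root/$name" ]; then
            echo "$name: already present, left alone"
        else
            # Relative target, matching the original "bin -> ./usr/bin" link.
            ( cd "$root" && ln -s "./usr/$name" "$name" ) &&
                echo "$name: recreated"
        fi
    done
}
```

With the root slice mounted as in the steps above, the call would be
restore_root_links /a.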

(We still have to initialise the three other disks we had in rootdg and
mirror the boot disk.)
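
For reference, the /etc/system part of the VxVM step might look like the sketch
below. The exact lines we edited are not listed in this mail, so the two
entries handled here (rootdev:/pseudo/vxio... and
set vxio:vol_rootdev_is_volume...) are an assumption based on what root
encapsulation commonly adds to /etc/system. comment_out_vxvm_root is a
hypothetical helper name. It writes the result to standard output and does not
touch the input file; /etc/vfstab is not covered.

```shell
# Print a copy of an /etc/system file with the VxVM root-volume entries
# commented out ("*" starts a comment in /etc/system).
# comment_out_vxvm_root is a hypothetical helper; the two patterns are
# assumed entries, check them against the real file before using the output.
comment_out_vxvm_root() {
    sed -e 's|^rootdev:/pseudo/vxio|* &|' \
        -e 's|^set vxio:vol_rootdev_is_volume|* &|' \
        "${1:?usage: comment_out_vxvm_root SYSTEM_FILE}"
}
```

One possible use is comment_out_vxvm_root /a/etc/system > /tmp/system.new,
followed by a manual review before copying the result back over
/a/etc/system (after saving the original).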

All of that took 50 minutes. My question (and please excuse the long mail
and the bad English) is: is there a better way to do this that reduces the
time the server is down? I mean, a way to reduce the number of reboots (4 in
this version).

Kind regards,
Virginia