Re: Off Topic-"tar errors and file corruption"

From: Saxon, Lamar (Lamar.Saxon@AMERICREDIT.COM)
Date: Thu Dec 05 2002 - 12:03:03 EST


The common issue I have seen this problem is when the file is still open and
being accessed while the remote tar is running. We do similar task
replicating our DR environment and have seen this message with like the
Oracle listener log while the listener was running.

Might do a fuser or lsof on it to check if it is still open and being
accessed...

Lamar

-----Original Message-----
From: John F Riordan [mailto:jriorda2@CSC.COM]
Sent: Thursday, December 05, 2002 10:56 AM
To: aix-l@Princeton.EDU
Subject: Off Topic-"tar errors and file corruption"

Hi all,
Was wondering if anyone could explain the following problem I had with tar.
I know what happened but am unsure why tar would do what it did.

Scenario:

We have an application using a Universe database, and yesterday several
files were corrupted. After doing some research and testing the files were
corrupted from a backup that was being run at 1:00am.

I was using a mounted file system ( /expbackup ) from one RS/6000 (rs6k01)
to an other RS/6000 ( /impback ) (rs6k02) for an occasional backup.

I was a backing up a directory ( /u1/data )of about 800MB on (rs6k02) to
the mounted directory ( /impback ) which is coming from ( rs6k01). Hope I
am not making this more difficult than it is.

The tar command I am using is the following: " tar cvf - ./u1/data |( cd
/impback ; tar xvf - )"

Everything is fine unless the NFS mount is broken from (rs6k01).. If the (
/expbackup ) becomes unmounted before unmounting it from (rs6k02) the
filesystem still shows as being mounted on (rs6k02) however if you try to
change directories to ( /impback ) you get "invalid directory". That I
understand, as the mount point is lost. Ok, my problem is that when my tar
script is kicked off via cron it still runs. As you look at the output you
would see that the files are being added then extracted. However if you
look at the add directory and then the extract directory they are the same.
It's adding and extracting to the same file system. And every so often I
see the message "tar 0511-173 File ./u1/data/blah is not of expected file
size". It's not every file, but the files that return this message end up
being corrupted.

I have since changed the way that backup is run but I was wondering if
anyone could tell me what is going on when that message is being displayed.

Sorry, so long winded,
and as always Thanks In Advance.
John



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 22:16:23 EDT