Coda File System

client hang

From: Patrick Walsh <pwalsh_at_esoft.com>
Date: Wed, 07 Sep 2005 13:50:27 -0600
	We had a client hang today.  I realized that venus hadn't been
restarted on this machine for some time (close to a month) and that may
have contributed to the problem, but just the same I'd like to avoid the
problem in the future.

	The assert that got triggered is in the SetRcRights function.

	Here's the console excerpt (sorry gdb isn't installed on this machine
so i couldn't get you a backtrace):

Date: Wed 09/07/2005

03:30:02 Volume /httpd busy, waiting
10:39:02 fatal error -- Assertion failed: file
"/home/pwalsh/working/coda/BUILD/coda-6.0.11/coda-src/venus/fso1.cc",
line 827

10:39:04 RecovTerminate: dirty shutdown (1 uncommitted transactions)
Assertion failed: 0, file
"/home/pwalsh/working/coda/BUILD/coda-6.0.11/coda-src/venus/fso1.cc",
line 827
Sleeping forever.  You may use gdb to attach to process 936.


	This is the venus.log file excerpt:

[ T(01) : 432371 : 10:38:59 ] BeginRvmFlush (1, 524344, S)
[ T(01) : 432371 : 10:38:59 ] EndRvmFlush

[ H(07) : 10557 : 10:39:02 ] HDBDaemon just woke up
[ H(07) : 10557 : 10:39:02 ] Hoard Walk interrupted -- object different!
<606e1fc8.7f000004.1d6.b52>
[ H(07) : 10557 : 10:39:02 ] Number of interrupt failures = 1
[ H(07) : 10557 : 10:39:02 ] volent::Enter: observe with proc_key = 0
[ H(07) : 10557 : 10:39:02 ] WAITING(VOL): /scmca, state = Hoarding, [1,
0], counts = [1 0 1 0]
[ H(07) : 10557 : 10:39:02 ] CML= [0, -666], Res = 1
[ H(07) : 10557 : 10:39:02 ] WAITING(VOL): shrd_count = 1, excl_count =
0, excl_pgid = 0
0x63ee2088 : fid = (606e1fc8.7f000004.1.1), comp = scmca, vol = 606dc108
        state = Normal, stat = { 2048, 1297, 1126111141, 0, 0777, 7,
Directory }, rc rights = 0
        VV = {[ 1294 1279 0 0 0 0 0 0 ] [ 0xec0840e 2763 ] [ 0 ]}
        ac rights = { [0 11] [-1 0 00] [-1 0 00] [-1 0 00] [-1 0 00] [-1
0 00] [500 9 11] [504 7f
10] [0 9 10] }
        voltype = [0 1 0], fake = 0, fetching = 0 local = 0
        rep = 0, data = 0, owrite = 0, dirty = 0, shadow = 0 ckmtpt
        mvstat = Root, mtpoint = (606e1fc8.7f000000.2.2)
        parent = (0.0.0.0, 0), children = 17
        priority = 25000 (16646), hoard = [0, -2, 0], lastref =
192112409
        mle_bindings = (0, 0), cleanstat = [-1, -1]
        cachefile = [ 00/00/00/29, 7061548, 0/0 ]
        directory = 0
        refs = [0 1 1], openers = [0 0 0]       lastresolved = 0
*****  VenusPrint  *****

*** BEGIN RealmDB ***
6055c5c8 realm '.htaccess', refcount 0/0
606e1fc8 realm 'director', refcount 8/18
        dir81:2432
        dir82:2432
63f6cf88 realm 'localhost', refcount 1/2
*** END RealmDB ***
Servers: count = 2



	Any ideas on what may have caused this to happen?


-- 
Patrick Walsh
eSoft Incorporated
303.444.1600 x3350
http://www.esoft.com/

Received on 2005-09-07 15:55:03