*BSD News Article 46467


Path: sserve!newshost.anu.edu.au!harbinger.cc.monash.edu.au!simtel!news.kei.com!bloom-beacon.mit.edu!crl.dec.com!decwrl!svc.portal.com!news1.best.com!blob.best.net!not-for-mail
From: dillon@best.com (Matt Dillon)
Newsgroups: comp.unix.bsd.freebsd.misc
Subject: two crash problems, anyone have any ideas? (similar experiences?)
Date: 5 Jul 1995 20:06:33 -0700
Organization: Best Internet Communications, Inc. (info@best.com)
Lines: 65
Distribution: world
Message-ID: <3tfjvp$hbe@blob.best.net>
NNTP-Posting-Host: blob.best.net

Configuration:
    
    Pentium-90, 128M memory, three SCSI disks (Barracudas) on an NCR PCI
    SCSI controller, Etherlink III (ISA) ethernet.  130+ users, load
    averages around 10.

    FreeBSD 2.0.5-RELEASE-BEST (SHELL) #4: Thu Jun 29 01:57:18 PDT 1995
    (with the latest set of patches applied).

Problem #1:

    A heavily loaded machine is running along.  Then, for no apparent
    reason, anything requiring disk I/O comes to a screeching halt.  If I
    happen to have a vmstat running, it keeps going, but attempting to do
    anything else (such as ^C, or running a program from an existing shell
    prompt) blocks forever.

    vmstat shows a large number of processes blocked, virtually none
    running, and disk I/O dropping to zero.
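
    The stall signature described above (many blocked processes, almost
    nothing runnable, no disk transfers) could be checked mechanically.
    The following is only a minimal sketch in modern Python: the Sample
    fields, the thresholds, and the function name are all assumptions
    chosen for illustration, and extracting the numbers from actual
    vmstat output is left out.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    running: int   # runnable processes (assumed to come from vmstat's "r")
    blocked: int   # blocked processes (assumed to come from vmstat's "b")
    disk_ops: int  # total disk transfers per second across all disks


def looks_stalled(samples, min_blocked=20, max_running=1, window=3):
    """Return True if the last `window` samples all match the stall pattern.

    The thresholds are illustrative guesses, not values from the post.
    """
    if len(samples) < window:
        return False
    recent = samples[-window:]
    return all(
        s.blocked >= min_blocked
        and s.running <= max_running
        and s.disk_ops == 0
        for s in recent
    )
```

    Requiring several consecutive matching samples keeps a single quiet
    interval on a healthy machine from being reported as a hang.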

    It is possible that some paging still works: I am able to send a packet
    to my little rebooter daemon (which was swapped out and managed to swap
    itself back in), and it responds that it is calling reboot().  At that
    point the kernel is able to sync its disks but is UNABLE to dump; it
    just freezes solid.
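
    The rebooter daemon itself is not shown in the post.  As a rough idea
    of what such a tool might look like, here is a hypothetical sketch in
    modern Python.  The UDP transport, the port number, the magic payload,
    and the use of the reboot(8) command (rather than a direct reboot()
    call) are all assumptions made for illustration.

```python
import socket
import subprocess

MAGIC = b"REBOOT-NOW"  # hypothetical payload, not taken from the post
PORT = 9999            # hypothetical port number


def system_reboot():
    # Assumption: shell out to reboot(8); the daemon in the post is said
    # to call reboot() itself.
    subprocess.call(["reboot"])


def handle_packet(data):
    """Decide how to answer one datagram.

    Returns (reply_bytes, should_reboot) so the caller can send the
    acknowledgement before the machine goes down.
    """
    if data.strip() != MAGIC:
        return b"ignored\n", False
    return b"calling reboot()\n", True


def serve(port=PORT, reboot=system_reboot):
    """Wait for datagrams and reboot when the magic payload arrives."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind(("", port))
    while True:
        data, peer = sock.recvfrom(512)
        reply, do_reboot = handle_packet(data)
        sock.sendto(reply, peer)  # acknowledge first, then act
        if do_reboot:
            reboot()
            return
```

    Calling serve() starts the loop.  The sketch does no authentication
    at all, so anything real would need at least a shared secret and
    source address filtering.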

    So, unfortunately, it is impossible to get a crash dump out of the 
    situation.  Worse, the machine doesn't panic, and it is impossible
    to reboot it without hitting the hard reset.

    The machine lasts anywhere from 30 minutes to a day and a half before
    crashing and burning.

Problem #2:

    panic: "biodone: page busy < 0".  I haven't been able to track this
    one down yet, partly because there isn't enough space on / to install
    a debug /kernel.  Sigh.  But at least we get a crash dump out of it.


    IdlePTD 1ac000
    current pcb at 192930
    panic: biodone: page busy < 0

    #0  0xf016ab70 in boot ()
    (kgdb) back
    #0  0xf016ab70 in boot ()
    #1  0xf01105c3 in panic ()
    #2  0xf0122e64 in biodone ()
    #3  0xf218593b in end ()
    #4  0xf21846bf in end ()
    #5  0xf219ba96 in end ()
    #6  0xf012ae50 in vn_read ()
    #7  0xf01117c7 in read ()
    #8  0xf016f8c7 in syscall ()
    #9  0xf016846b in Xsyscall ()
    #10 0x805b587 in ?? ()
    #11 0x8059f43 in ?? ()
    #12 0x9037 in ?? ()
    #13 0x758a in ?? ()
    #14 0x3cfe in ?? ()
    #15 0x10e8 in ?? ()
    (kgdb)
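
    Frames #3 to #5 above are reported as end (), which usually means the
    debugger could only match those addresses to the last symbol it knows
    about; they sit well above the other kernel text addresses in the
    trace.  What code actually lives there cannot be determined from the
    post.  A small sketch, in modern Python, for pulling such frames out
    of a saved backtrace follows; the function name and the sample string
    are illustrative assumptions.

```python
import re

# Matches kgdb frame lines such as "#2  0xf0122e64 in biodone ()".
FRAME_RE = re.compile(r"^\s*#(\d+)\s+0x([0-9a-fA-F]+) in (\S+) \(\)", re.MULTILINE)


def parse_backtrace(text):
    """Return a list of (frame_number, address, symbol) tuples."""
    return [
        (int(num), int(addr, 16), sym)
        for num, addr, sym in FRAME_RE.findall(text)
    ]


# A few frames copied from the trace above, for demonstration.
SAMPLE = """\
#1  0xf01105c3 in panic ()
#2  0xf0122e64 in biodone ()
#3  0xf218593b in end ()
#6  0xf012ae50 in vn_read ()
#10 0x805b587 in ?? ()
"""

if __name__ == "__main__":
    for num, addr, sym in parse_backtrace(SAMPLE):
        # Frames resolved to "end" lie past the last known kernel symbol.
        note = "  <- past last known symbol" if sym == "end" else ""
        print(f"#{num:<3} {addr:#010x} {sym}{note}")
```

    With a debug /kernel installed, the same list would show which of
    the remaining frames gain proper names and line information.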