tonster
|
|
M-Net Downtime Status, August 23, 2002
|
Aug 23 20:53 UTC 2002 |
Okay, new item.
M-Net crashed sometime around 2pm today. From what I can tell, we've
either got a corrupted filesystem, or the vinum filesystem is
corrupt. Either way, it's going to take some time to repair. I'm
unsure how it crashed. I think it just rebooted itself, as I was on
at the time and there were no shutdown messages.
Here is what I saw when I had it booting into a serial console:
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on
isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
IP packet filtering initialized, divert disabled, rule-based
forwarding enabled, default to deny, logging disabled
ad0: DMA limited to UDMA33, non-ATA66 cable or device
ad0: 8297MB <Maxtor 90871U2> [16858/16/63] at ata0-master UDMA33
acd0: CDROM <ATAPI CDROM> at ata1-master PIO4
Waiting 15 seconds for SCSI devices to settle
sa0 at ahc0 bus 0 target 6 lun 0
sa0: <ARCHIVE Python 28388-XXX 5.45> Removable Sequential Access SCSI-
2 device
sa0: 7.812MB/s transfers (7.812MHz, offset 15)
da0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGATE ST39216W 0010> Fixed Direct Access SCSI-3 device
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged
Queueing Enabled
da0: 8761MB (17942584 512 byte sectors: 255H 63S/T 1116C)
da1 at ahc0 bus 0 target 1 lun 0
da1: <WDIGTL WDE9100 1.50> Fixed Direct Access SCSI-2 device
da1: 40.000MB/s transfers (20.000MHz, offset 8, 16bit)
da1: 8683MB (17783204 512 byte sectors: 255H 63S/T 1106C)
vinum: loaded
Mounting root from ufs:/dev/ad0s1a
vinum: reading configuration from /dev/da1s1b
vinum: updating configuration from /dev/da0s1b
vinum: /dev is mounted read-only, not rebuilding /dev/vinum
Warning: defective objects
P var.p1 C State: faulty Subdisks: 1 Size:
969 MB
P home.p1 C State: faulty Subdisks: 1 Size:
3767 MB
P usr.p1 C State: faulty Subdisks: 1 Size:
545 MB
P usrlocal.p1 C State: faulty Subdisks: 1 Size:
969 MB
P usrbbs.p1 C State: faulty Subdisks: 1 Size:
827 MB
S var.p1.s0 State: stale PO: 0 B Size:
969 MB
S home.p1.s0 State: stale PO: 0 B Size:
3767 MB
S usr.p1.s0 State: stale PO: 0 B Size:
545 MB
S usrlocal.p1.s0 State: stale PO: 0 B Size:
969 MB
S usrbbs.p1.s0 State: stale PO: 0 B Size:
827 MB
swapon: adding /dev/ad0s1b as swap device
Automatic boot in progress...
/dev/ad0s1a: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/ad0s1a: clean, 444301 free (485 frags, 55477 blocks, 0.1%
fragmentation)
/dev/vinum/usr: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/vinum/usr: clean, 331415 free (8815 frags, 40325 blocks, 1.6%
fragmentation)
/dev/vinum/usrlocal: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/vinum/usrlocal: clean, 134816 free (1040 frags, 16722 blocks,
0.1% fragmentation)
/dev/vinum/binsuid: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/vinum/binsuid: clean, 84764 free (28 frags, 10592 blocks, 0.0%
fragmentation)
/dev/vinum/var: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/vinum/var: clean, 532829 free (861 frags, 66496 blocks, 0.1%
fragmentation)
/dev/vinum/varmail: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/vinum/varmail: clean, 519218 free (12226 frags, 63374 blocks,
0.8% fragmentation)
/dev/vinum/usrbbs: FILESYSTEM CLEAN; SKIPPING CHECKS
/dev/vinum/usrbbs: clean, 394402 free (10770 frags, 47954 blocks, 1.3%
fragmentation)
/dev/vinum/roothome: FILESYSTEM CLEAN; SKIPPING
CHECKS/etc/rc.shutdown: /usr/bin/logger: not found
Shutting down daemon processes:.
Saving firewall state tables:.
When it gets to the firewall line above, it just hangs.
Needless to say, I'm going to run to WWNet later tonight and retrieve
the box so I can work on it.
Visit http://down.arbornet.org for more current information. I'm not
sure how often I'll remember to actually logon to grex.
|
tonster
|
|
response 22 of 108:
|
Aug 25 05:49 UTC 2002 |
posted August 25, 2002 01:44
-----------------------------------------------------------------------
---------
Okay, good news and bad news.
The good news is that I've finally successfully repaired all of m-
net's partitions, including /bin/suid. I finally found out how to make
it work by reading more closely the man page for vinum, though it's
not clear exactly why it required what it did. Anyway, all the
partitions come up cleanly after they're fsck'd.
The problems arise when it attempts to start sendmail. At that point,
it kernel panic's with this:
Fatal trap 12: page fault while in kernel mode
fault virtual address = 0x0
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc0196c34
stack pointer = 0x10:0xc8bc8d0c
frame pointer = 0x10:0xc8bc8d14
code segment = base 0x0, limit 0xfffff, type 0x1b
= DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 137 (sendmail)
interrupt mask = net
trap number = 12
panic: page fault
Possibly, sendmail is just corrupt. I'm going to try disabling
sendmail and starting the box up and see if I can get a login: prompt
then.
------------------
Tony
|