Weird Filesystem Problem on PE2650/RH7.2

Joe Stevens jgsteven at yahoo.com
Mon Mar 3 20:30:01 CST 2003


I had a strange problem occur a few days ago on one of
our PE2650s that acts as a DB server.  The data in the
partition that holds the database (/data, on reiserfs)
suddenly became inaccessible.  Running "ls" on the
directory didn't return an error, but no
files were visible (when many should have been).

I rebooted the server, ran reiserfsck on the
partition, and remounted it, and everything returned
to normal. Does anybody have any ideas what this might
be?

The server is running RedHat 7.2 with the
2.4.18-18.7.xsmp kernel.  The drive in question was
part of a RAID 5 set on the PE's embedded RAID
controller.  Only the /data partition was affected --
other partitions on the same physical disks were
accessible (although the other partitions were ext3).
When the machine failed its uptime was around 150
days.

Below is a snippet of /var/log/messages that records the
problem, included for reference.  The phrase "kernel
bug" appears in line 13, although it references
prints.c, so I don't really know what to think...

--------------------
Feb 24 05:17:35 brickdb logger: All Daily Reporting
Complete.
Feb 24 06:03:49 brickdb kernel: aacraid: Host adapter
reset request. SCSI hang ?
Feb 24 06:03:49 brickdb kernel: scsi: device set
offline - command error recover failed: host 0 channel
0 id 1 lun 0
Feb 24 06:03:49 brickdb kernel: SCSI disk error : host
0 channel 0 id 1 lun 0 return code = 6000000
Feb 24 06:03:49 brickdb kernel:  I/O error: dev 08:11,
sector 15876992
Feb 24 06:03:49 brickdb kernel:  I/O error: dev 08:11,
sector 15876992
Feb 24 06:03:49 brickdb logger: Database has been
cleaned and vacuumed.
Feb 24 06:03:49 brickdb kernel:  I/O error: dev 08:11,
sector 103192
Feb 24 06:03:54 brickdb kernel: vs-13050:
reiserfs_update_sd: i/o failure occurred trying to
update [6820 6855 0x0 SD] stat data I/O error: dev 08:11,
sector 35216
Feb 24 06:03:54 brickdb kernel: journal-601, buffer
write failed
Feb 24 06:03:56 brickdb kernel: ------------[ cut here
]------------
Feb 24 06:03:56 brickdb kernel: kernel BUG at
prints.c:334!
Feb 24 06:03:56 brickdb kernel: invalid operand: 0000
Feb 24 06:03:56 brickdb kernel: ppp_async ppp_generic
slhc racser ide-cd cdrom autofs nfs lockd sunrpc
bcm5700 reiserfs
aacraid sd_mod scsi_mod
Feb 24 06:03:56 brickdb kernel: CPU:    0
Feb 24 06:03:56 brickdb kernel: EIP:   
0010:[bcm5700:__insmod_bcm5700_O/lib/modules/2.4.18-18.7.xsmp/kernel/driv+-838711/96]    Not tainted
Feb 24 06:03:56 brickdb kernel: EIP:   
0010:[<f888b3c9>]    Not tainted
Feb 24 06:03:56 brickdb kernel: EFLAGS: 00010286
Feb 24 06:03:56 brickdb kernel:
Feb 24 06:03:56 brickdb kernel: EIP is at
reiserfs_panic [reiserfs] 0x29 (2.4.18-18.7.xsmp)
Feb 24 06:03:56 brickdb kernel: eax: 00000024   ebx:
f889f940   ecx: 00000000   edx: f75a2000
Feb 24 06:03:56 brickdb kernel: esi: c1f02c00   edi:
00001120   ebp: c1f02c00   esp: c1e1fe90
Feb 24 06:03:56 brickdb kernel: ds: 0018   es: 0018  
ss: 0018
Feb 24 06:03:56 brickdb kernel: Process kupdated (pid:
13, stackpage=c1e1f000)
Feb 24 06:03:56 brickdb kernel: Stack: f88a22a4
f88a6b00 f889f940 c1e1feb4 f88aa720 00000000 f8895ba7
c1f02c00
Feb 24 06:03:56 brickdb kernel:        f889f940
c1e1e000 0000000a 00000000 00000011 c1f4c6c0 f88d2c70
c01457f8
Feb 24 06:03:56 brickdb kernel:        00000811
0000113a f88aa000 00000010 00000010 00000000 f88997e5
c1f02c00
Feb 24 06:03:56 brickdb kernel: Call Trace:
[bcm5700:__insmod_bcm5700_O/lib/modules/2.4.18-18.7.xsmp/kernel/driv+-744796/96]
.rodata.str1.1 [reiserfs] 0x444 (0xc1e1fe90))
Feb 24 06:03:56 brickdb kernel: Call Trace:
[<f88a22a4>] .rodata.str1.1 [reiserfs] 0x444
(0xc1e1fe90))
Feb 24 06:03:56 brickdb kernel:
[bcm5700:__insmod_bcm5700_O/lib/modules/2.4.18-18.7.xsmp/kernel/driv+-726272/96]
error_buf [reiserfs] 0x0 (0xc1e1fe94))
Feb 24 06:03:56 brickdb kernel: [<f88a6b00>] error_buf
[reiserfs] 0x0 (0xc1e1fe94))
Feb 24 06:03:56 brickdb kernel:
[bcm5700:__insmod_bcm5700_O/lib/modules/2.4.18-18.7.xsmp/kernel/driv+-755392/96]
.rodata.str1.32 [reiserfs] 0x37c0 (0xc1e1fe98))
Feb 24 06:03:56 brickdb kernel: [<f889f940>]
.rodata.str1.32 [reiserfs] 0x37c0 (0xc1e1fe98))
Feb 24 06:03:56 brickdb kernel:
[bcm5700:__insmod_bcm5700_O/lib/modules/2.4.18-18.7.xsmp/kernel/driv+-795737/96]
flush_commit_list [reiserfs] 0x2e7 (0xc1e1fea8))
Feb 24 06:03:56 brickdb kernel: [<f8895ba7>]
flush_commit_list [reiserfs] 0x2e7 (0xc1e1fea8))
Feb 24 06:03:56 brickdb kernel:
[bcm5700:__insmod_bcm5700_O/lib/modules/2.4.18-18.7.xsmp/kernel/driv+-755392/96]
.rodata.str1.32 [reiserfs] 0x37c0 (0xc1e1feb0))
Feb 24 06:03:56 brickdb kernel: [<f889f940>]
.rodata.str1.32 [reiserfs] 0x37c0 (0xc1e1feb0))
Feb 24 06:03:56 brickdb kernel: [getblk+24/64] getblk
[kernel] 0x18 (0xc1e1fecc))
Feb 24 06:03:56 brickdb kernel: [<c01457f8>] getblk
[kernel] 0x18 (0xc1e1fecc))

---------------
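
For anyone reading the log above, here is a rough sketch (not part
of the original report) of how two of its fields could be decoded.
It assumes that "dev 08:11" is the 2.4 kernel's hexadecimal
major:minor notation, that SCSI disks use major 8 with 16 minors per
disk, and that "return code = 6000000" is a hexadecimal SCSI result
word laid out as driver/host/message/status bytes. The function
names are made up for this example.

```python
def decode_dev(dev):
    """Split a 2.4-style 'MM:mm' hex device string into (major, minor)."""
    major_hex, minor_hex = dev.split(":")
    return int(major_hex, 16), int(minor_hex, 16)


def scsi_disk_name(major, minor):
    """Map a device on SCSI disk major 8 to its conventional /dev name.

    Assumes the traditional layout of 16 minors per disk
    (minor 0 = sda, 1..15 = sda1..sda15, 16 = sdb, and so on).
    """
    if major != 8:
        raise ValueError("only SCSI disk major 8 is handled in this sketch")
    disk, part = divmod(minor, 16)
    name = "/dev/sd" + chr(ord("a") + disk)
    return name + str(part) if part else name


def split_scsi_result(code_hex):
    """Split a hex SCSI result word into its four bytes (assumed layout)."""
    result = int(code_hex, 16)
    return {
        "driver": (result >> 24) & 0xFF,
        "host": (result >> 16) & 0xFF,
        "msg": (result >> 8) & 0xFF,
        "status": result & 0xFF,
    }


if __name__ == "__main__":
    major, minor = decode_dev("08:11")
    print(major, minor, scsi_disk_name(major, minor))
    print(split_scsi_result("6000000"))
```

Under those assumptions, 08:11 is major 8, minor 17, i.e. the first
partition of the second SCSI disk (/dev/sdb1), and the return code
has only its driver byte set, to 0x06. If memory serves, the 2.4
SCSI headers define that value as DRIVER_TIMEOUT, which would fit
the adapter reset and "device set offline" messages that precede the
reiserfs errors, but that reading should be checked against the
kernel source.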





More information about the Linux-PowerEdge mailing list