kernel oops on 2650 :-(

Brian K. Jones jonesy at CS.Princeton.EDU
Thu Oct 30 14:46:01 CST 2003


Hi,

I've been having A LOT of problems running RH 9 on 3 2650's. I'm
inclined to believe the problem is a redhat one, and not a dell one, but
these boxes are locking up pretty consistently any time the load reaches
the 5.00 range, which isn't good, since they're sitting next to 3 Sun
250s whose loads have been to 30 without disallowing a console login.
These machines are completely unreachable by any means, though ping and
nmap will work fine. 

I've been delaying reporting these issues, partially because there are
so many variables, and partially because I didn't have a good log entry
to reference that I could definitively tie to a lockup. Now I do. Here
it is, so if anyone knows what may have caused this, it would be great.

============SNIP===========
Oct 30 00:24:43 tux kernel: Unable to handle kernel NULL pointer
dereference at virtual address 0000007c
Oct 30 00:24:43 tux kernel:  printing eip:
Oct 30 00:24:43 tux kernel: c014ed15
Oct 30 00:24:43 tux kernel: *pde = 00000000
Oct 30 00:24:43 tux kernel: Oops: 0000
Oct 30 00:24:43 tux kernel: iptable_filter ip_tables autofs nfs lockd
sunrpc tg3 keybdev mousedev hid input usb-ohci usbcore ext3 jbd raid1
aic7xxx sd_mod scsi_mod
Oct 30 00:24:43 tux kernel: CPU:    2
Oct 30 00:24:43 tux kernel: EIP:    0060:[<c014ed15>]    Not tainted
Oct 30 00:24:43 tux kernel: EFLAGS: 00010246
Oct 30 00:24:43 tux kernel:
Oct 30 00:24:43 tux kernel: EIP is at page_referenced [kernel] 0xe5
(2.4.20-8smp)
Oct 30 00:24:43 tux kernel: eax: c1000030   ebx: c1000030   ecx:
00000000   edx: 00000000
Oct 30 00:24:43 tux kernel: esi: c1000030   edi: 00000000   ebp:
c0340880   esp: f7ffbf78
Oct 30 00:24:43 tux kernel: ds: 0068   es: 0068   ss: 0068
Oct 30 00:24:43 tux kernel: Process kscand/HighMem (pid: 14,
stackpage=f7ffb000)
Oct 30 00:24:43 tux kernel: Stack: c1c484f8 f7ffa000 00000000 00000000
00000000 00000000 f7ffbfb4 c1d118f8
Oct 30 00:24:43 tux kernel:        c1d118f8 c1dfc8bc c03419d0 c0340880
c0146a82 f7ffa000 c012d930 00000001
Oct 30 00:24:43 tux kernel:        00000003 f7ffa000 c0340880 f7ffa000
c0147b94 c0340880 00000003 00000000
Oct 30 00:24:43 tux kernel: Call Trace:   [<c0146a82>] scan_active_list
[kernel] 0xb2 (0xf7ffbfa8))
Oct 30 00:24:43 tux kernel: [<c012d930>] process_timeout [kernel] 0x0
(0xf7ffbfb0))
Oct 30 00:24:43 tux kernel: [<c0147b94>] kscand [kernel] 0x104
(0xf7ffbfc8))
Oct 30 00:24:43 tux kernel: [<c0147a90>] kscand [kernel] 0x0
(0xf7ffbfe0))
Oct 30 00:24:43 tux kernel: [<c010759d>] kernel_thread_helper [kernel]
0x5 (0xf7ffbff0))
Oct 30 00:24:43 tux kernel:
Oct 30 00:24:43 tux kernel:
Oct 30 00:24:43 tux kernel: Code: 8b 41 7c 39 41 68 b8 01 00 00 00 0f 43
44 24 10 89 44 24 10
========================

Thanks for any input. 
brian.




More information about the Linux-PowerEdge mailing list