RHEL4 System Crash: Unable to handle kernel paging request at virtual address

Anthony_Leatherwood at Dell.com
Wed Jun 6 10:26:43 CDT 2007


 
Shane,

You could try the following to check whether the problem is memory
consumption in the lower zone:

# echo 100 > /proc/sys/vm/lower_zone_protection

This adds 100 MB of protection to the lower zone of memory. For more
information, see
"/usr/share/doc/kernel-doc-<VERSION>/Documentation/filesystems/proc.txt".
You need to install the "kernel-doc" package to get this file.
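
To judge whether the low zone really is under pressure, it can help to
watch the LowTotal and LowFree lines in /proc/meminfo before and after
changing the setting. The following is only a rough sketch: the function
name lowmem_report is made up for illustration, and those two fields are
only reported by 32-bit kernels built with highmem support.

```shell
#!/bin/sh
# lowmem_report [FILE]
# Print how much of the low zone is free, read from a meminfo-style file.
# Defaults to /proc/meminfo. Returns non-zero if the kernel does not
# report LowTotal/LowFree (for example on 64-bit kernels).
lowmem_report() {
    meminfo="${1:-/proc/meminfo}"
    awk '
        $1 == "LowTotal:" { total = $2 }
        $1 == "LowFree:"  { free = $2 }
        END {
            if (total == "" || free == "" || total == 0) {
                print "LowTotal/LowFree not reported"
                exit 1
            }
            printf "LowFree %d kB of %d kB (%d%%)\n", free, total, free * 100 / total
        }
    ' "$meminfo"
}
```

Running lowmem_report periodically (from cron, say) and comparing the
numbers leading up to a hang would show whether low memory is being
exhausted.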

If the above setting helps, add the following line to the
"/etc/sysctl.conf" file so that it persists across reboots:
vm.lower_zone_protection=100
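
If you would rather script the change than edit the file by hand,
something along these lines should work. It is a minimal sketch, not a
tested procedure: set_sysctl_conf is a hypothetical helper name, and it
assumes the file uses plain "key=value" lines. It replaces an existing
entry for the key or appends one, so running it twice does not leave
duplicate lines.

```shell
#!/bin/sh
# set_sysctl_conf KEY VALUE [FILE]
# Replace an existing uncommented "KEY=..." line in FILE, or append one
# if none exists. FILE defaults to /etc/sysctl.conf and must already exist.
set_sysctl_conf() {
    key="$1"; value="$2"; conf="${3:-/etc/sysctl.conf}"
    tmp="$(mktemp)" || return 1
    awk -v k="$key" -v v="$value" '
        BEGIN { FS = "=" }
        {
            name = $1
            gsub(/[ \t]/, "", name)      # tolerate "key = value" spacing
            if (name == k) {
                if (!done) print k "=" v # keep only one entry for the key
                done = 1
                next
            }
            print
        }
        END { if (!done) print k "=" v }
    ' "$conf" > "$tmp" && cat "$tmp" > "$conf"
    status=$?
    rm -f "$tmp"
    return $status
}

# Example (as root):
#   set_sysctl_conf vm.lower_zone_protection 100
#   sysctl -p    # reload /etc/sysctl.conf without rebooting
```

Before relying on it, check that /proc/sys/vm/lower_zone_protection
actually exists on your kernel, since not every kernel provides this
tunable.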




Cheers!
Anthony Leatherwood


-----Original Message-----
From: linux-poweredge-bounces at dell.com
[mailto:linux-poweredge-bounces at dell.com] On Behalf Of Shane Presley
Sent: Wednesday, June 06, 2007 6:30 AM
To: redhat-list at redhat.com; linux-poweredge-Lists
Subject: RHEL4 System Crash: Unable to handle kernel paging request at
virtual address

Hello,

I had a RHEL4 system crash a day or two ago.  It's the first Red Hat
system I've ever seen hang completely, requiring a hard power cycle.
Felt like my Windows days.  But then it happened again yesterday,
so something is wrong with this server.

It's fully patched (up2date), and is a Dell 2850.  I captured
/var/log/messages from right before it panicked, and here are the logs:

Jun  4 21:26:42 myhost kernel: Unable to handle kernel paging request at virtual address 0f3514db
Jun  4 21:26:42 myhost kernel:  printing eip:
Jun  4 21:26:42 myhost kernel: c01705b8
Jun  4 21:26:42 myhost kernel: *pde = 33c68001
Jun  4 21:26:42 myhost kernel: Oops: 0000 [#1]
Jun  4 21:26:42 myhost kernel: SMP
Jun  4 21:26:42 myhost kernel: Modules linked in: mptctl mptbase ipmi_devintf ipmi_si ipmi_msghandler dell_rbu autofs4 i2c_dev i2c_core sunrpc dm_mirror dm_mod button battery ac md5 ipv6 uhci_hcd ehci_hcd e1000 floppy ata_piix libata sg ext3 jbd megaraid_mbox megaraid_mm sd_mod scsi_mod
Jun  4 21:26:42 myhost kernel: CPU:    0
Jun  4 21:26:42 myhost kernel: EIP:    0060:[<c01705b8>]    Not tainted VLI
Jun  4 21:26:42 myhost kernel: EFLAGS: 00010206   (2.6.9-55.ELsmp)
Jun  4 21:26:42 myhost kernel: EIP is at __d_lookup+0x65/0x109
Jun  4 21:26:42 myhost kernel: eax: c2155c30   ebx: cada98f6   ecx: 00000011   edx: c212e200
Jun  4 21:26:42 myhost kernel: esi: 0f3514db   edi: cada98f6   ebp: f43aa50c   esp: f3789e0c
Jun  4 21:26:42 myhost kernel: ds: 007b   es: 007b   ss: 0068
Jun  4 21:26:42 myhost kernel: Process bbtest-net (pid: 2942, threadinfo=f3789000 task=f24723b0)
Jun  4 21:26:42 myhost kernel: Stack: 00000000 c2155c30 e1cbe00e cada98f6 0000000c f3789e80 cada98f6 00000000
Jun  4 21:26:42 myhost kernel:        cada98f6 f3789f50 c0166ba3 f7f1be00 f3789e78 f3789e80 cada98f6 f543b548
Jun  4 21:26:42 myhost kernel:        cada98f6 f3789f50 c0167475 00000000 00000000 00000000 fffcf000 c1c18aa0
Jun  4 21:26:42 myhost kernel: Call Trace:
Jun  4 21:26:42 myhost kernel:  [<c0166ba3>] do_lookup+0x23/0xb1
Jun  4 21:26:42 myhost kernel:  [<c0167475>] __link_path_walk+0x844/0xc25
Jun  4 21:26:42 myhost kernel:  [<c0167899>] link_path_walk+0x43/0xbe
Jun  4 21:26:42 myhost kernel:  [<c02d443f>] __cond_resched+0x14/0x39
Jun  4 21:26:42 myhost kernel:  [<c01c3e8a>] direct_strncpy_from_user+0x3e/0x5d
Jun  4 21:26:42 myhost kernel:  [<c011b01b>] do_page_fault+0x1ae/0x5c6
Jun  4 21:26:42 myhost kernel:  [<c0167c2e>] path_lookup+0x14b/0x17f
Jun  4 21:26:42 myhost kernel:  [<c0168309>] open_namei+0x99/0x579
Jun  4 21:26:42 myhost kernel:  [<c015a599>] filp_open+0x45/0x70
Jun  4 21:26:42 myhost kernel:  [<c02d443f>] __cond_resched+0x14/0x39
Jun  4 21:26:42 myhost kernel:  [<c01c3e8a>] direct_strncpy_from_user+0x3e/0x5d
Jun  4 21:26:42 myhost kernel:  [<c015a8f5>] sys_open+0x31/0x7d
Jun  4 21:26:42 myhost kernel:  [<c02d5ee3>] syscall_call+0x7/0xb
Jun  4 21:26:42 myhost kernel: Code: 24 0c 89 c2 81 f2 01 00 37 9e d3 ea 31 d0 8b 15 e8 a0 44 c0 23 05 e0 a0 44 c0 8d 04 82 89 44 24 04 8b 30 85 f6 0f 84 99 00 00 00 <8b> 06 0f 18 00 90 8d 5e 98 0f ae e8 8d 76 00 8b 44 24 0c 39 43
Jun  4 21:26:42 myhost kernel:  <0>Fatal exception: panic in 5 seconds

So I'm not sure what to make of that.  I noticed one process name in
there (bbtest-net), which is part of my BigBrother monitoring system.
But that's been running OK for years and hasn't been changed recently.
Not sure where else to look.  Could this be a hardware (memory?) problem?

Shane

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge at dell.com
http://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


