RHEL4 System Crash: Unable to handle kernel paging request at virtual address

Jeff Burke jburke at redhat.com
Wed Jun 6 07:24:05 CDT 2007


Shane Presley wrote:
> Hello,
> 
> I had a RHEL4 system crash a day or two ago.  First RedHat system that
> I've ever seen completely hung, requiring me to hard power cycle it.
> Felt like my Windows days.  But then it happened yesterday as well.
> So something is wrong with this server.
> 
> It's fully patched (up2date), and is a Dell 2850.  I captured the
> /var/log/messages right before it panicked and here's the logs:
> 
> Jun  4 21:26:42 myhost kernel: Unable to handle kernel paging request
> at virtual address 0f3514db
> Jun  4 21:26:42 myhost kernel:  printing eip:
> Jun  4 21:26:42 myhost kernel: c01705b8
> Jun  4 21:26:42 myhost kernel: *pde = 33c68001
> Jun  4 21:26:42 myhost kernel: Oops: 0000 [#1]
> Jun  4 21:26:42 myhost kernel: SMP
> Jun  4 21:26:42 myhost kernel: Modules linked in: mptctl mptbase
> ipmi_devintf ipmi_si ipmi_msghandler dell_rbu autofs4 i2c_dev i2c_core
> sunrpc dm_mirror dm_mod button battery ac md5 ipv6 uhci_hcd ehci_hcd
> e1000 floppy ata_piix libata sg ext3 jbd megaraid_mbox megaraid_mm
> sd_mod scsi_mod
> Jun  4 21:26:42 myhost kernel: CPU:    0
> Jun  4 21:26:42 myhost kernel: EIP:    0060:[<c01705b8>]    Not tainted VLI
> Jun  4 21:26:42 myhost kernel: EFLAGS: 00010206   (2.6.9-55.ELsmp)
> Jun  4 21:26:42 myhost kernel: EIP is at __d_lookup+0x65/0x109
> Jun  4 21:26:42 myhost kernel: eax: c2155c30   ebx: cada98f6   ecx:
> 00000011   edx: c212e200
> Jun  4 21:26:42 myhost kernel: esi: 0f3514db   edi: cada98f6   ebp:
> f43aa50c   esp: f3789e0c
> Jun  4 21:26:42 myhost kernel: ds: 007b   es: 007b   ss: 0068
> Jun  4 21:26:42 myhost kernel: Process bbtest-net (pid: 2942,
> threadinfo=f3789000 task=f24723b0)
> Jun  4 21:26:42 myhost kernel: Stack: 00000000 c2155c30 e1cbe00e
> cada98f6 0000000c f3789e80 cada98f6 00000000
> Jun  4 21:26:42 myhost kernel:        cada98f6 f3789f50 c0166ba3
> f7f1be00 f3789e78 f3789e80 cada98f6 f543b548
> Jun  4 21:26:42 myhost kernel:        cada98f6 f3789f50 c0167475
> 00000000 00000000 00000000 fffcf000 c1c18aa0
> Jun  4 21:26:42 myhost kernel: Call Trace:
> Jun  4 21:26:42 myhost kernel:  [<c0166ba3>] do_lookup+0x23/0xb1
> Jun  4 21:26:42 myhost kernel:  [<c0167475>] __link_path_walk+0x844/0xc25
> Jun  4 21:26:42 myhost kernel:  [<c0167899>] link_path_walk+0x43/0xbe
> Jun  4 21:26:42 myhost kernel:  [<c02d443f>] __cond_resched+0x14/0x39
> Jun  4 21:26:42 myhost kernel:  [<c01c3e8a>] direct_strncpy_from_user+0x3e/0x5d
> Jun  4 21:26:42 myhost kernel:  [<c011b01b>] do_page_fault+0x1ae/0x5c6
> Jun  4 21:26:42 myhost kernel:  [<c0167c2e>] path_lookup+0x14b/0x17f
> Jun  4 21:26:42 myhost kernel:  [<c0168309>] open_namei+0x99/0x579
> Jun  4 21:26:42 myhost kernel:  [<c015a599>] filp_open+0x45/0x70
> Jun  4 21:26:42 myhost kernel:  [<c02d443f>] __cond_resched+0x14/0x39
> Jun  4 21:26:42 myhost kernel:  [<c01c3e8a>] direct_strncpy_from_user+0x3e/0x5d
> Jun  4 21:26:42 myhost kernel:  [<c015a8f5>] sys_open+0x31/0x7d
> Jun  4 21:26:42 myhost kernel:  [<c02d5ee3>] syscall_call+0x7/0xb
> Jun  4 21:26:42 myhost kernel: Code: 24 0c 89 c2 81 f2 01 00 37 9e d3
> ea 31 d0 8b 15 e8 a0 44 c0 23 05 e0 a0 44 c0 8d 04 82 89 44 24 04 8b
> 30 85 f6 0f 84 99 00 00 00 <8b> 06 0f 18 00 90 8d 5e 98 0f ae e8 8d 76
> 00 8b 44 24 0c 39 43
> Jun  4 21:26:42 myhost kernel:  <0>Fatal exception: panic in 5 seconds
> 
> So I'm not sure what to make of that.  I noticed one process name in
> there (bbtest-net) which is part of my BigBrother monitoring system.
> But that's been running OK for years, and hasn't been changed
> recently.  Not sure where else to look.  Could this be a hardware
> (memory?) problem?
> 
> Shane
> 
> _______________________________________________
> Linux-PowerEdge mailing list
> Linux-PowerEdge at dell.com
> http://lists.us.dell.com/mailman/listinfo/linux-poweredge
> Please read the FAQ at http://lists.us.dell.com/faq
> 
Shane,
	Red Hat Bugzilla has this issue already open. Please see
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208378
Also if you could please update that BZ with as much information as
possible. For example: How many cpus, How much memory, System
configuration: type of file system, raid, sata, etc...

	I took the liberty of adding this information to the BZ but I did not
put your email on the cc list.

Thanks,
Jeff



More information about the Linux-PowerEdge mailing list