2650 - tg3 on 2.4.18-19.7.xsmp rh7.3 ... OOPS

Jacek Radajewski jacek at usq.edu.au
Thu Jan 16 00:34:01 CST 2003


Hi,

Just had an oops (again) on my PE2600 box running RHL 7.3 + kernel 2.4.19.7.  There was a lot of activity at the time between an oracle 8i client running on the box that crashed and an oracle server.  I pumped the opps through ksysoops but got some errors and warnings.  I don't know much about the kernel code, but it seems that tg3 might not be the problem in this case.  Any help will be appreciated.


details:
red hat linux 7.3 + most patches
2 GB RAM
3 x 36gb disks in raid 5 + 1 36gb in volume
2 x 2.4gb p iv

running samba, apache, weblogic 7.0, oracle 8i client


------------------ cut -------------------------------------------

ksymoops 2.4.4 on i686 2.4.18-19.7.xsmp.  Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.18-19.7.xsmp/ (default)
     -m /boot/System.map-2.4.18-19.7.xsmp (default)

Warning: You did not tell me where to find symbol information.  I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc.  ksymoops -h explains the options.

Error (expand_objects): cannot stat(/lib/ext3.o) for ext3
Error (expand_objects): cannot stat(/lib/jbd.o) for jbd
Error (expand_objects): cannot stat(/lib/aacraid.o) for aacraid
Error (expand_objects): cannot stat(/lib/sd_mod.o) for sd_mod
Error (expand_objects): cannot stat(/lib/scsi_mod.o) for scsi_mod
Warning (map_ksym_to_module): cannot match loaded module ext3 to a unique module object.  Trace may not be reliable.
Warning (map_ksym_to_module): cannot match loaded module aacraid to a unique module object.  Trace may not be reliable.
Unable to handle kernel NULL pointer dereference at virtual address 00000007
f894451d
*pde = 00000000
Oops: 0002
CPU:    1
EIP:    0010:[<f894451d>]    Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010246
eax: dcfb0a00   ebx: 00000000   ecx: ffffffff   edx: c03fdc04
esi: c03fdc04   edi: 00000000   ebp: ca79b89c   esp: c36b7ed0
ds: 0018   es: 0018   ss: 0018
Process swapper (pid: 0, stackpage=c36b7000)
Stack: c01ac741 c03fdc04 ca79b89c 00000000 00000000 c03fdbc0 00000000 c03fdc04 
       c03fdbc0 ca79b89c c03fdc04 0000000e c01acacc c03fdc04 ca79b89c c3684e80 
       c03fdc04 00000286 c03fdbc0 c01acf99 c3684e80 0000000e f89442f0 c36b0d60 
Call Trace: [<c01ac741>] start_request [kernel] 0x1a1 (0xc36b7ed0))
[<c01acacc>] ide_do_request [kernel] 0x29c (0xc36b7f00))
[<c01acf99>] ide_intr [kernel] 0x129 (0xc36b7f1c))
[<f89442f0>] cdrom_pc_intr [ide-cd] 0x0 (0xc36b7f28))
[<c010a61e>] handle_IRQ_event [kernel] 0x5e (0xc36b7f3c))
[<c010a852>] do_IRQ [kernel] 0xc2 (0xc36b7f5c))
[<c0106e60>] default_idle [kernel] 0x0 (0xc36b7f74))
[<c010d058>] call_do_IRQ [kernel] 0x5 (0xc36b7f80))
[<c0106e60>] default_idle [kernel] 0x0 (0xc36b7f90))
[<c0106e8c>] default_idle [kernel] 0x2c (0xc36b7fac))
[<c0106ef4>] cpu_idle [kernel] 0x24 (0xc36b7fb8))
[<c011db4b>] call_console_drivers [kernel] 0xeb (0xc36b7fd0))
[<c011dcf9>] printk [kernel] 0x129 (0xc36b7ffc))
Code: c7 41 08 00 00 00 00 68 b0 44 94 f8 8b 41 04 50 52 e8 8d f2 

>>EIP; f894451d <[autofs].data.end+d79a/e27d>   <=====
Trace; c01ac741 <start_request+1a1/210>
Trace; c01acacc <ide_do_request+29c/2f0>
Trace; c01acf99 <ide_intr+129/160>
Trace; f89442f0 <[autofs].data.end+d56d/e27d>
Trace; c010a61e <handle_IRQ_event+5e/90>
Trace; c010a852 <do_IRQ+c2/110>
Trace; c0106e60 <default_idle+0/40>
Trace; c010d058 <call_do_IRQ+5/d>
Trace; c0106e60 <default_idle+0/40>
Trace; c0106e8c <default_idle+2c/40>
Trace; c0106ef4 <cpu_idle+24/30>
Trace; c011db4b <call_console_drivers+eb/100>
Trace; c011dcf9 <printk+129/140>
Code;  f894451d <[autofs].data.end+d79a/e27d>
00000000 <_EIP>:
Code;  f894451d <[autofs].data.end+d79a/e27d>   <=====
   0:   c7 41 08 00 00 00 00      movl   $0x0,0x8(%ecx)   <=====
Code;  f8944524 <[autofs].data.end+d7a1/e27d>
   7:   68 b0 44 94 f8            push   $0xf89444b0
Code;  f8944529 <[autofs].data.end+d7a6/e27d>
   c:   8b 41 04                  mov    0x4(%ecx),%eax
Code;  f894452c <[autofs].data.end+d7a9/e27d>
   f:   50                        push   %eax
Code;  f894452d <[autofs].data.end+d7aa/e27d>
  10:   52                        push   %edx
Code;  f894452e <[autofs].data.end+d7ab/e27d>
  11:   e8 8d f2 00 00            call   f2a3 <_EIP+0xf2a3> f89537c0 <.data.end+3021/????>

 <0>Kernel panic: Aiee, killing interrupt handler!

3 warnings and 5 errors issued.  Results may not be reliable.


-------------------- original oops message --------------------
Unable to handle kernel NULL pointer dereference at virtual address 00000007
 printing eip:
f894451d
*pde = 00000000
Oops: 0002
nls_iso8859-1 ide-cd cdrom soundcore autofs tg3 usb-ohci usbcore ext3 jbd aacraid sd_mod scsi_mod  
CPU:    1
EIP:    0010:[<f894451d>]    Not tainted
EFLAGS: 00010246

EIP is at cdrom_do_packet_command [ide-cd] 0x2d (2.4.18-19.7.xsmp)
eax: dcfb0a00   ebx: 00000000   ecx: ffffffff   edx: c03fdc04
esi: c03fdc04   edi: 00000000   ebp: ca79b89c   esp: c36b7ed0
ds: 0018   es: 0018   ss: 0018
Process swapper (pid: 0, stackpage=c36b7000)
Stack: c01ac741 c03fdc04 ca79b89c 00000000 00000000 c03fdbc0 00000000 c03fdc04 
       c03fdbc0 ca79b89c c03fdc04 0000000e c01acacc c03fdc04 ca79b89c c3684e80 
       c03fdc04 00000286 c03fdbc0 c01acf99 c3684e80 0000000e f89442f0 c36b0d60 
Call Trace: [<c01ac741>] start_request [kernel] 0x1a1 (0xc36b7ed0))
[<c01acacc>] ide_do_request [kernel] 0x29c (0xc36b7f00))
[<c01acf99>] ide_intr [kernel] 0x129 (0xc36b7f1c))
[<f89442f0>] cdrom_pc_intr [ide-cd] 0x0 (0xc36b7f28))
[<c010a61e>] handle_IRQ_event [kernel] 0x5e (0xc36b7f3c))
[<c010a852>] do_IRQ [kernel] 0xc2 (0xc36b7f5c))
[<c0106e60>] default_idle [kernel] 0x0 (0xc36b7f74))
[<c010d058>] call_do_IRQ [kernel] 0x5 (0xc36b7f80))
[<c0106e60>] default_idle [kernel] 0x0 (0xc36b7f90))
[<c0106e8c>] default_idle [kernel] 0x2c (0xc36b7fac))
[<c0106ef4>] cpu_idle [kernel] 0x24 (0xc36b7fb8))
[<c011db4b>] call_console_drivers [kernel] 0xeb (0xc36b7fd0))
[<c011dcf9>] printk [kernel] 0x129 (0xc36b7ffc))


Code: c7 41 08 00 00 00 00 68 b0 44 94 f8 8b 41 04 50 52 e8 8d f2 
 <0>Kernel panic: Aiee, killing interrupt handler!
In interrupt handler - not syncing




More information about the Linux-PowerEdge mailing list