OpenManage 6.4 + Ubuntu 10.04 + SLES11 SP1 (lustre patched)

Ramiro Alba raq at cttc.upc.edu
Wed Feb 9 12:22:54 CST 2011


Hello everyone:

I have some Dell PowerEdge 2970 servers running Ubuntu 10.04 with the
SLES11 SP1 kernel (2.6.32), patched for the Lustre filesystem. My problem
is that, with this combination, installing the OpenManage 6.4 Ubuntu
packages causes a kernel oops that freezes the server (see the kernel
messages below). If I don't patch the kernel for Lustre, everything runs
fine, but I can't do without it.
The consequence is that I have no way of knowing whether the internal
RAID is OK.

LSI Logic / Symbios Logic MegaRAID SAS 1078 [1000:0060] (rev 04)

I have the same problem with every OpenManage version I have tried.

Is anyone else having the same problem, or does anyone have an idea of
how to deal with this issue?

Any idea or suggestion will be welcome.

Thanks in advance

Regards


######### MESSAGES #####################################

/etc/init.d/dataeng


---------------- Starting service --------------

/etc/init.d/dataeng start
Starting Systems Management Device Drivers:
Starting dell_rbu: * 
Starting ipmi driver:  * Already started
Starting Systems Management Data Engine:
Starting dsm_sa_datamgrd: 
Message from syslogd at jff222 at Feb  9 13:40:34 ...
 kernel:[1396660.306307] Oops: 0000 [#1] SMP 

Message from syslogd at jff222 at Feb  9 13:40:34 ...
 kernel:[1396660.306397] last sysfs file: /sys/devices/pci0000:00/0000:00:09.0/0000:01:00.0/0000:02:03.0/0000:07:00.0/host0/target0:2:0/0:2:0:0/queue_type

Message from syslogd at jff222 at Feb  9 13:40:34 ...
 kernel:[1396660.308006] Stack:

Message from syslogd at jff222 at Feb  9 13:40:34 ...
 kernel:[1396660.308006] Call Trace:

Message from syslogd at jff222 at Feb  9 13:40:34 ...
 kernel:[1396660.308006] Code: 83 c4 28 c3 90 41 57 41 56 41 55 41 54 49 89 fc 55 53 48 83 ec 08 48 8b 87 b0 00 00 00 48 8b 97 c8 00 00 00 48 8b 80 a0 02 00 00 <48> 8b a8 38 02 00 00 48 85 ed 0f 84 7a 01 00 00 44 8b 6a 68 41
 * 

Message from syslogd at jff222 at Feb  9 13:40:34 ...
 kernel:[1396660.308006] CR2: 0000000000000238
Starting dsm_sa_eventmgrd: 


---------- Kernel errors -------------  

BUG: unable to handle kernel NULL pointer dereference at 0000000000000238
IP: [<ffffffffa0154ed6>] sd_iostats_start_req+0x26/0x1e0 [sd_mod]
PGD 1d38cd067 PUD 111c95067 PMD 0 
Oops: 0000 [#1] SMP 
last sysfs file: /sys/devices/pci0000:00/0000:00:09.0/0000:01:00.0/0000:02:03.0/0000:07:00.0/host0/target0:2:0/0:2:0:0/queue_type
CPU 0 
Modules linked in: dell_rbu(N) ipmi_si(N) ipmi_devintf(N)
ib_uverbs(N) ib_ipoib(N) ib_cm(N) ib_sa(N) ipv6(N) ib_umad(N)
dm_mod(N) bnx2(N) ib_mthca(N) ib_mad(N) lp(N) amd64_edac_mod(N)
rtc_cmos(N)
edac_core(N) i2c_piix4(N) ib_core(N) rtc_core(N) shpchp(N) processor(N)
parport(N) i2c_core(N) pci_hotplug(N) rtc_lib(N) edac_mce_amd(N)
serio_raw(N)
dcdbas(N) joydev(N) button(N) loop(N) ext3(N) jbd(N) mbcache(N) sg(N)
sr_mod(N)
ses(N) sd_mod(N) enclosure(N) crc_t10dif(N) cdrom(N) usbhid(N) hid(N)
thermal(N)
sata_svw(N) thermal_sys(N) floppy(N) libata(N) megaraid_sas(N) hwmon(N)
ohci_hcd(N) ehci_hcd(N) usbcore(N) scsi_mod(N) [last unloaded: ib_addr]
Supported: Yes
Pid: 29563, comm: dsm_sa_datamgrd Tainted: G          N 2.6.32.19-0.2.1-lustre.1.8.5 #1 PowerEdge 2970
RIP: 0010:[<ffffffffa0154ed6>]  [<ffffffffa0154ed6>] sd_iostats_start_req+0x26/0x1e0 [sd_mod]
RSP: 0018:ffff8801022edb68  EFLAGS: 00010092
[1396660.308006] RAX: 0000000000000000 RBX: ffff8802347f7880 RCX: 0000000000000000
[1396660.308006] RDX: ffff8802347f9280 RSI: 0000000000000064 RDI: ffff8802347f7880
[1396660.308006] RBP: ffff880234fc0080 R08: 0000000000000000 R09: ffff8801f94c0e80
[1396660.308006] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8802347f7880
[1396660.308006] R13: ffff880235368800 R14: ffff880437346400 R15: ffffffffffffffff
[1396660.308006] FS:  00007fc395639700(0000) GS:ffff880009000000(0000) knlGS:0000000000000000
[1396660.308006] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[1396660.308006] CR2: 0000000000000238 CR3: 000000018fbdf000 CR4: 00000000000006f0
[1396660.308006] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[1396660.308006] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[1396660.308006] Process dsm_sa_datamgrd (pid: 29563, threadinfo ffff8801022ec000, task ffff88023455a080)
[1396660.308006] Stack:
[1396660.308006]  ffff880235368800 ffff8802347f7880 ffff880234fc0080 0000000000000000
[1396660.308006] <0> ffff880235368800 ffff880437346400 ffffffffffffffff ffffffffa0155fe1
[1396660.308006] <0> 0000000000000008 ffff880234fc0080 0000000000000001 00000001811231e1
[1396660.308006] Call Trace:
[1396660.308006]  [<ffffffffa0155fe1>] sd_prep_fn+0xb1/0xce0 [sd_mod]
[1396660.308006]  [<ffffffff811b0ce1>] blk_peek_request+0xc1/0x190
[1396660.308006]  [<ffffffffa00076a9>] scsi_request_fn+0x59/0x5a0 [scsi_mod]
[1396660.308006]  [<ffffffff811b4cc1>] blk_execute_rq_nowait+0x61/0xb0
[1396660.308006]  [<ffffffffa01781c4>] sg_common_write+0x334/0x580 [sg]
[1396660.308006]  [<ffffffffa0178631>] sg_new_write+0x221/0x320 [sg]
[1396660.308006]  [<ffffffffa0178bc6>] sg_ioctl+0x496/0xbf0 [sg]
[1396660.308006]  [<ffffffff81106e52>] vfs_ioctl+0x82/0xb0
[1396660.308006]  [<ffffffff81106fa8>] do_vfs_ioctl+0x88/0x570
[1396660.308006]  [<ffffffff81107510>] sys_ioctl+0x80/0xa0
[1396660.308006]  [<ffffffff81002efb>] system_call_fastpath+0x16/0x1b
[1396660.308006]  [<00007fc394b2f197>] 0x7fc394b2f197
[1396660.308006] Code: 83 c4 28 c3 90 41 57 41 56 41 55 41 54 49 89 fc 55 53 48 83 ec 08 48 8b 87 b0 00 00 00 48 8b 97 c8 00 00 00 48 8b 80 a0 02 00 00 <48> 8b a8 38 02 00 00 48 85 ed 0f 84 7a 01 00 00 44 8b 6a 68 41
[1396660.308006] RIP  [<ffffffffa0154ed6>] sd_iostats_start_req+0x26/0x1e0 [sd_mod]
[1396660.308006]  RSP <ffff8801022edb68>
[1396660.308006] CR2: 0000000000000238
[1396660.308006] ---[ end trace 55d9d7926ecbc6d2 ]---
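For anyone comparing traces from several servers, the faulting location can be pulled out of the "IP:" line automatically. The following is only a minimal sketch in Python: the names `IP_LINE` and `faulting_location` are hypothetical, and the pattern assumes the line has the same layout as the "IP:" line in the trace above (address, symbol+offset/size, optional module in brackets).

```python
import re

# Assumed layout, taken from the trace above:
#   IP: [<ffffffffa0154ed6>] sd_iostats_start_req+0x26/0x1e0 [sd_mod]
IP_LINE = re.compile(
    r"IP: \[<(?P<addr>[0-9a-f]+)>\] "
    r"(?P<symbol>\w+)\+(?P<offset>0x[0-9a-f]+)/(?P<size>0x[0-9a-f]+)"
    r"(?: \[(?P<module>\w+)\])?"
)


def faulting_location(oops_text):
    """Return a dict describing the faulting instruction, or None if no IP line is found."""
    match = IP_LINE.search(oops_text)
    if match is None:
        return None
    info = match.groupdict()
    # Convert the hexadecimal offset and function size to integers.
    info["offset"] = int(info["offset"], 16)
    info["size"] = int(info["size"], 16)
    return info


sample = "IP: [<ffffffffa0154ed6>] sd_iostats_start_req+0x26/0x1e0 [sd_mod]"
print(faulting_location(sample))
```

For this trace the sketch reports the symbol sd_iostats_start_req in module sd_mod, which is the function the call trace shows being reached from sd_prep_fn after an sg ioctl issued by dsm_sa_datamgrd.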

-- 
Ramiro Alba

Centre Tecnològic de Tranferència de Calor
http://www.cttc.upc.edu


Escola Tècnica Superior d'Enginyeries
Industrial i Aeronàutica de Terrassa
Colom 11, E-08222, Terrassa, Barcelona, Spain
Tel: (+34) 93 739 86 46






More information about the Linux-PowerEdge mailing list