Scientific 6.1 x86_64 kernel panic on reboot on R610

Mikolaj Kucharski mikolaj at kucharski.name
Fri Mar 2 05:36:45 CST 2012


For the records. Kernel from SL6.2 fixes the problem.

# uname -r
2.6.32-220.4.1.el6.x86_64


On Mon, Feb 27, 2012 at 05:02:12PM +0000, Mikolaj Kucharski wrote:
> Hi,
> 
> I can reproduce the issue approx 4 times out of 5 tests. I have few
> Dell PowerEdge R610 and Scientific Linux 6.1 x86_64 running on them.
> Server has SSD drives, if that makes any difference. If I reboot the
> server via shutdown -r now or alt+ctrl+del I'm getting kernel panic at
> the very end of the reboot. Did anyone seen isse like that on their
> fleet, by any chance?
> 
> 
> # head -n1 /etc/issue
> Scientific Linux release 6.1 (Carbon)
> 
> # uname -r
> 2.6.32-131.0.15.el6.x86_64
> 
> # reboot
> ...
> Unmounting file systems:  [  OK  ]
> init: Re-executing /sbin/init
> Please stand by while rebooting the system...
> BUG: unable to handle kernel paging request at 000000000003a780
> IP: [<ffffffffa003ff88>] _scsih_qcmd+0x158/0x790 [mpt2sas]
> PGD 0 
> Oops: 0002 [#1] SMP 
> last sysfs file: /sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0/net/em1/address
> CPU 10 
> Modules linked in: mptctl mptbase dell_rbu sunrpc ipv6 power_meter hwmon sg bnx2 dcdbas serio_raw ghes hed iTCO_wdt iTCO_vendor_support i7core_edac edac_core ext4 mbcache jbd2 sr_mod cdrom sd_mod crc_t10dif ata_generic pata_acpi ata_piix mpt2sas scsi_transport_sas raid_class dm_mod [last unloaded: ipmi_msghandler]
> 
> Modules linked in: mptctl mptbase dell_rbu sunrpc ipv6 power_meter hwmon sg bnx2 dcdbas serio_raw ghes hed iTCO_wdt iTCO_vendor_support i7core_edac edac_core ext4 mbcache jbd2 sr_mod cdrom sd_mod crc_t10dif ata_generic pata_acpi ata_piix mpt2sas scsi_transport_sas raid_class dm_mod [last unloaded: ipmi_msghandler]
> Pid: 116, comm: kblockd/10 Not tainted 2.6.32-131.0.15.el6.x86_64 #1 PowerEdge R610
> RIP: 0010:[<ffffffffa003ff88>]  [<ffffffffa003ff88>] _scsih_qcmd+0x158/0x790 [mpt2sas]
> Restarting system.
> machine restart
> RSP: 0018:ffff880c13c1dcc0  EFLAGS: 00010046
> RAX: 0000000000000000 RBX: ffff880c129f8580 RCX: 000000000000000e
> RDX: 0000000000000000 RSI: 000000000000074f RDI: 000000000003a780
> RBP: ffff880c13c1dd30 R08: ffff880611fd9938 R09: 0000000000000000
> R10: 0000000000000000 R11: 0000000000000000 R12: ffff880611f785e0
> R13: ffff880c11f59800 R14: 000000000003a780 R15: 0000000002000000
> FS:  0000000000000000(0000) GS:ffff8806454a0000(0000) knlGS:0000000000000000
> CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 000000000003a780 CR3: 0000000c11fd7000 CR4: 00000000000006e0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Process kblockd/10 (pid: 116, threadinfo ffff880c13c1c000, task ffff880c13c16ac0)
> Stack:
>  0000000000000000 0000000000000000 0000000000000000 0000000000000000
> <0> 0000000000000000 0000000000000000 0000000000000000 0000074f00000000
> <0> ffff880c13c1dd10 ffff880c129f8580 ffff880611f78000 0000000000000000
> Call Trace:
>  [<ffffffff8134e1ac>] scsi_dispatch_cmd+0x1ac/0x340
>  [<ffffffff81355ba5>] scsi_request_fn+0x415/0x590
>  [<ffffffff814da777>] ? thread_return+0x4e/0x777
>  [<ffffffff81242600>] ? blk_unplug_work+0x0/0x70
>  [<ffffffff812477e2>] __generic_unplug_device+0x32/0x40
>  [<ffffffff8124781e>] generic_unplug_device+0x2e/0x50
>  [<ffffffff81242636>] blk_unplug_work+0x36/0x70
>  [<ffffffff81242600>] ? blk_unplug_work+0x0/0x70
>  [<ffffffff810887d0>] worker_thread+0x170/0x2a0
>  [<ffffffff8108e100>] ? autoremove_wake_function+0x0/0x40
>  [<ffffffff81088660>] ? worker_thread+0x0/0x2a0
>  [<ffffffff8108dd96>] kthread+0x96/0xa0
>  [<ffffffff8100c1ca>] child_rip+0xa/0x20
>  [<ffffffff8108dd00>] ? kthread+0x0/0xa0
>  [<ffffffff8100c1c0>] ? child_rip+0x0/0x20
> Code: 89 e7 89 c6 89 45 cc e8 87 61 ff ff a8 04 49 89 c6 48 89 c7 ba 70 00 00 00 0f 85 d8 02 00 00 89 d1 31 c0 c1 e9 03 83 e2 04 89 c9 <f3> 48 ab 0f 85 7f 01 00 00 0f b6 43 48 0f b6 53 49 84 c0 74 53 
> RIP  [<ffffffffa003ff88>] _scsih_qcmd+0x158/0x790 [mpt2sas]
>  RSP <ffff880c13c1dcc0>
> CR2: 000000000003a780
> ---[ end trace 257e9284bba805fa ]---
> Kernel panic - not syncing: Fatal exception
> Pid: 116, comm: kblockd/10 Tainted: G      D    ----------------   2.6.32-131.0.15.el6.x86_64 #1
> Call Trace:
>  [<ffffffff814da06e>] ? panic+0x78/0x143
>  [<ffffffff814de0b4>] ? oops_end+0xe4/0x100
>  [<ffffffff81040c9b>] ? no_context+0xfb/0x260
>  [<ffffffff81040f25>] ? __bad_area_nosemaphore+0x125/0x1e0
>  [<ffffffff81040ff3>] ? bad_area_nosemaphore+0x13/0x20
>  [<ffffffff810416cd>] ? __do_page_fault+0x31d/0x480
>  [<ffffffff8110f965>] ? mempool_alloc_slab+0x15/0x20
>  [<ffffffff8107a6f4>] ? mod_timer+0x144/0x220
>  [<ffffffff81264877>] ? kobject_put+0x27/0x60
>  [<ffffffff814e007e>] ? do_page_fault+0x3e/0xa0
>  [<ffffffff814dd425>] ? page_fault+0x25/0x30
>  [<ffffffffa003ff88>] ? _scsih_qcmd+0x158/0x790 [mpt2sas]
>  [<ffffffff8134e1ac>] ? scsi_dispatch_cmd+0x1ac/0x340
>  [<ffffffff81355ba5>] ? scsi_request_fn+0x415/0x590
>  [<ffffffff814da777>] ? thread_return+0x4e/0x777
>  [<ffffffff81242600>] ? blk_unplug_work+0x0/0x70
>  [<ffffffff812477e2>] ? __generic_unplug_device+0x32/0x40
>  [<ffffffff8124781e>] ? generic_unplug_device+0x2e/0x50
>  [<ffffffff81242636>] ? blk_unplug_work+0x36/0x70
>  [<ffffffff81242600>] ? blk_unplug_work+0x0/0x70
>  [<ffffffff810887d0>] ? worker_thread+0x170/0x2a0
>  [<ffffffff8108e100>] ? autoremove_wake_function+0x0/0x40
>  [<ffffffff81088660>] ? worker_thread+0x0/0x2a0
>  [<ffffffff8108dd96>] ? kthread+0x96/0xa0
>  [<ffffffff8100c1ca>] ? child_rip+0xa/0x20
>  [<ffffffff8108dd00>] ? kthread+0x0/0xa0
>  [<ffffffff8100c1c0>] ? child_rip+0x0/0x20
> 

-- 
best regards
q#



More information about the Linux-PowerEdge mailing list