scsi problems with precision 690 and fedora core 6 test 3
Garrett Mitchener
garrett.mitchener at gmail.com
Thu Sep 28 17:43:52 CDT 2006
Hi, I've been having a really frustrating hardware problem. I just
got a new precision 690, and since FC6 is coming out soon I decided to
try out a test release.
The problem I'm having is that all the kernels I've tried from FC6
test 3 eventually start generating these error messages in
/var/log/messages (see below). Eventually the system gets to the
point where any access to the file system makes a process freeze and I
have to reboot it. (If I wait, rebooting doesn't work and I have to
power off). The time it takes to get to this point varies.
I posted bug reports at red hat
(https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=208033) and
someone else posted a similar bug for Fedora 5
(https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=200787) saying
they tracked the problem to some interaction between recent kernels
and the transfer speed. (So I have every reason to believe that this
problem will also be present in the official release of FC6.) One of
the people at red hat asked me to try reducing the transfer speed to
80 MB/s to see if that makes the problem go away, but I couldn't find
any such option in any of the bios setup programs.
So:
Q1) Are other people seeing this? (Just to confirm that it's not a
hardware failure particular to my machine.)
Q2) Is there a way to change this transfer speed option?
Q3) Any idea what's going wrong in the kernel & how to fix it?
I'd really like to get this resolved because my wonderful new
workstation is distinctly less useful in the meantime.
Thanks a lot,
-- Garrett Mitchener
Sep 25 10:34:08 grograman kernel: mptscsih: ioc0: attempting task abort!
(sc=ffff81006d0d6a30)
Sep 25 10:34:08 grograman kernel: sd 0:0:1:0:
Sep 25 10:34:08 grograman kernel: command: Read(10): 28 00 0e b2 1c 05
00 00 08 00
Sep 25 10:34:08 grograman kernel: mptbase: ioc0: LogInfo(0x31140000):
Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Sep 25 10:34:08 grograman kernel: mptscsih: ioc0: task abort: SUCCESS
(sc=ffff81006d0d6a30)
Sep 25 10:34:08 grograman kernel: INFO: trying to register non-static key.
Sep 25 10:34:08 grograman kernel: the code is fine but needs lockdep annotation.
Sep 25 10:34:08 grograman kernel: turning off the locking correctness validator.
Sep 25 10:34:08 grograman kernel:
Sep 25 10:34:08 grograman kernel: Call Trace:
Sep 25 10:34:08 grograman kernel: [<ffffffff8026ebbd>] show_trace+0xae/0x336
Sep 25 10:34:08 grograman kernel: [<ffffffff8026ee5a>] dump_stack+0x15/0x17
Sep 25 10:34:08 grograman kernel: [<ffffffff802a8871>]
__lock_acquire+0x135/0xa64
Sep 25 10:34:09 grograman kernel: [<ffffffff802a9743>] lock_acquire+0x4b/0x69
Sep 25 10:34:09 grograman kernel: [<ffffffff80267dff>] _spin_lock_irq+0x2b/0x38
Sep 25 10:34:09 grograman kernel: [<ffffffff80265873>]
wait_for_completion_timeout+0x35/0xd7
Sep 25 10:34:09 grograman kernel: [<ffffffff8807d57d>]
:scsi_mod:scsi_send_eh_cmnd+0x269/0x405
Sep 25 10:34:09 grograman kernel: [<ffffffff8807d784>]
:scsi_mod:scsi_eh_tur+0x32/0x86
Sep 25 10:34:09 grograman kernel: [<ffffffff8807e01b>]
:scsi_mod:scsi_error_handler+0x3f5/0xa81
Sep 25 10:34:09 grograman kernel: [<ffffffff802354ad>] kthread+0x100/0x136
Sep 25 10:34:09 grograman kernel: [<ffffffff802617a0>] child_rip+0xa/0x12
Sep 25 10:34:09 grograman kernel: DWARF2 unwinder stuck at child_rip+0xa/0x12
Sep 25 10:34:09 grograman kernel: Leftover inexact backtrace:
Sep 25 10:34:09 grograman kernel: [<ffffffff80267e72>]
_spin_unlock_irq+0x2b/0x31
Sep 25 10:34:09 grograman kernel: [<ffffffff80260ddc>] restore_args+0x0/0x30
Sep 25 10:34:09 grograman kernel: [<ffffffff8024fca4>] run_workqueue+0x19/0xfa
Sep 25 10:34:09 grograman kernel: [<ffffffff802353ad>] kthread+0x0/0x136
Sep 25 10:34:09 grograman kernel: [<ffffffff80261796>] child_rip+0x0/0x12
Sep 25 10:34:09 grograman kernel:
and many like this:
Sep 25 13:12:58 grograman kernel: mptscsih: ioc0: attempting task abort!
(sc=ffff81013e7caa30)
Sep 25 13:12:58 grograman kernel: sd 0:0:1:0:
Sep 25 13:12:58 grograman kernel: command: Write(10): 2a 00 1b 5b 9b a5
00 00 28 00
Sep 25 13:12:58 grograman kernel: mptbase: ioc0: LogInfo(0x31140000):
Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Sep 25 13:12:58 grograman kernel: mptscsih: ioc0: task abort: SUCCESS
(sc=ffff81013e7caa30)
Sep 25 13:22:33 grograman kernel: mptscsih: ioc0: attempting task abort!
(sc=ffff81010578cd60)
Sep 25 13:22:33 grograman kernel: sd 0:0:1:0:
Sep 25 13:22:33 grograman kernel: command: Write(10): 2a 00 0f 5c 82 f5
00 00 08 00
Sep 25 13:22:33 grograman kernel: mptbase: ioc0: LogInfo(0x31140000):
Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Sep 25 13:22:33 grograman kernel: mptscsih: ioc0: task abort: SUCCESS
(sc=ffff81010578cd60)
Sep 25 13:30:50 grograman kernel: mptscsih: ioc0: attempting task abort!
(sc=ffff8101196ea958)
Sep 25 13:30:50 grograman kernel: sd 0:0:1:0:
Sep 25 13:30:50 grograman kernel: command: Write(10): 2a 00 01 5c 77 dd
00 00 08 00
Sep 25 13:30:50 grograman kernel: mptbase: ioc0: LogInfo(0x31140000):
Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Sep 25 13:30:50 grograman kernel: mptscsih: ioc0: task abort: SUCCESS
(sc=ffff8101196ea958)
Sep 25 13:32:33 grograman kernel: mptscsih: ioc0: attempting task abort!
(sc=ffff81013e7ca3d0)
More information about the Linux-Precision
mailing list