megaraid and scsi errors
Timo Veith
tv at rz-zw.fh-kl.de
Mon Jul 3 11:05:16 CDT 2006
Hi list readers,
last week I upgraded the kernel because of some security fixes. The
machine is a PE 2550 with two PentiumIIIs and it's running Debian sarge
with kernel 2.6.8-3-686-smp (now). There is a RAID1 (system) and a RAID5
(data).
The reboot forced a filesystem check and dropped me to a repair console. I
also noticed some strange boot errors which I have not seen before on
that machine. (see dmesg output below). However I repaired the filesystem
errors, booted again just to find out that I couln't login any more.
I rebooted (actually I had to power off the machine hard) with knoppix and
wanted to see if I can find anything. I noticed the error messages from
megaraid there too. And the filesystems still had errors. After fsck I
found that /etc/shadow was gone. Fortunately there was a backup lieing
around. But thats not the question.
The question is why am I getting those error messages and how can I get
rid of them.
Output from dmesg (shortened):
...
megaraid: found 0x101e:0x1960:bus 2:slot 0:func 0
scsi0:Found MegaRAID controller at 0xf881e000, IRQ:185
megaraid: [161N:3.17] detected 2 logical drives.
megaraid: channel[0] is raid.
megaraid: channel[1] is raid.
scsi0 : LSI Logic MegaRAID 161N 254 commands 16 targs 5 chans 7 luns
Using anticipatory io scheduler
scsi0: scanning scsi channel 0 for logical drives.
Vendor: MegaRAID Model: LD0 RAID1 17278R Rev: 161N
Type: Direct-Access ANSI SCSI revision: 02
Vendor: MegaRAID Model: LD1 RAID5 42746R Rev: 161N
Type: Direct-Access ANSI SCSI revision: 02
scsi0: scanning scsi channel 1 for logical drives.
scsi0: scanning scsi channel 2 for logical drives.
scsi0: scanning scsi channel 4 [P0] for physical devices.
Vendor: DELL Model: 1x4 U2W SCSI BP Rev: 1.32
Type: Processor ANSI SCSI revision: 02
scsi0: scanning scsi channel 5 [P1] for physical devices.
Vendor: Dell Model: 12 BAY U2W CU Rev: 0209
Type: Processor ANSI SCSI revision: 03
megaraid: ABORTING-63 cmd=a0 <c=5 t=15 l=0>
megaraid: ABORTING-63[7d], fw owner.
megaraid: reservation reset failed.
megaraid: RESET-63 cmd=a0 <c=5 t=15 l=0>
megaraid: RESET-63[7d], fw owner.
megaraid: reservation reset failed.
megaraid: RESET-63 cmd=a0 <c=5 t=15 l=0>
megaraid: RESET-63[7d], fw owner.
megaraid: reservation reset failed.
megaraid: RESET-63 cmd=a0 <c=5 t=15 l=0>
megaraid: RESET-63[7d], fw owner.
scsi: Device offlined - not ready after error recovery: host 0 channel 5
id 15 lun 0
megaraid: ABORTING-64 cmd=12 <c=5 t=15 l=1>
megaraid: ABORTING-64[7d], fw owner.
megaraid: reservation reset failed.
megaraid: RESET-64 cmd=12 <c=5 t=15 l=1>
megaraid: RESET-64[7d], fw owner.
megaraid: reservation reset failed.
megaraid: RESET-64 cmd=12 <c=5 t=15 l=1>
megaraid: RESET-64[7d], fw owner.
megaraid: reservation reset failed.
megaraid: RESET-64 cmd=12 <c=5 t=15 l=1>
megaraid: RESET-64[7d], fw owner.
scsi: Device offlined - not ready after error recovery: host 0 channel 5
id 15 lun 1
Attached scsi generic sg0 at scsi0, channel 0, id 0, lun 0, type 0
Attached scsi generic sg1 at scsi0, channel 0, id 1, lun 0, type 0
Attached scsi generic sg2 at scsi0, channel 4, id 6, lun 0, type 3
Attached scsi generic sg3 at scsi0, channel 5, id 15, lun 0, type 3
SCSI device sda: 35385344 512-byte hdwr sectors (18117 MB)
sda: asking for cache data failed
sda: assuming drive cache: write through
/dev/scsi/host0/bus0/target0/lun0: p1 p2 p3 p4 < p5 >
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
SCSI device sdb: 497143808 512-byte hdwr sectors (254538 MB)
sdb: asking for cache data failed
sdb: assuming drive cache: write through
/dev/scsi/host0/bus0/target1/lun0:<4>megaraid: aborted cmd 0[7d]
complete.
p1 p2
Attached scsi disk sdb at scsi0, channel 0, id 1, lun 0
...
Kind regards and TIA
Timo
More information about the Linux-PowerEdge
mailing list