aacraid-i/o error

Patrick Ford patrick at clickability.com
Mon Nov 25 16:57:00 CST 2002


Steve,

Thanks for the reply.  I've replaced that drive a couple of times in the
past two weeks so I was thinking that it wasn't the root of the
problem.  Sorry to not have included that in my original post.  I didn't
run any sort of consistency check on the new drive though...didn't think
I needed to.  Guess I shouldn't rule out that the new drive is bad. 
There are also some other errors logged that led me to believe that it
was the controller (below).

Thanks again,
Patrick

kernel: scsi : aborting command due to timeout : pid 5605916, scsi1,
channel 0, id 2, lun 0 Read (10) 00 05 98 76 9f 00 00 80 00
kernel: scsi : aborting command due to timeout : pid 5605917, scsi1,
channel 0, id 2, lun 0 Read (10) 00 05 98 77 1f 00 00 80 00

kernel: SCSI host 1 abort (pid 5693348) timed out - resetting
kernel: SCSI bus is being reset for host 1 channel 0.

kernel: attempt to access beyond end of device
kernel: 08:41: rw=0, want=1028199704, limit=122881153



On Mon, 2002-11-25 at 13:04, Steve_Boley at Dell.com wrote:
> You've got a problem with drive id3 and it is causing your scsi bus to reset
> and getting write errors on heavy io.  Probably need to replace the drive.
> Steve
> 
> -----Original Message-----
> From: Patrick Ford [mailto:patrick at clickability.com]
> Sent: Monday, November 25, 2002 2:29 PM
> To: linux-aacraid-devel at exchange.dell.com
> Subject: aacraid-i/o error
> 
> 
> Has anyone had any luck getting the Netraid 4M card (aacraid driver) to
> work properly with kernel v 2.4.x or 2.5.x.  Under heavy I/O I get
> errors in /var/log/messages (see below).  Sometimes it causes one of the
> disk to go offline and the systems Panics.
> 
> I've tried the following (RedHat distro)
> 
> Kernel 2.4.19 - get the below errors
> Kernel 2.4.20 - aacraid mod will not load, hangs system
> Kernel 2.5.48 - Kernel Panic during boot
> 
> In all 3 cases I'm compiling the aacraid driver as a mod so that the
> scsi controller (megaraid) for disk which contains the root partition
> loads first.  I'm also using evms (volume manager).  Again, everything
> works fine until heavy I/O.
> 
> Any help would be greatly appreciated,
> Patrick
> 
> kernel: aacraid:SCSI Channel[0]: Timeout Detected On 10 Command(s)
> kernel: aacraid:ID(0:03:0); Abort Timeout. Resetting Bus 0
> kernel: aacraid:SCSI bus reset issued on channel 0
> kernel: aacraid:ID(0:03:0); Aborted Command [command:0x2a]
> kernel: aacraid:ID(0:03:0); Selection Timeout [command:0x2a]
> kernel: aacraid:Drive 0:3:0 returning error
> kernel: aacraid:ID(0:03:0) - command failure (retries exhausted)
> kernel: aacraid:Container 2 returning error on drive 0:3:0
> kernel: aacraid:Container 2 Returned Error to Adapter Cachekernel:
> write_callback: write failed, status = 5
> 
> kernel:  I/O error: dev 08:41, sector 3944352
> kernel: read_callback: read failed, status = 5
> kernel: SCSI disk error : host 1 channel 0 id 2 lun 0 return code = 1
> 
> 
> 
> _______________________________________________
> Linux-aacraid-devel mailing list
> Linux-aacraid-devel at dell.com
> http://lists.us.dell.com/mailman/listinfo/linux-aacraid-devel
> Please read the FAQ at http://lists.us.dell.com/faq or search the list
> archives at http://lists.us.dell.com/htdig/





More information about the Linux-PowerEdge mailing list