[RE: PowerEdge 2500 and Redhat 7.3 possible aacraid error]

Jens Guether guether at hhi.de
Thu Jun 13 04:46:01 CDT 2002


Hi David,

here is what Steve (thanks, Steve) proposed. Although the full
exchange is attached below, I will briefly summarize it:

- It seems that the aic7xxx driver differs between
   Red Hat 7.2 and 7.3.

- Steve proposed changing the second channel from SCSI to RAID
   and checking whether the error clears.
   (I have not done this yet, but will test it this evening.)

I will let you know whether this proposal works.
Jens
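
As a side note, the timeout messages quoted in the original report
below could be tallied per SCSI target with something like the
following Python sketch. The regular expression is an assumption based
only on the single log line shown in this thread, and count_timeouts is
a hypothetical helper name, not part of any driver tooling.

```python
import re
from collections import Counter

# Pattern modelled on the one kernel message quoted in this thread, e.g.
# "Jun  9 04:04:10 bslinux1 kernel: scsi : aborting command due to
#  timeout : pid 136040, scsi1, channel 0, id 0, lun 0 Write (10) ..."
# Other kernel versions may word the message differently (assumption).
TIMEOUT_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+kernel:\s+"
    r"scsi : aborting command due to timeout\s*: pid (?P<pid>\d+), "
    r"scsi(?P<adapter>\d+), channel (?P<channel>\d+), "
    r"id (?P<id>\d+), lun (?P<lun>\d+)"
)


def count_timeouts(lines):
    """Count timeout messages per (adapter, channel, id, lun) tuple."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.match(line)
        if match:
            target = tuple(
                int(match.group(name))
                for name in ("adapter", "channel", "id", "lun")
            )
            counts[target] += 1
    return counts
```

It could be fed the log directly, for example
count_timeouts(open("/var/log/messages")), to see whether the timeouts
all hit scsi1 (the percraid device) or are spread over both adapters.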


-------- Original Message --------
Subject: RE: PowerEdge 2500 and Redhat 7.3 possible aacraid error
Date: Wed, 12 Jun 2002 22:44:32 -0500
From: Steve_Boley at Dell.com
To: guether at hhi.de

Press F2 for System Setup, go to Integrated Devices, and use the
left/right arrow keys on channel B to switch it from SCSI to RAID.

-----Original Message-----
From: Jens Guether [mailto:guether at hhi.de]
Sent: Wednesday, June 12, 2002 7:20 AM
To: Steve_Boley at exchange.dell.com
Subject: Re: PowerEdge 2500 and Redhat 7.3 possible aacraid error

How do I change the second channel to RAID?

Do you mean this one?
  > 02:04.1 SCSI storage controller: Adaptec AIC-7899P U160/m (rev 01)
  >          Subsystem: Dell Computer Corporation: Unknown device 00d9
  >          Flags: bus master, 66Mhz, medium devsel, latency 32, IRQ 11
  >          BIST result: 00
  >          I/O ports at d800 [disabled] [size=256]
  >          Memory at feafe000 (64-bit, non-prefetchable) [size=4K]
  >          Expansion ROM at feb00000 [disabled] [size=128K]
  >          Capabilities: [dc] Power Management version 2

Thanks,
Jens
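
For what it is worth, the boot messages in the first mail show IRQ 5
shared between 01:02.1 and 02:04.0. A rough Python sketch for listing
which devices share an IRQ in lspci -v output follows. devices_by_irq is
a hypothetical name, and the parser assumes the unwrapped, unquoted
output format shown in this thread (bus:device.function at the start of
the line, no PCI domain prefix).

```python
import re
from collections import defaultdict

# Device header as printed in this thread, e.g.
# "02:04.1 SCSI storage controller: Adaptec AIC-7899P U160/m (rev 01)"
HEADER_RE = re.compile(r"^([0-9a-f]{2}:[0-9a-f]{2}\.[0-7]) (.+)$")
# The IRQ appears on the indented "Flags:" line, e.g. "... latency 32, IRQ 11"
IRQ_RE = re.compile(r"\bIRQ (\d+)\b")


def devices_by_irq(lspci_text):
    """Map each IRQ number to the list of PCI addresses that report it."""
    by_irq = defaultdict(list)
    current = None  # address of the device whose detail lines we are reading
    for line in lspci_text.splitlines():
        header = HEADER_RE.match(line)
        if header:
            current = header.group(1)
            continue
        irq = IRQ_RE.search(line)
        if irq and current is not None:
            by_irq[int(irq.group(1))].append(current)
    return dict(by_irq)
```

Any IRQ that maps to more than one address is shared. Whether sharing
has anything to do with the timeouts is not established in this thread.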

Steve_Boley at Dell.com wrote:
 > Try changing the second channel from SCSI to RAID and see if the error
 > clears. It seems they changed the aic7xxx driver in 7.3, and this lets
 > us isolate whether it might be causing problems.
 > Steve
 >
 > -----Original Message-----
 > From: Jens Guether [mailto:guether at hhi.de]
 > Sent: Tuesday, June 11, 2002 12:39 PM
 > To: Steve_Boley at exchange.dell.com
 > Subject: Re: PowerEdge 2500 and Redhat 7.3 possible aacraid error
 >
 >
 > Here is the output of lspci -v:
 >
 > 00:00.0 Host bridge: ServerWorks CNB20HE Host Bridge (rev 23)
 >          Flags: fast devsel
 >
 > 00:00.1 Host bridge: ServerWorks CNB20HE Host Bridge (rev 01)
 >          Flags: bus master, medium devsel, latency 32
 >
 > 00:00.2 Host bridge: ServerWorks: Unknown device 0006 (rev 01)
 >          Flags: medium devsel
 >
 > 00:00.3 Host bridge: ServerWorks: Unknown device 0006 (rev 01)
 >          Flags: medium devsel
 >
 > 00:04.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100] (rev 08)
 >          Subsystem: Dell Computer Corporation: Unknown device 009b
 >          Flags: bus master, medium devsel, latency 32, IRQ 11
 >          Memory at fe202000 (32-bit, non-prefetchable) [size=4K]
 >          I/O ports at ecc0 [size=64]
 >          Memory at fe000000 (32-bit, non-prefetchable) [size=1M]
 >          Expansion ROM at fe100000 [disabled] [size=1M]
 >          Capabilities: [dc] Power Management version 2
 >
 > 00:0e.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27) (prog-if 00 [VGA])
 >          Subsystem: Dell Computer Corporation: Unknown device 00d9
 >          Flags: bus master, stepping, medium devsel, latency 32
 >          Memory at fd000000 (32-bit, non-prefetchable) [size=16M]
 >          I/O ports at e800 [size=256]
 >          Memory at fe201000 (32-bit, non-prefetchable) [size=4K]
 >          Expansion ROM at <unassigned> [disabled] [size=128K]
 >          Capabilities: [5c] Power Management version 2
 >
 > 00:0f.0 ISA bridge: ServerWorks OSB4 South Bridge (rev 50)
 >          Subsystem: ServerWorks OSB4 South Bridge
 >          Flags: bus master, medium devsel, latency 0
 >
 > 00:0f.1 IDE interface: ServerWorks OSB4 IDE Controller (prog-if 8a [Master SecP PriP])
 >          Flags: bus master, medium devsel, latency 64
 >          I/O ports at 08b0 [size=16]
 >
 > 00:0f.2 USB Controller: ServerWorks OSB4/CSB5 USB Controller (rev 04) (prog-if 10 [OHCI])
 >          Subsystem: ServerWorks OSB4/CSB5 USB Controller
 >          Flags: bus master, medium devsel, latency 32, IRQ 10
 >          Memory at fe200000 (32-bit, non-prefetchable) [size=4K]
 >
 > 01:02.0 PCI bridge: Intel Corp. 80960RM [i960RM Bridge] (rev 01) (prog-if 00 [Normal decode])
 >          Flags: bus master, medium devsel, latency 32
 >          Bus: primary=01, secondary=02, subordinate=02, sec-latency=32
 >          I/O behind bridge: 0000d000-0000dfff
 >          Memory behind bridge: fea00000-febfffff
 >
 > 01:02.1 RAID bus controller: Dell Computer Corporation PowerEdge Expandable RAID Controller 3/Di (rev 01)
 >          Subsystem: Dell Computer Corporation: Unknown device 00d9
 >          Flags: bus master, medium devsel, latency 32, IRQ 5
 >          Memory at f0000000 (32-bit, prefetchable) [size=128M]
 >          Expansion ROM at fe900000 [disabled] [size=64K]
 >
 > 02:04.0 SCSI storage controller: Adaptec RAID subsystem HBA (rev 01)
 >          Subsystem: Dell Computer Corporation: Unknown device 00c5
 >          Flags: bus master, 66Mhz, medium devsel, latency 32, IRQ 5
 >          BIST result: 00
 >          I/O ports at dc00 [size=256]
 >          Memory at feaff000 (64-bit, non-prefetchable) [size=4K]
 >          Expansion ROM at feb00000 [disabled] [size=128K]
 >          Capabilities: [dc] Power Management version 2
 >
 > 02:04.1 SCSI storage controller: Adaptec AIC-7899P U160/m (rev 01)
 >          Subsystem: Dell Computer Corporation: Unknown device 00d9
 >          Flags: bus master, 66Mhz, medium devsel, latency 32, IRQ 11
 >          BIST result: 00
 >          I/O ports at d800 [disabled] [size=256]
 >          Memory at feafe000 (64-bit, non-prefetchable) [size=4K]
 >          Expansion ROM at feb00000 [disabled] [size=128K]
 >          Capabilities: [dc] Power Management version 2
 >
 >
 > Steve_Boley at Dell.com wrote:
 >
 >>What is installed in the system's PCI slots?
 >>
 >>-----Original Message-----
 >>From: Jens Guether [mailto:guether at hhi.de]
 >>Sent: Tuesday, June 11, 2002 12:34 PM
 >>To: Steve_Boley at exchange.dell.com
 >>Subject: Re: PowerEdge 2500 and Redhat 7.3 possible aacraid error
 >>
 >>
 >>firmware 2.7-0 build 3153
 >>Jens
 >>
 >>Steve_Boley at Dell.com wrote:
 >>
 >>
 >>>The only way is at boot time.
 >>>Steve
 >>>
 >>>-----Original Message-----
 >>>From: Jens Guether [mailto:guether at hhi.de]
 >>>Sent: Tuesday, June 11, 2002 12:18 PM
 >>>To: Steve_Boley at exchange.dell.com
 >>>Subject: Re: PowerEdge 2500 and Redhat 7.3 possible aacraid error
 >>>
 >>>
 >>>Thanks for the quick help,
 >>>
 >>>but (sorry for the stupid question):
 >>>how do I detect the firmware version?
 >>>
 >>>Jens
 >>>
 >>>Steve_Boley at Dell.com wrote:
 >>>
 >>>
 >>>
 >>>>What's the firmware version on the RAID controller?
 >>>>Steve
 >>>>
 >>>>-----Original Message-----
 >>>>From: Jens Guether [mailto:guether at hhi.de]
 >>>>Sent: Tuesday, June 11, 2002 12:04 PM
 >>>>To: linux-poweredge at exchange.dell.com
 >>>>Subject: PowerEdge 2500 and Redhat 7.3 possible aacraid error
 >>>>
 >>>>
 >>>>Hi all,
 >>>>
 >>>>I installed Red Hat 7.3 on two identical servers
 >>>>(Dell PowerEdge 2500), with the result that both machines
 >>>>crash every few (3-5) days.
 >>>>
 >>>>I updated the kernel from 2.4.18-3 to 2.4.18-4 with the
 >>>>same result: both machines still crash every few days.
 >>>>
 >>>>I inspected /var/log/messages and found lines like:
 >>>>------------------------------------------------------------------------
 >>>>Jun  9 04:04:10 bslinux1 kernel: scsi : aborting command due to timeout : pid 136040, scsi1, channel 0, id 0, lun 0 Write (10) 00 00 26 8c c5 00 00 08 00
 >>>>------------------------------------------------------------------------
 >>>>
 >>>>When booting, the following lines are printed on both machines:
 >>>>------------------------------------------------------------------------
 >>>>SCSI subsystem driver Revision: 1.00
 >>>>kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2
 >>>>PCI: Found IRQ 11 for device 02:04.1
 >>>>scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.5
 >>>>       <Adaptec aic7899 Ultra160 SCSI adapter>
 >>>>       aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs
 >>>>
 >>>>Red Hat/Adaptec aacraid driver, May  2 2002
 >>>>PCI: Found IRQ 5 for device 01:02.1
 >>>>PCI: Sharing IRQ 5 with 02:04.0
 >>>>PCI: Found IRQ 5 for device 01:02.1
 >>>>PCI: Sharing IRQ 5 with 02:04.0
 >>>>PCI: Found IRQ 5 for device 01:02.1
 >>>>PCI: Sharing IRQ 5 with 02:04.0
 >>>>spurious 8259A interrupt: IRQ7.
 >>>>scsi1 : percraid
 >>>> Vendor: DELL      Model: PERCRAID Volume   Rev: 0001
 >>>> Type:   Direct-Access                      ANSI SCSI revision: 02
 >>>> Vendor: DELL      Model: PERCRAID Volume   Rev: 0001
 >>>> Type:   Direct-Access                      ANSI SCSI revision: 02
 >>>>----------------------------------------------------------------------
 >>>>
 >>>>I searched this mailing list archive and found that some people had
 >>>>the same problems in Feb. 2002, but the timeout problem was fixed in
 >>>>rx.c.
 >>>>
 >>>>I checked the file rx.c in my sources and everything was fine, but
 >>>>the machines still crash and produce the scsi timeout messages.
 >>>>
 >>>>Can anyone help?
 >>>>Thanks,
 >>>>Jens Guether
 >>>>
 >>>>
 >>>>_______________________________________________
 >>>>Linux-PowerEdge mailing list
 >>>>Linux-PowerEdge at dell.com
 >>>>http://lists.us.dell.com/mailman/listinfo/linux-poweredge
 >>>>Please read the FAQ at http://lists.us.dell.com/faq or search the list
 >>>>archives at http://lists.us.dell.com/htdig/