Dell PE1850/2850 RAID array issue with RHEL4

Hansjörg Maurer hansjoerg.maurer at dlr.de
Fri Aug 4 09:37:15 CDT 2006


Hi,

our errors were very similar:

Jul 22 02:08:44 intra kernel: megaraid abort: 21963824[255:129], driver owner
Jul 22 02:08:44 intra kernel: megaraid: aborting-21963825 cmd=2a <c=2 t=1 l=0>
Jul 22 02:08:44 intra kernel: megaraid abort: 21963825[255:129], driver owner
Jul 22 02:08:44 intra kernel: megaraid: aborting-21963826 cmd=2a <c=2 t=1 l=0>
Jul 22 02:08:44 intra kernel: megaraid abort: 21963826[255:129], driver owner
Jul 22 02:08:44 intra kernel: megaraid: resetting the host...

and
http://www.firehat.org/repodata/repoview/linttylog-0-1.00-0.html
finally detected a hardware issue, as I described in a previous mail.

Give it a try :-)
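
To see how often a controller aborts commands and resets, messages like the ones above can be counted per host. What follows is only a rough sketch: the function name and the pattern names are made up for illustration, the patterns are copied from the log excerpts in this thread, and it assumes the classic syslog layout (month, day, time, host, message).

```python
import re
from collections import Counter

# Patterns copied from the log excerpts in this thread; the keys are
# illustrative labels, not names used by the megaraid driver.
PATTERNS = {
    "abort": re.compile(r"megaraid: aborting-\d+"),
    "host_reset": re.compile(r"megaraid: resetting the host"),
    "offlined": re.compile(r"scsi: Device offlined"),
}


def count_megaraid_events(lines):
    """Count abort, host reset and offline messages per host name."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) < 5:
            continue  # not a "month day time host message" syslog line
        host = fields[3]
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[(host, name)] += 1
    return counts


# Possible usage (the log path is an assumption, adjust as needed):
#   with open("/var/log/messages") as log:
#       for (host, name), n in sorted(count_megaraid_events(log).items()):
#           print(host, name, n)
```

A sudden rise in the host_reset count on one machine would be a reason to look at the hardware more closely.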

Hansjörg


Herman Vega wrote:

>Hi all,
>
>We had a similar problem last week; we have decided to replace the PERC4/DC
>with advantec SCSI cards next week.
>
>Jul 28 22:00:01 nod1 kernel: megaraid: aborting-16591038 cmd=2a <c=2 t=0 l=0>
>Jul 28 22:00:01 nod1 kernel: megaraid abort: scsi cmd:16591071, do now own
>Jul 28 22:00:01 nod1 kernel: megaraid: resetting the host...
>Jul 28 22:00:01 nod1 kernel: megaraid mbox: reset sequence completed successfully
>Jul 28 22:00:01 nod1 kernel: megaraid: fast sync command timed out
>Jul 28 22:00:01 nod1 kernel: megaraid: reservation reset failed
>Jul 28 22:00:01 nod1 kernel: megaraid: resetting the host...
>Jul 28 22:00:01 nod1 kernel: scsi: Device offlined - not ready after error recovery: host 1 channel 2 id 0 lun 0
>Jul 28 22:00:01 nod1 last message repeated 36 times
>Jul 28 22:00:01 nod1 kernel: SCSI error : <1 2 0 0> return code = 0x70018
>Jul 28 22:00:01 nod1 kernel: end_request: I/O error, dev sdb, sector 314795551
>Jul 28 22:00:01 nod1 kernel: Buffer I/O error on device sdb1, logical block 39349436
>Jul 28 22:00:01 nod1 kernel: lost page write due to I/O error on sdb1
>Jul 28 22:00:01 nod1 kernel: scsi1 (0:0): rejecting I/O to offline device
>Jul 28 22:00:01 nod1 kernel: SCSI error : <1 2 0 0> return code = 0x70018
>Jul 28 22:00:01 nod1 kernel: end_request: I/O error, dev sdb, sector 314798975
>Jul 28 22:00:01 nod1 kernel: scsi1 (0:0): rejecting I/O to offline device
>Jul 28 22:00:01 nod1 last message repeated 138 times
>Jul 28 22:00:01 nod1 kernel: __journal_remove_journal_head: freeing b_committed_data
>Jul 28 22:00:01 nod1 kernel: __journal_remove_journal_head: freeing b_committed_data
>Jul 28 22:00:01 nod1 kernel: ext3_abort called.
>Jul 28 22:00:01 nod1 kernel: EXT3-fs error (device sdb1): ext3_journal_start_sb: Detected aborted journal
>Jul 28 22:00:01 nod1 kernel: Remounting filesystem read-only
>Jul 28 22:00:01 nod1 kernel: EXT3-fs error (device sdb1) in start_transaction: Journal has aborted
>Jul 28 22:00:01 nod1 kernel: scsi1 (0:0): rejecting I/O to offline device
>Jul 28 22:00:01 nod1 kernel: scsi1 (0:0): rejecting I/O to offline device
>Jul 28 22:00:02 nod1 kernel: scsi1 (0:0): rejecting I/O to offline device
>Jul 28 22:00:02 nod1 kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #18989559 offset 0
>
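
The "return code = 0x70018" in the log above can be split into its parts. This is a sketch under the assumption that the value uses the usual Linux SCSI result layout (driver byte, host byte, message byte, status byte, from high to low); the function name is made up, and the name tables hold only a few well-known values rather than the full kernel lists.

```python
# A few well-known values only; unknown codes are reported as "unknown".
HOST_BYTE_NAMES = {
    0x00: "DID_OK",
    0x01: "DID_NO_CONNECT",
    0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT",
    0x04: "DID_BAD_TARGET",
    0x05: "DID_ABORT",
    0x06: "DID_PARITY",
    0x07: "DID_ERROR",
    0x08: "DID_RESET",
}
STATUS_NAMES = {
    0x00: "GOOD",
    0x02: "CHECK CONDITION",
    0x08: "BUSY",
    0x18: "RESERVATION CONFLICT",
}


def decode_scsi_result(code):
    """Split a 32-bit SCSI result word into its four bytes."""
    host = (code >> 16) & 0xFF
    status = code & 0xFF
    return {
        "driver_byte": (code >> 24) & 0xFF,
        "host_byte": host,
        "host_name": HOST_BYTE_NAMES.get(host, "unknown"),
        "msg_byte": (code >> 8) & 0xFF,
        "status_byte": status,
        "status_name": STATUS_NAMES.get(status, "unknown"),
    }


print(decode_scsi_result(0x70018))
```

Under that assumption, 0x70018 would read as host byte 0x07 (DID_ERROR) with status 0x18 (RESERVATION CONFLICT).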
>
>On 7/28/06, wolf2k5 <wolf2k5 at gmail.com> wrote:
>  
>
>>Hi all,
>>
>>We've a few Dell PowerEdge 1850 and 2850 servers running RHEL4 U3
>>(some i386, others x86_64). The 1850s have RAID1 arrays, the 2850s
>>have RAID1 and RAID arrays. We applied the latest BIOS and RAID
>>firmware to all servers.



More information about the Linux-PowerEdge mailing list