Disk Dropped offline in PE2850

Kurt_Olsson at Dell.com Kurt_Olsson at Dell.com
Wed Nov 5 11:02:19 CST 2008


Hard to say. These are not unusual messages. The first concerns
communication with the SCSI backplane, not a disk. The other is not
relevant at all. Does the log stop there, and do its timestamps agree
with the time the services failed? I would say that the output you are
looking at here is the end of the PERC init AFTER the reboot.
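
One way to check whether the quoted entries fall before or after the
outage is to parse their timestamps and compare them with the time the
services stopped. The following is a minimal sketch, not a Dell tool. It
assumes the log has been saved as text with the "MM/DD HH:MM:SS:" prefix
seen in the quoted lines, that the year must be supplied because the log
does not carry one, and that the controller clock matches the server
clock. The function names are hypothetical.

```python
import re
from datetime import datetime

# Matches the "MM/DD HH:MM:SS:" prefix seen in the quoted controller log lines.
STAMP_RE = re.compile(r"^(\d{2})/(\d{2}) (\d{2}):(\d{2}):(\d{2}):\s*(.*)$")


def parse_entries(log_text, year):
    """Yield (timestamp, message) pairs; the year is supplied by the caller."""
    for line in log_text.splitlines():
        m = STAMP_RE.match(line.strip())
        if m:
            month, day, hour, minute, second = (int(g) for g in m.groups()[:5])
            yield datetime(year, month, day, hour, minute, second), m.group(6)


def entries_after(log_text, outage_time, year):
    """Return the entries stamped at or after outage_time."""
    return [(ts, msg) for ts, msg in parse_entries(log_text, year)
            if ts >= outage_time]
```

If every entry lands after the time the servers stopped responding, that
would be consistent with the lines being post-reboot PERC init output
rather than the cause of the outage.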

 

 

From: linux-poweredge-bounces at dell.com
[mailto:linux-poweredge-bounces at dell.com] On Behalf Of Brian O'Mahony
Sent: Tuesday, November 04, 2008 9:26 AM
To: linux-poweredge-Lists
Subject: RE: Disk Dropped offline in PE2850

 

Sigh

 

Two servers in Bangalore decided to stop receiving user requests. I
couldn't ssh into them locally. They were powered off and on onsite.

 

Controller log shows this on both:

 

11/04 19:19:37: MPT_Rec: INQ Error - Negotiating LD[6] pRfm a0790c20

11/04 19:19:37: Rejecting MISC opcode:  unknown sub-opcode (0x26)

 

Once again these machines have out-of-date FW on the PERC. Is that what
this error is about?

 

From: Patrick_Fischer at Dell.com [mailto:Patrick_Fischer at Dell.com] 
Sent: 03 November 2008 13:38
To: Brian O'Mahony; linux-poweredge at lists.us.dell.com
Subject: RE: Disk Dropped offline in PE2850

 

 

F4 is only a timeout. In nearly all cases, a driver and firmware update
will fix it going forward.

Check also the disk FW.

 

From: linux-poweredge-bounces at dell.com
[mailto:linux-poweredge-bounces at dell.com] On Behalf Of Brian O'Mahony
Sent: Monday, November 03, 2008 2:29 PM
To: linux-poweredge-Lists
Subject: Disk Dropped offline in PE2850

 

This is more of a general question than a Linux-specific one.

 

A disk dropped offline this morning in a RAID5 array. The machine is
running RHEL4 U6. The dedicated hotspare did NOT kick in.

 

There are no sense key errors in the controller logs, and no predictive
failures or SMART errors. The log says the disk was removed and then
reinserted 5 seconds later.

 

The only thing I can find in the controller log is:

 

fail_reason=f4, channel=0, target=3

 

It's obviously the right disk, 0:3. Does anyone know what
fail_reason=f4 means?
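
For anyone scanning a saved controller log for similar entries, here is
a minimal sketch that pulls out lines shaped like the one quoted above
and reports the channel:target pair. It assumes the log is available as
plain text in exactly this "fail_reason=..., channel=..., target=..."
form. The function name and the lookup table are hypothetical. The only
code with a meaning attached is f4, which a reply in this thread
describes as a timeout; every other code is reported as unknown.

```python
import re

# Pattern for lines shaped like: fail_reason=f4, channel=0, target=3
FAIL_RE = re.compile(
    r"fail_reason=(?P<reason>[0-9a-fA-F]+),\s*"
    r"channel=(?P<channel>\d+),\s*"
    r"target=(?P<target>\d+)"
)

# Only f4 is described in this thread; no other codes are guessed at here.
KNOWN_REASONS = {"f4": "timeout (per a reply in this thread)"}


def find_failures(log_text):
    """Return one dict per fail_reason entry found in log_text."""
    results = []
    for match in FAIL_RE.finditer(log_text):
        reason = match.group("reason").lower()
        results.append({
            "reason": reason,
            "meaning": KNOWN_REASONS.get(reason, "unknown"),
            "disk": f"{match.group('channel')}:{match.group('target')}",
        })
    return results


if __name__ == "__main__":
    for failure in find_failures("fail_reason=f4, channel=0, target=3"):
        print(failure)
```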

 

 

 
 
The information in this email is confidential and may be legally
privileged.
It is intended solely for the addressee. Access to this email by anyone
else
is unauthorized. If you are not the intended recipient, any disclosure,
copying, distribution or any action taken or omitted to be taken in
reliance
on it, is prohibited and may be unlawful. If you are not the intended
addressee please contact the sender and dispose of this e-mail. Thank
you.
 
 