MegaCli segfault ?

Harald_Jensas at Dell.com Harald_Jensas at Dell.com
Tue Apr 3 16:57:57 CDT 2007


Hi,

Just guessing that the drive that it tries to get information from is the same model as the other drive as well?

I would recommend the following:

1. Do a firmware update of the drives:

- http://ftp.us.dell.com/sas-hdd/R142347.iso

Important Information 
 
On drives attached to PERC5 controllers, the first drive to be updated requires a 5 minute delay before firmware flash to begin. Do not power cycle or reboot your system while the update is in progress. 

 
Fixes and Enhancements 
 
This release addresses Maxtor SAS HDD firmware issues, where under certain circumstances a hard disk drive may go offline, hard disk drives (HDD), may report offline due to a timeout condition. If the HDD is unable to complete commands, this may result in the controller reporting the HDD off line. This firmware update has improved SMART Reporting, where drives can report SMART trips due to aggressive SMART Error Rate Measurement counters during read verifies.

G8774  - HD,300G,SAS,3,10K,3.5,MXT,GEN
M8033  - HD,146G,SAS,3,10K,3.5,MXT,GEN
G8763  - HD,73G,SAS,3,10K,3.5,MXT,GEN
G8764 - HD,146G,SAS,3,15K,3.5,MXT,BB
M8032 - HD,73G,SAS,3,15K,3.5,MXT,BB
G8764 - HD,146G,SAS,3,15K,3.5,MXT,BB  

If that fix your problem then you will be happy, not unhappy as you mentioned in your latest e-mail... At least I would be... ;)

2. Run diagnostics - a Drive Self Test using PowerEdge Diagnostics if you are running a supported Linux distribution, or 32-Bit Diagnostics if you run something that will not run PowerEdge Diagnostics. (PowerEdge diagnostics can perform a non disruptive test of your drives while the system is running.)

PowerEdge Diagnostics:
http://ftp.us.dell.com/diags/dell-pediags-linux-2.7.0.193-A01.tar.gz

32-Bit Diagnostics:
http://ftp.us.dell.com/diags/EL5083A0.bin



About an escalation, it will be extremely difficult getting Dell to do an escalation on non Dell supported/distributed/tested software like MegaCLI... You might get lucky with LSI. :)


PS!
If you are rebooting the server anyway, why not just update BIOS, RAID FW etc. at that time and save the hassle of another reboot after talking to support? 



--
Harald Jensås


> -----Original Message-----
> From: linux-poweredge-bounces at dell.com 
> [mailto:linux-poweredge-bounces at dell.com] On Behalf Of Martin Hamant
> Sent: Friday, March 30, 2007 10:43 AM
> To: linux-poweredge-Lists
> Subject: MegaCli segfault ?
> 
> Is anyone has ever seen this error ?
> 
> # MegaCli -LdPdInfo -a0
>                                      
> Number of Virtual Disks: 1
> Virtual Disk: 0
> Name:
> RAID Level: Primary-1, Secondary-0, RAID Level Qualifier-0 
> Size:139392MB
> State: Degraded
> Stripe Size: 64kB
> Number Of Drives:2
> Span Depth:1
> Default Cache Policy: WriteThrough ReadAheadNone Direct 
> Current Cache Policy: WriteThrough ReadAheadNone Direct 
> Access Policy: Read/Write Disk Cache Policy: Disk's Default 
> Number of Spans: 1
> Span: 0 - Number of PDs: 2
> PD: 0 Information
> Enclosure Number: 1
> Slot Number: 0
> Device Id: 0
> Sequence Number: 2
> Media Error Count: 0
> Other Error Count: 0
> Predictive Failure Count: 0
> Last Predictive Failure Event Seq Number: 0 Raw Size: 
> 140014MB [0x11177328 Sectors] Non Coerced Size: 139502MB 
> [0x11077328 Sectors] Coerced Size: 139392MB [0x11040000 
> Sectors] Firmware state: Online SAS Address(0): 
> 0x50010b90001ed41e SAS Address(1): 0x0
> Inquiry Data: MAXTOR  ATLAS10K5_147SASBP00J499CKBK    A
> 
> PD: 1 Information
> Segmentation fault
> 
> 
> Just happened today on one server :/
> Noticed the "State: degraded" but i can't see any errors in 
> ipmi logs or warning lights on disks ...
> 
> Thank you !
> 
> --
> Martin
> 
> _______________________________________________
> Linux-PowerEdge mailing list
> Linux-PowerEdge at dell.com
> http://lists.us.dell.com/mailman/listinfo/linux-poweredge
> Please read the FAQ at http://lists.us.dell.com/faq
> 



More information about the Linux-PowerEdge mailing list