monitoring SAS drives on a PE2950?

Terry Gliedt tpg at umich.edu
Thu Oct 9 10:45:34 CDT 2008


Ryan Langseth wrote:
> 
> On Thu, 9 Oct 2008, Adam Williams wrote:
> 
>> Is there a way to monitor the drives on a PE2950 for failure?  I know
>> there is OMSA, but I have to check the web interface daily.  is there a
>> way to have a script check the drives daily and email me if one fails?
>>
> 
> I have been trying to figure this out on our linux systems too, I really 
> don't want to install OMSA to do it.  there are nagios plugins to monitor 
> megaraid controllers. On our FreeBSD systems, drive failures are logged by 
> the kernel, on CentOS, there are no log messages when a disk fails.

I monitor 70+ machines (Dell, IBM and Sun) every day with a simple 
script that dumps the hardware event log (the IPMI SEL) of each machine. 
I keep each log cleared and filter out non-events (like 'log cleared'). 
The remaining lines (if any) are mailed to me each morning and look like this:

############ SEL Information for 'p35' ['192.168.12.24'] ############
Complete SEL log saved in '/data/sel/p35.log'
SEL can be cleared with:   ipmitool -I lan -U MYROOT -a -H 192.168.12.24 sel clear
    2 | 09/25/2008 | 14:47:21 | Power Supply #0x64 | Predictive failure | Asserted
    3 | 09/25/2008 | 14:47:31 | Power Supply #0x64 | Predictive failure | Deasserted
    4 | 09/25/2008 | 15:18:06 | Power Supply #0x64 | Predictive failure | Asserted

The key is to use IPMI. See

   http://www.hps.com/~tpg/notebook/ipmi.php

for the core of what I needed to know. Hopefully that is enough for you too.
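
A daily check like the one described above might be sketched roughly as 
follows. This is not my actual script, only a minimal illustration. The 
host table, the password file path, and the ignore patterns are all 
assumptions made for the example; the use of ipmitool's -f option 
(read the password from a file) instead of -a is also a choice made 
here so the script can run unattended from cron.

```python
import subprocess

# Hypothetical host table: names and addresses are placeholders.
HOSTS = {"p35": "192.168.12.24"}

# Assumption: substrings (lowercase) that mark non-events to drop.
IGNORE = ("log area reset/cleared", "log cleared")


def filter_events(sel_text, ignore=IGNORE):
    """Return the non-blank SEL lines that match no ignore pattern."""
    kept = []
    for line in sel_text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if any(pat in stripped.lower() for pat in ignore):
            continue
        kept.append(stripped)
    return kept


def read_sel(host, user="MYROOT", password_file="/root/.ipmipass"):
    """Fetch the SEL over the LAN interface (password file path is hypothetical)."""
    cmd = ["ipmitool", "-I", "lan", "-U", user, "-f", password_file,
           "-H", host, "sel", "list"]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout


def report(hosts=HOSTS, reader=read_sel):
    """Build one text report covering every host that has remaining events."""
    sections = []
    for name, addr in hosts.items():
        events = filter_events(reader(addr))
        if events:
            header = f"############ SEL Information for '{name}' ['{addr}'] ############"
            sections.append("\n".join([header] + events))
    return "\n\n".join(sections)


if __name__ == "__main__":
    text = report()
    if text:
        # When run from cron, any output is mailed to the job owner.
        print(text)
```

Run from a daily cron job, this produces mail only when something is 
left after filtering. The sketch does not save the complete log or clear 
the SEL; those steps would have to be added to match the workflow above.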


