11G wrong IPMI data
Patrick Schreurs
patrick at news-service.com
Tue Jul 14 03:48:48 CDT 2009
Matt Domsch wrote:
> On Fri, Jul 10, 2009 at 10:40:24AM +0200, Patrick Schreurs wrote:
>> Hello,
>>
>> I'm struggling with temperature monitoring on a Dell R710. IPMI reports
>> a sensor as critical. We several R170's in a professional cool
>> datacenter. They all report the same critial:
>>
>> ~# ipmitool sdr|grep cr
>> Temp | 53 degrees C | cr
>>
>> The ambient temperature of this server is 17 degrees (celsius). Dell
>> advices to adjust the datacenter's climate, which isn't very helpful.
>
> At some point, the specification for these values changed. Instead of
> being an actual $temperature_now, the value means "delta between
> $temperature_now and $temperature_fail", e.g. the amount of safety
> room you have before you hit the critical point.
Below are the details form this sensor (ipmitool -v):
Sensor ID : Temp (0xc)
Entity ID : 8.1 (Memory Module)
Sensor Type (Analog) : Temperature
Sensor Reading : 53 (+/- 1) degrees C
Status : Upper Critical
Nominal Reading : 23.000
Normal Minimum : 11.000
Normal Maximum : 69.000
Upper critical : 47.000
Upper non-critical : 42.000
Lower critical : 3.000
Lower non-critical : 8.000
Positive Hysteresis : 1.000
Negative Hysteresis : 1.000
Minimum sensor range : Unspecified
Maximum sensor range : Unspecified
Event Message Control : Per-threshold
Readable Thresholds : lcr lnc unc ucr
Settable Thresholds : lcr lnc unc ucr
Threshold Read Mask : lcr lnc unc ucr
Event Status : Event Messages Disabled
Assertion Events : unc+ ucr+
Event Enable : Event Messages Disabled
Assertions Enabled :
The upper and lower limits (like $temperature_fail) are stored in
hardware, right? So the hardware concludes the specific reading results
in an 'Upper Critical'. So, is this a bug which needs fixing?
Thanks.
-Patrick
More information about the Linux-PowerEdge
mailing list