[Ipmitool-devel] Is the BMC robust to recover from system hangs? impitool unresponsive
rpnabar at gmail.com
Mon Aug 30 09:51:16 CDT 2010
On Mon, Aug 30, 2010 at 7:52 AM, Jarrod B Johnson <jbjohnso at us.ibm.com>wrote:
> Your BMC simply isn't responding to any traffic. BMCs are supposed to be
> completely resilient to OS failures when done properly (not much apart from
> things like power failures in non-redundant systems should be capable of
> knocking out a quality IPMI implementation) . You need to look to your
> system vendor's support for an explanation and/or resolution, since
> implementations vary greatly from one vendor to the next. Sometimes a vendor
> is not competent to make it work, sometimes a vendor is too cheap to make it
> easy, and sometimes a vendor simply hasn't covered your particular NIC
> driver/OS combination and the NIC vendor flubbed some register handling or
> some such to make the NIC shoot itself when the kernel panics.
Thanks for the tips Jarrod! I will look into the nodes. These are
DellR410-servers with the on-board Broadcom NIC. The first thing for this
Monday morning is for me to trudge down to the dark depths of the cluster
room and to manually log in and see what exactly happened to these nodes.
I'll post on the list if I find anything interesting
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Linux-PowerEdge