PE 1750 Uhhuh. NMI received for unknown reason 21 on CPU 0

Tilmann.Boess at bifab.de Tilmann.Boess at bifab.de
Tue May 23 04:44:40 CDT 2006


linux-poweredge-bounces at dell.com schrieb am 23.05.2006 06:36:11:

>   We have numerous PE1750's in production.  One in particular has 
started
> randomly rebooting in the last couple of days.  The second time it
> rebooted it would not restarted (even when selecting the power on
> button).  I ended up removing and reseating both power supplies which
> seems to of solved the issue.
> 
>   Just tonight the server has again rebooted on its own.  The only thing
> we're seeing in the server logs is:
> 
> May 22 23:23:05 svr kernel: LAPIC_NMI (acpi_id[0x0001] polarity[0x1]
> trigger[0x1] lint[0x1])
> May 22 23:23:05 svr kernel: LAPIC_NMI (acpi_id[0x0002] polarity[0x1]
> trigger[0x1] lint[0x1])
> May 22 23:23:05 svr kernel: LAPIC_NMI (acpi_id[0x0003] polarity[0x1]
> trigger[0x1] lint[0x1])
> May 22 23:23:05 svr kernel: LAPIC_NMI (acpi_id[0x0004] polarity[0x1]
> trigger[0x1] lint[0x1])
> May 22 23:41:00 svr kernel: Uhhuh. NMI received for unknown reason 21 on
> CPU 0.
> May 22 23:51:20 svr kernel: Uhhuh. NMI received for unknown reason 21 on
> CPU 0.

Hi,

we had similar problems in a PE650 (running Debian Woody):

Jan 13 16:43:53 picasso kernel: Dazed and confused, but trying to continue
Jan 13 16:43:53 picasso kernel: Do you have a strange power saving mode 
enabled?
Jan 13 16:43:53 picasso kernel: Uhhuh. NMI received for unknown reason 21.

We checked the server's RAM, found a faulty module and replaced it.
Since then these messages vanished.

Hope this helps,
  Tilmann



More information about the Linux-PowerEdge mailing list