About KIPMI0 process
Chris - PowerEdge Linux List
linux-poweredge at dotcomdesigners.com
Fri May 25 19:20:52 CDT 2007
----- Original Message -----
From: "Michael E Brown" <Michael_E_Brown at dell.com>
Sent: Wednesday, May 23, 2007 3:56 PM
Subject: Re: About KIPMI0 process
>> What do we have to do to turn all this stuff completely off to bring the
>> CPU
>> load down to 0.00 when it's not running anything at all? I'm open to any
>> and all suggestions at this point. We've resisted putting this server
>> into
>> production. I know this is considered "harmless" load by Dell, but it
>> really messes up our monitoring systems and alters the true CPU load that
>> we
>> monitor for best application processing. There's no reason we should be
>> seeing anything but 0.00 on a system that has nothing installed and
>> nothing
>> running on it.
>
> You sure it isnt some random system daemon? You havent provided any data
> to show what is causing the cpu load.
That's the problem. I disabled virtually all daemons that get installed
with 'RHEL4 minimal install' and I've been watching 'top' and even have a
script running that constantly checks the load and if it exceeds 0.40 it
loggs the top 20 processes once a second, and NOTHING is showing... I just
spent the last 15 minutes staring non-stop at 'top' and here's what the
results look like, when the load suddenly spikes at 0.60:
----------------------------------------------------------------------------------
Thu May 24 17:04:02 PDT 2007
top - 17:04:03 up 1 day, 22:08, 2 users, load average: 0.60, 0.23, 0.08
Tasks: 59 total, 1 running, 58 sleeping, 0 stopped, 0 zombie
Cpu0 : 0.0% us, 0.0% sy, 0.0% ni, 100.0% id, 0.0% wa, 0.0% hi, 0.0%
si
Cpu1 : 0.0% us, 0.0% sy, 0.0% ni, 100.0% id, 0.0% wa, 0.0% hi, 0.0%
si
Cpu2 : 0.0% us, 0.0% sy, 0.0% ni, 100.0% id, 0.0% wa, 0.0% hi, 0.0%
si
Cpu3 : 0.0% us, 0.0% sy, 0.0% ni, 100.0% id, 0.0% wa, 0.0% hi, 0.0%
si
Mem: 4149240k total, 336916k used, 3812324k free, 46620k buffers
Swap: 4192956k total, 0k used, 4192956k free, 231060k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
1 root 15 0 3536 548 472 S 0 0.0 0:00.63 init
2 root RT 0 0 0 0 S 0 0.0 0:00.02 migration/0
3 root 34 19 0 0 0 S 0 0.0 0:00.00 ksoftirqd/0
4 root RT 0 0 0 0 S 0 0.0 0:00.01 migration/1
5 root 34 19 0 0 0 S 0 0.0 0:00.00 ksoftirqd/1
6 root RT 0 0 0 0 S 0 0.0 0:00.02 migration/2
7 root 34 19 0 0 0 S 0 0.0 0:00.00 ksoftirqd/2
8 root RT 0 0 0 0 S 0 0.0 0:00.01 migration/3
9 root 34 19 0 0 0 S 0 0.0 0:00.00 ksoftirqd/3
10 root 5 -10 0 0 0 S 0 0.0 0:00.00 events/0
----------------------------------------------------------------------------------
The 2nd user is me - one SSH session running 'top', second session me
grabbing data from the log. Nothing else is running. 'init' seems to stay
at the top of the 'top' list, and someone for no reason whatsoever the load
goes from 0.00 to around 0.5 to 0.6 for about 20-40 seconds, then drops back
down to 0.00. I don't see any other processes running when the load spikes
to ~0.60, and as you can see from the 'top' list above, there's nothing in
the %CPU column either, which is the part that is driving me nuts.
SOMETHING is causing the CPU load to spike, but not a single process is
showing in 'top' as using anything but 0% CPU.
Any ideas what else I can try or how else to troubleshoot this to figure out
what on earth might be causing this? I really don't see any "processes"
using CPU resources when this load issue occurs.
Chris
More information about the Linux-PowerEdge
mailing list