PE6850 and snmpd: Rebooting every Sunday

Patrick_Boyd at Dell.com Patrick_Boyd at Dell.com
Tue Oct 3 14:52:21 CDT 2006


Ok, then lets see if OMSA has a reason to restart the servers. Can you
send us the output of `grep "Server Administrator" /var/log/messages*`?

This should give us all the events that OMSA sees and would tell us if
OMSA is rebooting the server for some reason.

Thanks,
Patrick Boyd
Dell Storage Software Engineer
(512)728-3182
 

-----Original Message-----
From: linux-poweredge-bounces at dell.com
[mailto:linux-poweredge-bounces at dell.com] On Behalf Of Bahadir Kiziltan
Sent: Tuesday, October 03, 2006 2:43 PM
To: Jason Goodwin
Cc: linux-poweredge-Lists
Subject: Re: PE6850 and snmpd: Rebooting every Sunday

There were all safe and sound before installing OMSA 5.0 :-). Don't
wanna blame OMSA for this but since related services are stopped we
haven't faced with the issue.

OMSA version 5.1 is released. I give it a try soon.

On 10/3/06, Jason Goodwin <jason.goodwin at ppminfo.com> wrote:
> Just a wild guess, but maybe one of the log rotation scripts is 
> sending a kill to init (kill 1) instead of the process it is rotating
logs for?
> I've seen scripts that don't check the PID carefully before killing...
>
> Other than that, are you using any kind of hardware watchdog timer 
> that could be rebooting your server if the timer isn't getting reset 
> properly?
>
> Good luck tracking down the problem!
> --
> Jason
>
> On Tue, 2006-10-03 at 21:50 +0300, Bahadir Kiziltan wrote:
> > Bloody sure they are as recieved alerts about losing connection from

> > applications. Also double checked by issueing of uptime command.
> >
> > On 10/3/06, Mann, Andrew <amann at ea.com> wrote:
> > >        Are you sure these servers are rebooting?  The syslog 
> > > process and snmpd process are restarted every Sunday just after 4 
> > > AM on RedHat default configuration.  This is controlled by 
> > > logrotate which runs daily
> > > (/etc/cron.daily/logrotate) and is configured through 
> > > /etc/logrotate.conf and /etc/logrotate.d/syslog,
/etc/logrotate.d/snmpd.
> > >
> > > Andrew
> > >
> > > -----Original Message-----
> > > From: linux-poweredge-bounces at dell.com 
> > > [mailto:linux-poweredge-bounces at dell.com] On Behalf Of Bahadir 
> > > Kiziltan
> > > Sent: Tuesday, October 03, 2006 8:32 AM
> > > To: Linux-PowerEdge at dell.com
> > > Subject: PE6850 and snmpd: Rebooting every Sunday
> > >
> > > Hi,
> > >
> > > I have 4 PE6850 servers with the same hw config running RHEL4 U3, 
> > > EM64T. Dell OMSA 5.0 is also up and running on all servers.
> > >
> > > Every Sunday around 04:03 AM all servers go to reboot suddenly. 
> > > There are no jobs scheduled in crontab. When it'd happened first 
> > > time I checked system logs to find something can be useful...
> > >
> > > /var/log/messages.log
> > >
> > > Sep 24 04:03:03 tradbdb1 syslogd 1.4.1: restart.
> > > Sep 24 04:03:03 tradbdb1 snmpd[14839]: NET-SNMP version 5.1.2 Sep 
> > > 24 04:03:04 tradbdb1 crond(pam_unix)[12890]: session closed for 
> > > user root Sep 24 04:03:06 tradbdb1 snmpd[14839]: [smux_accept] 
> > > accepted fd 13 from 127.0.0.1:62456 Sep 24 04:03:06 tradbdb1 
> > > snmpd[14839]: accepted smux peer: oid 
> > > SNMPv2-SMI::enterprises.674.10892.1, password , descr Systems 
> > > Management SNMP MIB Plug-in Manager Sep 24 04:15:01 tradbdb1 
> > > crond(pam_unix)[596]: session opened for user root by (uid=0) Sep 
> > > 24 04:15:02 tradbdb1 crond(pam_unix)[596]: session closed for user

> > > root Sep 24 04:19:31 tradbdb1 syslogd 1.4.1: restart.
> > > Sep 24 04:19:31 tradbdb1 syslog: syslogd startup succeeded Sep 24 
> > > 04:19:31 tradbdb1 kernel: klogd 1.4.1, log source = /proc/kmsg 
> > > started.
> > >
> > > /var/log/snmpd.log
> > >
> > > [smux_accept] accepted fd 13 from 127.0.0.1:32770 accepted smux 
> > > peer: oid SNMPv2-SMI::enterprises.674.10892.1, password , descr 
> > > Systems Management SNMP MIB Plug-in Manager Got trap from peer on 
> > > fd 13 Got trap from peer on fd 13 [smux_accept] accepted fd 14 
> > > from 10.35.0.6:32809 refused smux peer: oid 
> > > SNMPv2-SMI::zeroDotZero, password , descr NSGS peer disconnected: 
> > > SNMPv2-SMI::enterprises.674.10892.1
> > > Received TERM or STOP signal...  shutting down...
> > >
> > >
> > > After stopped all OMSA services as well as snmpd deamon, I do not 
> > > encounter the problem.
> > >
> > > Any clue?
> > >
> > > _______________________________________________
> > > Linux-PowerEdge mailing list
> > > Linux-PowerEdge at dell.com
> > > http://lists.us.dell.com/mailman/listinfo/linux-poweredge
> > > Please read the FAQ at http://lists.us.dell.com/faq
> > >
> >
> > _______________________________________________
> > Linux-PowerEdge mailing list
> > Linux-PowerEdge at dell.com
> > http://lists.us.dell.com/mailman/listinfo/linux-poweredge
> > Please read the FAQ at http://lists.us.dell.com/faq
>
>

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge at dell.com
http://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq



More information about the Linux-PowerEdge mailing list