OMSA v5.2 - dsm_sa_datamgr32d is stopped

Patrick_Boyd at Dell.com Patrick_Boyd at Dell.com
Mon Sep 17 12:10:55 CDT 2007


You've got a program with a semaphore leak. You need to restart the
box... and figure out which program is leaking semaphores.

-----Original Message-----
From: linux-poweredge-bounces at dell.com
[mailto:linux-poweredge-bounces at dell.com] On Behalf Of Frank Warnke
Sent: Monday, September 17, 2007 12:05 PM
To: linux-poweredge-Lists
Subject: Re: OMSA v5.2 - dsm_sa_datamgr32d is stopped

Some more information;

I decided to try one more time to uninstall (srvadmin-uninstall.sh) and
reinstall srvadmin (yum install srvadmin-all).

When I ran "srvadmin-services.sh start", I get something different;

# srvadmin-services.sh start
Starting mptctl:
Waiting for mptctl driver registration to complete:
                                                           [  OK  ]

Starting Systems Management Device Drivers:
Starting dell_rbu:                                         [  OK  ]
Starting ipmi driver: Already started                      [  OK  ]
Starting Systems Management Data Engine:
Starting dsm_sa_datamgr32d:                                [  OK  ]
Starting dsm_sa_eventmgr32d:                               [  OK  ]
Starting dsm_sa_snmp32d:                                   [  OK  ]
Starting DSM SA Shared Services:                           [  OK  ]
Starting DSM SA Connection Service                         [FAILED]

# srvadmin-services.sh status
dcdbas (module) is stopped
dell_rbu (module) is stopped
dsm_sa_datamgr32d is stopped
dsm_sa_eventmgr32d is stopped
dsm_sa_snmp32d is stopped
dsm_om_shrsvc32d (pid 24981) is running
dsm_om_connsvc32d is stopped


I see these messages in /var/log/messages;

Sep 17 12:35:52 s-layout1 Server Administrator (Shared Library): Data
Engine EventID: 0  A semaphore set has to be created but the system
limit for the maximum number of semaphore sets has been exceeded
Sep 17 12:35:52 s-layout1 last message repeated 5 times
Sep 17 12:35:55 s-layout1 snmpd[4552]: [smux_accept] accepted fd 11 from
127.0.0.1:47240
Sep 17 12:35:55 s-layout1 snmpd[4552]: accepted smux peer: oid SNMPv2-
SMI::enterprises.674.10892.1, descr Systems Management SNMP MIB Plug-in
Manager
Sep 17 12:35:56 s-layout1 Server Administrator (Shared Library): Data
Engine EventID: 0  A semaphore set has to be created but the system
limit for the maximum number of semaphore sets has been exceeded
Sep 17 12:35:56 s-layout1 last message repeated 5 times
Sep 17 12:35:57 s-layout1 kernel: dcdbas dcdbas: Dell Systems Management
Base Driver (version 5.6.0-2)
Sep 17 12:35:57 s-layout1 instsvcdrv: dcdbas device driver loaded
Sep 17 12:35:58 s-layout1 Server Administrator (Shared Library): Data
Engine EventID: 0  A semaphore set has to be created but the system
limit for the maximum number of semaphore sets has been exceeded
Sep 17 12:35:58 s-layout1 last message repeated 2 times
Sep 17 12:35:59 s-layout1 snmpd[4552]: peer disconnected: SNMPv2-
SMI::enterprises.674.10892.1
Sep 17 12:36:00 s-layout1 dataeng: dsm_sa_snmp32d shutdown succeeded
Sep 17 12:36:01 s-layout1 dataeng: dsm_sa_eventmgr32d shutdown succeeded
Sep 17 12:36:02 s-layout1 Server Administrator (Shared Library): Data
Engine EventID: 0  A semaphore set has to be created but the system
limit for the maximum number of semaphore sets has been exceeded
Sep 17 12:36:02 s-layout1 instsvcdrv: dcdbas device driver unloaded
Sep 17 12:36:02 s-layout1 instsvcdrv: dell_rbu device driver unloaded
Sep 17 12:36:08 s-layout1 Server Administrator (Shared Library): Data
Engine EventID: 0  A semaphore set has to be created but the system
limit for the maximum number of semaphore sets has been exceeded



The kernel is 2.6.18-8.1.10.el5 #1 SMP Thu Aug 30 20:43:28 EDT 2007
x86_64 x86_64 x86_64 GNU/Linux


Thanks,
Frank


On Mon, 2007-09-17 at 10:57 -0400, Frank Warnke wrote:
> I have installed OMSA on two other PE1900's running RHEL 5 Server back
> in July 2007 without a problem.  Last Friday I tried installing OMSA
on
> a third PE1900 and ran in to a problem.
> 
> This one is also running RHEL 5 Server, but with all the OS updates
> released since July 2007.
> 
> OMSA was installed via these steps;
> 
>    1) wget -O- -q http://linux.dell.com/repo/hardware/bootstrap.cgi |
> bash
> 
>    2) yum install srvadmin-all
> 
>    3) srvadmin-services.sh start 
> 
> 
> Starting srvadmin looks OK;
> 
> # srvadmin-services.sh start
> Starting mptctl:
> Waiting for mptctl driver registration to complete:
>                                                            [  OK  ]
> 
> Starting Systems Management Device Drivers:
> Starting dell_rbu:                                         [  OK  ]
> Starting ipmi driver: Already started                      [  OK  ]
> Starting Systems Management Data Engine:
> Starting dsm_sa_datamgr32d:                                [  OK  ]
> Starting dsm_sa_eventmgr32d:                               [  OK  ]
> Starting dsm_sa_snmp32d:                                   [  OK  ]
> Starting DSM SA Shared Services:                           [  OK  ]
> 
> Starting DSM SA Connection Service:                        [  OK  ]
> 
> 
> However, logging in to OMSA via a web browser, does not show system
> information like Dell server model or the PERC 5/i with its drives.
> 
> Running srvadmin status shows datamgr32d as stopped;
> 
> # srvadmin-services.sh status
> dell_rbu (module) is running
> ipmi driver is running
> dsm_sa_datamgr32d is stopped
> dsm_sa_eventmgr32d (pid 8142) is running
> dsm_sa_snmp32d (pid 8152) is running
> dsm_om_shrsvc32d (pid 8182) is running
> dsm_om_connsvc32d (pid 8256 8255) is running
> 
> 
> Here is what a srvadmin restart looks like;
> 
> # srvadmin-services.sh restart
> Shutting down DSM SA Shared Services:                      [  OK  ]
> 
> 
> Shutting down DSM SA Connection Service:                   [  OK  ]
> 
> 
> Stopping Systems Management Data Engine:
> Stopping dsm_sa_snmp32d:                                   [  OK  ]
> Stopping dsm_sa_eventmgr32d:                               [  OK  ]
> Stopping dsm_sa_datamgr32d: Not started                    [FAILED]
> Stopping Systems Management Device Drivers:
> Stopping dell_rbu:                                         [  OK  ]
> Starting mptctl:
> Waiting for mptctl driver registration to complete:
>                                                            [  OK  ]
> 
> Starting Systems Management Device Drivers:
> Starting dell_rbu:                                         [  OK  ]
> Starting ipmi driver: Already started                      [  OK  ]
> Starting Systems Management Data Engine:
> Starting dsm_sa_datamgr32d:                                [  OK  ]
> Starting dsm_sa_eventmgr32d:                               [  OK  ]
> Starting dsm_sa_snmp32d:                                   [  OK  ]
> Starting DSM SA Shared Services:                           [  OK  ]
> 
> Starting DSM SA Connection Service:                        [  OK  ]
> 
> The srvadmin status is still the same;
> 
> # srvadmin-services.sh status
> dell_rbu (module) is running
> ipmi driver is running
> dsm_sa_datamgr32d is stopped
> dsm_sa_eventmgr32d (pid 9700) is running
> dsm_sa_snmp32d (pid 9710) is running
> dsm_om_shrsvc32d (pid 9740) is running
> dsm_om_connsvc32d (pid 9814 9813) is running
> 
> 
> I have Googled and checked system logs as well as the archives
> but so far I have not been able to solve this.  Any ideas on how 
> to proceed to troubleshoot this would be much appreciated.
> 
> Thanks,
> Frank

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge at dell.com
http://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq



More information about the Linux-PowerEdge mailing list