[Linux-PowerEdge] OMSA 8.3.0 not starting or segfaulting

Chandrasekhar_R at Dell.com Chandrasekhar_R at Dell.com
Fri Apr 29 02:14:02 CDT 2016


Hi Stefan,

Are you seeing this segfault issue on Re-branded servers or Dell brand as well?
Please get us the following command response on failing system:

smbios-sys-info-lite

Regards
Chandra



----------------------------------------------------------------------

Message: 1
Date: Fri, 29 Apr 2016 04:00:32 +0000
From:
Subject: Re: [Linux-PowerEdge] Talking to DRAC card from OS
To: , ,

Cc: Doug_Iler at Dell.com
Message-ID:

Content-Type: text/plain; charset="us-ascii"

Dell - Internal Use - Confidential
+Doug
What iDRAC version do you have? Latest release of iDRAC should work without Enterprise license.

From: linux-poweredge-bounces-Lists On Behalf Of Jreij, Elie
Sent: Friday, April 29, 2016 12:28 AM
To: raubvogel at gmail.com; linux-poweredge-Lists
Subject: Re: [Linux-PowerEdge] Talking to DRAC card from OS


Dell - Internal Use - Confidential
I assume you meant R320 instead of E320. Did you install the iDRAC Enterprise license? The dedicated iDRAC Ethernet port will not work without it on a R320.
Regards
Elie

-----Original Message-----
From: linux-poweredge-bounces-Lists On Behalf Of Mauricio Tavares
Sent: Thursday, April 28, 2016 6:29 AM
To: linux-poweredge-Lists
Subject: [Linux-PowerEdge] Talking to DRAC card from OS

I deployed a card to one of our poweredges E320 and when I tried to connect to it using its ethernet port, I get no connectivity in said port. So, since the host runs Linux and has openmanager, I was thinking on trying to connect to it from the OS and figure out what is going on. How can I do the deed?

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge at dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.us.dell.com/pipermail/linux-poweredge/attachments/20160429/acfa9f93/attachment-0001.html

------------------------------

Message: 2
Date: Fri, 29 Apr 2016 08:17:07 +0200 (CEST)
From: "Dietrich, Stefan"
Subject: [Linux-PowerEdge] OMSA 8.3.0 not starting or segfaulting
To: linux-poweredge at lists.us.dell.com
Message-ID:
Content-Type: text/plain; charset=utf-8

Hello,

I have a bunch of machines, were OMSA from DSU 16.04.00 is no longer starting or throwing segfaults.

For example, on a Dell PowerEdge R430 with Scientific Linux 6 (yes, not "supported"...), some services could not be started:

# /opt/dell/srvadmin/sbin/srvadmin-services.sh start Starting Systems Management Device Drivers:
Starting dell_rbu: [ OK ]
Starting ipmi driver:
Already started [ OK ]
Starting Systems Management Data Engine:
Starting dsm_sa_datamgrd: [FAILED]
Starting dsm_sa_eventmgrd: [FAILED]
Starting DSM SA Shared Services: [ OK ]

# rpm -qa srvadmin\* | sort
srvadmin-base-8.3.0-1908.9058.el6.x86_64
srvadmin-cm-8.3.0-1908.9058.el6.x86_64
srvadmin-deng-8.3.0-1908.9058.el6.x86_64
srvadmin-hapi-8.3.0-1908.9058.el6.x86_64
srvadmin-isvc-8.3.0-1908.9058.el6.x86_64
srvadmin-nvme-8.3.0-1908.9058.el6.x86_64
srvadmin-omacore-8.3.0-1908.9058.el6.x86_64
srvadmin-omacs-8.3.0-1908.9058.el6.x86_64
srvadmin-omcommon-8.3.0-1908.9058.el6.x86_64
srvadmin-omilcore-8.3.0-1908.9058.el6.x86_64
srvadmin-ominst-8.3.0-1908.9058.el6.x86_64
srvadmin-realssd-8.3.0-1908.9058.el6.x86_64
srvadmin-server-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-smcommon-8.3.0-1908.9058.el6.x86_64
srvadmin-storage-8.3.0-1908.9058.el6.x86_64
srvadmin-storage-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-storageservices-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-storelib-8.3.0-1908.9058.el6.x86_64
srvadmin-storelib-sysfs-8.3.0-1908.9058.el6.x86_64
srvadmin-sysfsutils-8.3.0-1908.9058.el6.x86_64
srvadmin-xmlsup-8.3.0-1908.9058.el6.x86_64

Afterwards, omreport does not recognize any components to monitor. There is no error message logged in /var/log/messages or anywhere else.

On a different PowerEdge R430 (same OS), OMSA is just segfaulting:

# /opt/dell/srvadmin/sbin/srvadmin-services.sh status
/etc/init.d/instsvcdrv: line 1589: 788703 Segmentation fault ${ISVCDD_SBIN_DIR}/${ISVCDD_DCHCFG_EXE} command=getsystype > /dev/null 2>&1
dcdbas (module) is running
dell_rbu (module) is running
/etc/init.d/instsvcdrv: line 1589: 788726 Segmentation fault ${ISVCDD_SBIN_DIR}/${ISVCDD_DCHCFG_EXE} command=getsystype > /dev/null 2>&1
dsm_sa_datamgrd is stopped
dsm_sa_eventmgrd is stopped
dsm_om_shrsvcd (pid 719966) is running

At least an error message is logged in /var/log/messages:
kernel: dchcfg[788726]: segfault at 0 ip 000000393f5336bf sp 00007ffccea3cf78 error 4 in libc-2.12.so[393f400000+18a000]

Any idea how to fix this and get proper hardware monitoring back?

Regards,
Stefan

--
------------------------------------------------------------------------
Stefan Dietrich Deutsches Elektronen-Synchrotron (IT-Systems)
Ein Forschungszentrum der Helmholtz-Gemeinschaft
Notkestr. 85
phone: +49-40-8998-4696 22607 Hamburg
e-mail: stefan.dietrich at desy.de Germany
------------------------------------------------------------------------



------------------------------

Message: 3
Date: Fri, 29 Apr 2016 12:06:04 +0530
From:
Subject: Re: [Linux-PowerEdge] OMSA 8.3.0 not starting or segfaulting
To: ,
Message-ID:


Content-Type: text/plain; charset="us-ascii"

Dell - Internal Use - Confidential
Hi,

If Customer installs OMSA 8.3 in an unsupported OS, it might fail because of NVMe implementation. But, you can suppress loading NVMe library by using the following steps to avoid any crash.

Please do the following steps for disabling psrvil which is required for detecting NVMe Devices:

1. To stop all the services, please execute the command - srvadmin-services.sh stop

2. Go to /opt/dell/srvadmin/etc/srvadmin-storage/ directory

3. Open stsvc.ini file

4. Please comment the line - vil7=dsm_sm_psrvil. For commenting it out, you need to write ";" and one space before the statement, like "; vil7=dsm_sm_psrvil"

5. Save and close stsvc.ini file

6. To start all the services, please execute the command - srvadmin-services.sh start

It will stop loading psrvil library and OMSS should work fine.

Note: After commenting out psrvil library in stsvc.ini file, NO NVMe devices will get detected by OMSA

Thanks
Souvik

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.us.dell.com/pipermail/linux-poweredge/attachments/20160429/33f7a202/attachment-0001.html

------------------------------

Message: 4
Date: Fri, 29 Apr 2016 09:01:27 +0200 (CEST)
From: "Dietrich, Stefan"
Subject: Re: [Linux-PowerEdge] OMSA 8.3.0 not starting or segfaulting
To: Souvik Bose
Cc: linux-poweredge at lists.us.dell.com
Message-ID:
Content-Type: text/plain; charset=utf-8

Hi Souvik,

sorry, I forgot to mention, that this workaround is already deployed.

It is also very strange, that I am seeing this issue not on all machines.
Other machines with the same hardware and OS release are not having any issues with OMSA.

Regards,
Stefan

> Dell - Internal Use - Confidential
> Hi,
>
> If Customer installs OMSA 8.3 in an unsupported OS, it might fail
> because of NVMe implementation. But, you can suppress loading NVMe
> library by using the following steps to avoid any crash.
>
> Please do the following steps for disabling psrvil which is required
> for detecting NVMe Devices:
>
> 1. To stop all the services, please execute the command -
> srvadmin-services.sh stop
>
> 2. Go to /opt/dell/srvadmin/etc/srvadmin-storage/ directory
>
> 3. Open stsvc.ini file
>
> 4. Please comment the line - vil7=dsm_sm_psrvil. For commenting it out,
> you need to write ";" and one space before the statement, like ";
> vil7=dsm_sm_psrvil"
>
> 5. Save and close stsvc.ini file
>
> 6. To start all the services, please execute the command -
> srvadmin-services.sh start
>
> It will stop loading psrvil library and OMSS should work fine.
>
> Note: After commenting out psrvil library in stsvc.ini file, NO NVMe
> devices will get detected by OMSA
>
> Thanks
> Souvik



------------------------------

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge at dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge

End of Linux-PowerEdge Digest, Vol 143, Issue 23
************************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.us.dell.com/pipermail/linux-poweredge/attachments/20160429/da2ca1bd/attachment.html 


More information about the Linux-PowerEdge mailing list