scanning for bad ram

Paul A razor at meganet.net
Tue Nov 6 13:42:33 CST 2007


Nick, thanks for the information. 

The reason I'm asking is that we have three 1900s, bought refurbished, and
one application is exiting with status 11 (SIGSEGV) on two of the servers.
The provider of the software tells me it's probably due to hardware or RAM
failure.
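
For reference, status 11 matches the signal number of SIGSEGV on Linux. The
following is only a minimal sketch, assuming the application is started from
a Python wrapper; the helper name describe_returncode is hypothetical and not
part of any tool mentioned in this thread.

```python
import signal


def describe_returncode(returncode):
    """Turn a process return code into a short human-readable description."""
    if returncode < 0:
        # Python's subprocess module reports death-by-signal as the
        # negated signal number, e.g. -11 for SIGSEGV.
        try:
            return "killed by " + signal.Signals(-returncode).name
        except ValueError:
            return "killed by unknown signal %d" % -returncode
    if returncode > 128:
        # Shells conventionally report a signal death as 128 + signal number.
        try:
            return "shell-style status for " + signal.Signals(returncode - 128).name
        except ValueError:
            pass
    return "exited with status %d" % returncode
```

With subprocess, the value to pass in would be the returncode attribute of
the completed process. A segfault on its own does not prove bad RAM, so this
only helps confirm how the process died.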

Can I run OMSA and test the RAM hardware while the server is up, and will it
affect data stored in memory? I can always take one of the servers I'm
testing offline if it does.

paul
________________________________________
From: Nick_Parrott at Dell.com [mailto:Nick_Parrott at Dell.com] 
Sent: Tuesday, November 06, 2007 1:19 PM
To: razor at meganet.net; linux-poweredge at lists.us.dell.com
Subject: RE: scanning for bad ram 

It will indeed: both single-bit and multi-bit errors.

Single-bit errors are ECC-recoverable, so you won't see any effect on
applications; multi-bit errors you'll probably know about.

Any fault that the BMC (Baseboard Management Controller) logs will be
displayed in OMSA under "Logs" > "BMC/SEL"
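
Separately from OMSA, the Linux kernel can expose corrected (single-bit) and
uncorrected (multi-bit) ECC counts through its EDAC sysfs interface. The
sketch below is an alternative check, not the OMSA method described above,
and it assumes an EDAC driver is loaded for the memory controller; the
function name read_ecc_counts is hypothetical.

```python
from pathlib import Path

# Standard EDAC location; it only exists if an EDAC driver is loaded.
EDAC_ROOT = Path("/sys/devices/system/edac/mc")


def read_ecc_counts(root=EDAC_ROOT):
    """Return {controller: {"corrected": n, "uncorrected": n}} from EDAC sysfs."""
    counts = {}
    for mc in sorted(Path(root).glob("mc[0-9]*")):
        try:
            corrected = int((mc / "ce_count").read_text().strip())
            uncorrected = int((mc / "ue_count").read_text().strip())
        except (OSError, ValueError):
            # Skip controllers whose counters are missing or unreadable.
            continue
        counts[mc.name] = {"corrected": corrected, "uncorrected": uncorrected}
    return counts


if __name__ == "__main__":
    for name, c in read_ecc_counts().items():
        print("%s: corrected=%d uncorrected=%d"
              % (name, c["corrected"], c["uncorrected"]))
```

An empty result means no EDAC controllers were found, which says nothing
about the health of the DIMMs.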

Use Dell MPmemory to test DIMMs offline; it's on the 32-bit Diagnostics CDs.
It's a modified memtest86. Stock memtest86, however, will fail immediately,
as the system BIOS reserves a very small portion of memory space upon boot.

Nick

From: linux-poweredge-bounces at dell.com
[mailto:linux-poweredge-bounces at dell.com] On Behalf Of Paul A
Sent: 06 November 2007 17:49
To: linux-poweredge-Lists
Subject: scanning for bad ram 

I have yet to install OMSA on my 1950s. I know it will monitor disks, but
will it also monitor and report problems with RAM?
Paul


