[Linux-PowerEdge] C6145 ECC Error, how to find bad DIMM?

Robert Jacobson Robert.C.Jacobson at nasa.gov
Tue Jan 29 06:58:49 CST 2013


On 1/28/2013 4:15 PM, John Hanks wrote:
> [snip]
>
> Does anyone know hos I can map #0x60 back to a specific DIMM slot or 
> even to a specific bank/CPU?

I don't, sorry...

> I'm really not looking forward to searching through 32 DIMMs, swapping 
> them one at a time and waiting to see if I get another ECC error.

You don't need to do them one at a time -- Boot from a memtest disk and 
use a binary search method to find the bad DIMM.   If you're really 
lucky you'll only have to run 5 tests.  At most, 10 tests will find it.

-- 
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Robert Jacobson               Robert.C.Jacobson at nasa.gov
Lead System Admin       Solar Dynamics Observatory (SDO)
Bldg 14, E222                             (301) 286-1591



More information about the Linux-PowerEdge mailing list