[Poweredgec-tools] C6200 Memory error, how to locate the faulty memory

Sven Ulland sveniu at opera.com
Tue Jan 28 02:48:49 CST 2014


> How can I find which memory module has problem?

Instead of contacting Dell support, someone could send a patch to the
edac-utils project, specifically the src/etc/labels.db file, which
would map the various Dell PE server models' DIMM layouts to memory
controller addresses. It would then be a small matter of running
'edac-ctl --print-labels' after being notified about issues by the
EDAC driver:

# edac-util -v
mc0: 0 Uncorrected Errors with no DIMM info
mc0: 0 Corrected Errors with no DIMM info
mc0: csrow0: 0 Uncorrected Errors
mc0: csrow0: CPU_SrcID#0_Channel#0_DIMM#0: 10581 Corrected Errors
mc0: csrow0: CPU_SrcID#0_Channel#1_DIMM#0: 0 Corrected Errors
mc0: csrow0: CPU_SrcID#0_Channel#2_DIMM#0: 0 Corrected Errors
mc0: csrow0: CPU_SrcID#0_Channel#3_DIMM#0: 0 Corrected Errors
mc0: csrow1: 0 Uncorrected Errors
mc0: csrow1: CPU_SrcID#0_Channel#0_DIMM#1: 0 Corrected Errors
mc0: csrow1: CPU_SrcID#0_Channel#1_DIMM#1: 0 Corrected Errors
mc0: csrow1: CPU_SrcID#0_Channel#2_DIMM#1: 0 Corrected Errors
mc0: csrow1: CPU_SrcID#0_Channel#3_DIMM#1: 0 Corrected Errors
mc1: 0 Uncorrected Errors with no DIMM info
mc1: 0 Corrected Errors with no DIMM info
mc1: csrow0: 0 Uncorrected Errors
mc1: csrow0: CPU_SrcID#1_Channel#0_DIMM#0: 0 Corrected Errors
mc1: csrow0: CPU_SrcID#1_Channel#1_DIMM#0: 0 Corrected Errors
mc1: csrow0: CPU_SrcID#1_Channel#2_DIMM#0: 0 Corrected Errors
mc1: csrow0: CPU_SrcID#1_Channel#3_DIMM#0: 0 Corrected Errors
mc1: csrow1: 0 Uncorrected Errors
mc1: csrow1: CPU_SrcID#1_Channel#0_DIMM#1: 0 Corrected Errors
mc1: csrow1: CPU_SrcID#1_Channel#1_DIMM#1: 0 Corrected Errors
mc1: csrow1: CPU_SrcID#1_Channel#2_DIMM#1: 0 Corrected Errors
mc1: csrow1: CPU_SrcID#1_Channel#3_DIMM#1: 0 Corrected Errors

# edac-ctl --print-labels
edac-ctl: Error: No dimm labels for Dell Inc. 03C9JJ

A full map of all Dell systems with EDACs supported by Linux would be
awesome:

e752x_edac:   MC support for Intel e752x/3100 memory controllers
i3000_edac:   MC support for Intel 3000 memory hub controllers
i3200_edac:   MC support for Intel 3200 memory hub controllers
i5000_edac:   MC Driver for Intel I5000 memory controllers
i5100_edac:   MC Driver for Intel I5100 memory controllers
i5400_edac:   MC Driver for Intel I5400 memory controllers
i7300_edac:   MC Driver for Intel I7300 memory controllers
i7core_edac:  MC Driver for Intel i7 Core memory controllers
i82975x_edac: MC support for Intel 82975 memory hub controllers
sb_edac:      MC Driver for Intel Sandy Bridge memory controllers
x38_edac:     MC support for Intel X38 memory hub controllers

sven



More information about the Poweredgec-tools mailing list