By timatworkhomeandinbetween on Mar 04, 2009
"HERD 2.0": what is that?
HERD is the "Hardware Error Report and Decode" utility. It's a monitoring daemon for all Sun AMD processor-based systems running fairly recent versions of SUSE and Red Hat Linux, designed to monitor the machine check log and decode any errors found there. Earlier versions were available, but they knew the hardware topology of only certain Sun AMD processor-based systems; version 2.0 covers ALL Sun AMD-based systems.
With the complete topology of a system, it can decode the information the CPU provides about an error into physical components, e.g. a specific DIMM.
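To make the idea of topology-based decoding concrete, here is a minimal sketch in Python. It is not HERD's code or data: the table contents, the (node, channel, slot) key and the `locate_dimm` helper are all made up for illustration, and a real system's mapping depends on its board layout.

```python
# Hypothetical topology table, NOT taken from HERD:
# (cpu_node, memory_channel, slot) -> label printed on the board.
TOPOLOGY = {
    (0, 0, 0): "CPU0 DIMM0",
    (0, 1, 0): "CPU0 DIMM1",
    (1, 0, 0): "CPU1 DIMM0",
    (1, 1, 0): "CPU1 DIMM1",
}


def locate_dimm(node, channel, slot):
    """Map the location fields of a memory error to a physical DIMM label.

    Returns "unknown component" when the topology table has no entry,
    which is what happens when the tool does not know the platform.
    """
    return TOPOLOGY.get((node, channel, slot), "unknown component")
```

The fallback value shows why complete topology knowledge matters: without a matching table entry, an error can be reported but not pinned to a part you can pull.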
From the Sun download centre ... https://cds.sun.com/is-bin/INTERSHOP.enfinity/WFS/CDS-CDS_SMI-Site/en_US/-/USD/ViewProductDetail-Start?ProductRef=HERD-2.0-M-G-F@CDS-CDS_SMI
A low frequency of correctable errors is an expected part of modern computing. Consult your hardware service provider to determine when a DIMM is generating too many correctable errors and should be swapped.
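If you want to keep your own tally while you wait for that advice, something like the following Python sketch could work. It assumes you have already extracted (timestamp, DIMM label) pairs from the decoded errors; that input format, the `dimms_over_threshold` name, the window and the error limit are all assumptions for illustration, not HERD output or a Sun-recommended threshold.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def dimms_over_threshold(events, window, max_errors):
    """Return DIMM labels with more than max_errors errors inside any window.

    events: iterable of (timestamp, dimm_label) pairs (assumed format).
    window: timedelta for the sliding window.
    max_errors: placeholder limit; ask your service provider for a real one.
    """
    by_dimm = defaultdict(list)
    for timestamp, dimm in events:
        by_dimm[dimm].append(timestamp)

    flagged = []
    for dimm, stamps in by_dimm.items():
        stamps.sort()
        start = 0
        for end, timestamp in enumerate(stamps):
            # Drop errors that have fallen out of the window ending here.
            while timestamp - stamps[start] > window:
                start += 1
            if end - start + 1 > max_errors:
                flagged.append(dimm)
                break
    return sorted(flagged)
```

For example, `dimms_over_threshold(events, timedelta(hours=24), 2)` lists any DIMM that logged more than two correctable errors within a single 24-hour span, while a DIMM with the same number of errors spread over several days is left alone.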