Wednesday Mar 04, 2009

I see we quietly announced HERD 2.0 for Sun AMD based machines.

"Herd 2.0" what is that ?

"Hardware Error Report and Decode (HERD) utility" . Its a monitoring daemon for all Sun AMD processor based systems running fairly recent versions of suse and redhat linux that is designed to monitor the machine check log and decode any errors there.  Earlier version were available but had knowledge of the hardware topology of only certain Sun AMD processor based systems but with version 2.0 its ALL Sun AMD based systems.

With the complete topology of the systems it can decode the information that the CPU provides for a error into physical components eg to a specific DIMM

From the Sun download centre ... https://cds.sun.com/is-bin/INTERSHOP.enfinity/WFS/CDS-CDS_SMI-Site/en_US/-/USD/ViewProductDetail-Start?ProductRef=HERD-2.0-M-G-F@CDS-CDS_SMI

Low frequencies of correctable errors are an expected part of modern computing and you need to consult your hardware service  provider to determine when a DIMM is generating too many Correctable Errors and should be swapped.

About

timatworkhomeandinbetween

Search

Archives
« March 2009
MonTueWedThuFriSatSun
      
1
2
3
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
     
Today
News

No bookmarks in folder

Blogroll

No bookmarks in folder