x86/edac/amd64: Add heterogeneous node support
From: | Naveen Krishna Chatradhi <nchatrad-AT-amd.com> | |
To: | <linux-edac-AT-vger.kernel.org>, <x86-AT-kernel.org> | |
Subject: | [PATCH v5 0/5] x86/edac/amd64: Add heterogeneous node support | |
Date: | Mon, 25 Oct 2021 20:20:13 +0530 | |
Message-ID: | <20211025145018.29985-1-nchatrad@amd.com> | |
Cc: | <linux-kernel-AT-vger.kernel.org>, <bp-AT-alien8.de>, <mingo-AT-redhat.com>, <mchehab-AT-kernel.org>, <yazen.ghannam-AT-amd.com>, Naveen Krishna Chatradhi <nchatrad-AT-amd.com> | |
Archive-link: | Article |
On newer heterogeneous systems with AMD CPUs the data fabrics of GPUs can be connected directly via custom links. This series of patchset does the following 1. amd_nb.c: a. Add support for northbridges on Aldebaran GPU nodes b. export AMD node map details to be used by edac and mce modules 2. mce_amd module: a. Identify the node ID where the error occurred and map the node id to linux enumerated node id. 3. amd64_edac module a. Add new family op routines b. Enumerate UMCs and HBMs on the GPU nodes c. Move fam_type structure into amd64_pvt struct This patchset is rebased on top of " commit 07416cadfdfa38283b840e700427ae3782c76f6b Author: Yazen Ghannam <yazen.ghannam@amd.com> Date: Tue Oct 5 15:44:19 2021 +0000 EDAC/amd64: Handle three rank interleaving mode " Muralidhara M K (3): x86/amd_nb: Add support for northbridges on Aldebaran EDAC/amd64: Extend family ops functions EDAC/amd64: Move struct fam_type into amd64_pvt structure Naveen Krishna Chatradhi (2): EDAC/mce_amd: Extract node id from MCA_IPID EDAC/amd64: Enumerate memory on Aldebaran GPU nodes arch/x86/include/asm/amd_nb.h | 9 + arch/x86/kernel/amd_nb.c | 150 +++++++-- drivers/edac/amd64_edac.c | 592 +++++++++++++++++++++++++--------- drivers/edac/amd64_edac.h | 39 ++- drivers/edac/mce_amd.c | 24 +- include/linux/pci_ids.h | 1 + 6 files changed, 630 insertions(+), 185 deletions(-) -- 2.25.1