AMD EPYC 9005 Turin Turns Transcendent Performance with 768 Threads Per Server

8
AMD EPYC 9965 In Hand 2
AMD EPYC 9965 In Hand 2

The AMD EPYC 9005 is here. Not content with Intel Xeon 6900P reasserting Intel Server Leadership for a few weeks, the new AMD EPYC 9005 turned in what one might call a transcendent performance. In reality, AMD’s formula was simple: Increase TDP by 25%+. Increase core counts by 50%. It uses new process technology and update its server processors to Zen 5/ Zen 5c architecture. Combining all that, we have the newest AMD EPYC processors that are performing exceptionally well.

Of course, since this is a major industry launch, we have a video version:

We suggest watching this in its own browser, tab, or app for the best viewing experience. AMD sent us the hardware we are using for testing, including the server platform and three sets of SKUs. We must say this is sponsored by AMD. At the same time, we also needed to augment the configuration with parts that Solidigm and Broadcom sent us for other pieces, but that we decided to use here.

Solidigm D7 PS1010 Side By Side
Solidigm D7 PS1010 Side By Side

One of the most important findings we had while doing our performance testing was that using slower SSDs and NICs led to notable performance loss, especially at 192 cores.

Broadcom 400GbE OCP NIC 3.0 Angle 2
Broadcom 400GbE OCP NIC 3.0 Angle 2

Although neither company sent us these parts specifically for this piece, but we had these samples on hand and just put them to use since they were here. We must also say this is sponsored by Solidigm and Broadcom. Now that that is out of the way let us get to it.

AMD EPYC 9005 “Turin” SKUs

Let us start with the first and perhaps most impactful change. AMD EPYC “Turin” is the new Zen 5 and Zen 5c generation of server processors.

AMD EPYC 9965 Front 2
AMD EPYC 9965 Front 2

Since the SKU stack and pricing are big points here, let us just lead with what many of you want to see:

AMD EPYC 9005 SKU Stack 2024 10
AMD EPYC 9005 SKU Stack 2024 10

AMD has Zen 5c core counts ranging from 96 to 192 cores. Still, these are all “Turin” so there is no Genoa/ Bergamo split. Originally, when powering on our test platform, I saw the AMD EPYC 9755 and thought it was a Zen 5c part, like the EPYC 9754 was a Zen 4c part.

AMD EPYC 9755 Front 1
AMD EPYC 9755 Front 1

That was wrong. AMD made a strong move to unify the stack and it makes sense. Intel is showing P-cores and E-cores as different architectures and, importantly, with and without SMT. AMD is saying that it has the same instruction set with different core counts, clock speeds, and cache. Strong move AMD.

AMD EPYC Turin Zen 5 And Zen 5c
AMD EPYC Turin Zen 5 And Zen 5c

Two of the SKUs are 500W parts, which require updated power delivery in platforms. We still get “P” series single-socket SKUs, but only up to 96 cores. That is a change from previous AMD generations since we do not get a full core count P-series part.

AMD EPYC Naming Convention 2024 10
AMD EPYC Naming Convention 2024 10

The chip that is perhaps the most fun is the AMD EPYC 9175F. 16 cores for Microsoft and other per-core licensing, but with a 5GHz turbo and 512MB of L3 cache (32MB per core). That chip looks like a monster if you are paying on a per-core basis.

AMD EPYC 9005 Turin Pins 1
AMD EPYC 9005 Turin Pins 1

AMD’s list pricing seems very aggressive compared to what we are seeing from Intel in this generation. Intel list prices are designed to be discounted and it seems that AMD is taking a different approach.

AMD EPYC Revenue Share Gain 2017 2024H1
AMD EPYC Revenue Share Gain 2017 2024H1

We did not get a Genoa-X update in this generation. AMD told us that Genoa-X is still a strong platform, but we do not have roadmap updates beyond that other than it is a potential technology for future parts. While Turin is going to be a mass-market product, let us get real here. If you are building something like a 24-bay hard drive storage server or something else that does not need the full 12-channel SP5 socket, PCIe Gen5, and so forth, then you are probably better off getting AMD EPYC Siena, Milan, or the EPYC 4004 series.

AMD EPYC Roadmap 2024 10
AMD EPYC Roadmap 2024 10

We already covered the Zen 5 and Zen 5c improvements in a lot of detail earlier this year. Unlike on the client side, server chips are homogeneous with either Zen 5 or Zen 5c, not both.

AMD EPYC 9005 Generational Improvements
AMD EPYC 9005 Generational Improvements

If you are familiar with the 4th Gen AMD EPYC SoCs, then this is going to be the big one. Here are all of the features of the new SoCs. Major differences are highlighted such as the core configurations, 512b AVX-512, memory speeds, CXL support, and more. This is the slide many will want to see at STH.

AMD EPYC 9005 Turin SoC
AMD EPYC 9005 Turin SoC

Still, it is the same instruction set between the two, so unlike Intel with different P and E cores, the right way to think about the new cores is that they are being tuned with different frequency, power, cache, and density profiles. The overall platform, however is the same instead of Intel’s two platform (Xeon 6700 and Xeon 6900 series) model. AMD instead packages higher core density Zen 5c cores and higher cache and clock speed Zen 5 cores into different CCDs and packages.

AMD EPYC 9005 Turin Chiplets
AMD EPYC 9005 Turin Chiplets

I asked AMD about the 32x SATA support since that seems like an inefficient use of lanes. AMD agreed that it is not the most useful today, but needs to have the support for platform compatibility. My sense is that we will see SATA support drop in some future generation because that is a very legacy feature that can be supported well by add-in cards for those applications that need it.

AMD Volcano Turin Platform Populated
AMD Volcano Turin Platform Populated

AMD continues to have the ability to use its I/O lanes either for socket-to-socket or as more PCIe lanes for single socket servers. AMD’s single socket offering has more PCIe Gen5 I/O than the Intel Xeon 6900P, but the upcoming Intel R1S single-socket solution will have more. In 2P configurations, the Intel Xeon 6900P has 192 PCIe Gen5 lanes while AMD is at up to 160 lanes. Competition is good!

AMD EPYC 9005 Turin Platform Overview
AMD EPYC 9005 Turin Platform Overview

Next, let us get to memory capabilities.

8 COMMENTS

  1. Smart Data Cache Injection (SDCI) which allows direct insertion of data from I/O devices into L3 cache could be a huge gain for low latency network IO workloads. It’s similar to Intel’s Data Direct I/O (DDIO).

  2. 768 threads in a server & benchmarks… for a fun test you could run a CPU rendered 3D FPS game. IIRC there is a cpu version of Crysis out there somewhere.

  3. i have a few questions_

    Why does the client needs so many vms to run a workload instead of using containers and drastically reduce overhead?

    Second question: can you go buy a 9175f and test that one with gaming?

  4. I’m surprised you perform benchmarks on such 2P system with NPS=1 and “L3 as Numa Domain” turned off.
    Such a processor deserved an NPS=4 + L3_LLC=On to let the Linux kernel do proper scheduling.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.