Welcome Back Intel Xeon 6900P Reasserts Intel Server Leadership

4

A Quick Word on Intel Xeon 6980P Power

We usually try to avoid testing power consumption in development platforms, but that is all we had.

Intel Xeon 6980P Development Platform 2
Intel Xeon 6980P Development Platform 2

Generally these platforms just blast fans 24×7 to keep parts cool, but that can add a huge amount of power. Also, this test platform had more of the temperament of my 4 month old son. Sometimes it worked flawlessly, and sometimes you uninstall four DIMMs and it freaked out.

Intel Xeon 6980P Development Platform 5
Intel Xeon 6980P Development Platform 5

What we will simply say is this, each CPU can use 500W. Add to that another 120W or more for 24 DDR5 DIMMs, and a few watts for storage and other board components, and you are at around 1.2kW without cooling. One notable omission in the new systems however is the PCH, so that is 10W+ and a discrete part no longer sitting on the motherboard. Still, adding 10-20% for fans depending on the form factor and 1.3-1.6kW is going to be the new range for a dual socket server, before add-in cards or storage are brought into the mix. If you add a 100W+ NVIDIA BlueField-3 DPU, those numbers go up.

Something that Intel did, which was great, is that it showed a look at the power consumption of the CPUs below 100%. Most server CPUs do not run at 100% load. In fact, for a huge number of workloads, you do not want to run a CPU at 100% because that is when you start to see higher tail latencies, packets processed slowly, and so forth. Most cloud instances run at very low CPU utilization (under 25%.) As a result, the actual load on most CPUs in virtualization clusters is well under 50% at any given time. With that context, Intel showed that in the 30-70% utilization range it can get up to a 1.9x performance per watt gain.

Intel Xeon 6 Granite Rapids AP Launch Power Curve
Intel Xeon 6 Granite Rapids AP Launch Power Curve

The chips may use more power per socket, but doubling the cores per socket while not doubling the TDP means we can get a lot more efficiency.

Key Lessons Learned: Market Impact

Normally we would go into a look at the impact to AMD’s parts, but let us take a step back for a moment. This is really the high-performance line from Intel. If you want better power efficiency for running web servers, Sierra Forest is the answer. If you want a storage server, you are probably buying an older generation CPU or waiting for the smaller socket P-core CPUs and the R1S configuration in Q1.

Intel Xeon 6 Family With Clearwater Forest
Intel Xeon 6 Family With Clearwater Forest

Since you can see it in the above photo, yes, there is an image of the upcoming 18A Clearwater Forest CPU where we expect the E-core CPUs to get very attractive.

Intel Xeon Clearwater Forest
Intel Xeon Clearwater Forest

Still, from a raw performance perspective the Intel Xeon 6900P with 128 cores is stellar and takes the performance crown for now. Of course, AMD said Turin is coming in 2H 2024, so we are not far from AMD’s modern offering. In any event, Intel will be competing with the same core count, and a more comparable process technology, instead of at a 50% or more core count deficit and much older process technology that it has found itself at for half a decade.

AMD Computex 2024 Keynote 5th Gen AMD EPYC Turin 13 Chiplets 3nm And 6nm Process
AMD Computex 2024 Keynote 5th Gen AMD EPYC Turin 13 Chiplets 3nm And 6nm Process

What is interesting is that competing with fewer cores meant that Intel has focused on things like AMX for AI inference acceleration, QuickAssist for crypto and compression that is now supported by default in Ubuntu.

Intel Xeon 6 Granite Rapids AP Launch AI Capabilities
Intel Xeon 6 Granite Rapids AP Launch AI Capabilities

For those looking at more inference, Intel is also launching its Gaudi 3 AI accelerators today.

Intel Xeon 6 Granite Rapids AP Launch Gaudi 3 Head Nodes
Intel Xeon 6 Granite Rapids AP Launch Gaudi 3 Head Nodes

The big change is that Intel is also saying it is good for NVIDIA head nodes.

Intel Xeon 6 Granite Rapids AP Launch NVIDIA Head Nodes
Intel Xeon 6 Granite Rapids AP Launch NVIDIA Head Nodes

In the AI industry, one of the reasons companies like Supermicro took off was offering AMD EPYC versions of NVIDIA compute servers when AMD EPYC Genoa offered many more cores than Intel. Still, when you look at very large HGX H100 deployments, the 5th Gen Intel Xeon Emerald Rapids was extremely popular even at a core count deficit. One of the big reasons was just Intel’s memory controller and I/O die configuration. That advantage should be maintained with Granite Rapids, and the AMD EPYC advantage for higher core counts will go away. Still, in most AI servers four expensive GPUs are sold for each CPU.

Final Words

Intel Xeon 6 introduces a lot more complexity. P-cores, E-cores, 8 channel or 12 channel memory configurations, different memory types, accelerators and so forth. Through all of that though, there is one simple message: Intel is back. New process and packaging technology is allowing Intel to get back to core-to-core competitiveness with AMD. AMD will fire its Turin salvo soon-ish, and we will be back to debating architectures and accelerators. No longer will the question be 64 EPYC cores versus 28 Xeon cores, or 96 EPYC cores versus 64 Xeon cores. Intel is back in the game.

Intel Xeon 6900P Delidded
Intel Xeon 6900P Delidded

Of course, Intel is pursuing a much more complex strategy splitting E-cores and P-cores. My feedback to Intel last week that Sierra Forest-AP needs to be pulled into Q4 2024 so that Intel can have its cloud native 288 E-core versus 192 core/ 384 thread debate this year. Those CPUs, however are for a different segment.

For now, Intel has finally solved its core count issue, while also offering things like AMX, QAT, and more for acceleration, high-end DDR5 and MRDIMM (MCR DIMM) solutions for big memory bandwidth, and more. Intel lost its top-end edge when it lost process leadership with 10nm and Ice Lake. As the manufacturing and packaging side has improved, Intel is back in the fight. Welcome back Intel.

4 COMMENTS

  1. Wow can’t even hide Patrick’s love affair with Intel anymore can we ? Intel has not even properly launched this but yet it’s 128c Intel vs 96c Genoa, but AMD will have same 128c in 2 weeks time……just be honest finally and call it servingintel.com ;-)

  2. Yawn… Still low on PCIe lanes for a server footprint when GPUs and NVME storage is coming fast and furious. Intel needs to be sold so someone can finally innovate.

  3. Whether love or not, the numbers are looking good. For many an important question will be yield rates and pricing.

    I wonder why Epyc is missing from the povray speed comparison.

    One thing I’d like to see is a 4-core VM running Geekbench 6 while everything else is idle. After that Geekbench for an 8-core VM, 16-core, 32-core and so forth under similar circumstances. This sort of scaling analysis would help determine how well balanced the MCRDIMM memory subsystem is to the high-core-count processors–just the kind of investigative journalism needed right now.

    As an asside, I had to work over eight captchas for this post.

  4. The keyword would be availability. I checked just now, and these newer parts don’t have 1k Tray Pricing published yet. So not sure when would they be available. It felt painful to restrict the On-Premise Server procurement specification at 64 cores to get competitive bidding across vendors. Hope for the best.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.