Broadcom AI Compute ASIC with Optical Attach Detailed at Hot Chips 2024

0
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_22
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_22

Broadcom is one of the companies behind many custom AI accelerators. As such, it is a big deal that the company is showing off an AI compute ASIC with optical attach during Hot Chips 2024, as this is likely being done for a customer project. Broadcom makes custom AI accelerators for hyper-scalers. The company also showed off its co-packaged optics and silicon photonics which we love to see.

Please note that we are doing these live at Hot Chips 2024 this week, so please excuse typos.

Broadcom AI Compute ASIC with Optical Attach Detailed at Hot Chips 2024

Co-packaged optics have been a topic for a long time.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_02
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_02

One of the big challenges in interconnects and networking is just the electrical I/O reach through PCB. There are designs using cables to go externally from the switch chip to the optical cages, but the endgame still feels like it is optics. On the other hand, the NVIDIA NVLink NVL72 is a great example of copper rather than optical attach.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_03
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_03

Silicon Photonics makes for optical modules with fewer components.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_04
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_04

Broadcom is now using silicon photonics and co-packaged optics for not just switches, but also for scale-up compute.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_05
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_05

For those wondering, that switch photo is the Broadcom Tomahawk 5 Bailly.

Broadcom Tomahawk 5 Bailly Co Packaged Optics Sampling
Broadcom Tomahawk 5 Bailly Co Packaged Optics Sampling

Here is the CPO schematic. An important difference between Intel’s old silicon photonics and what Broadcom is doing, is that Broadcom is using a pluggable laser. Lasers fail, so having a pluggable laser helps with serviceability. Likewise,

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_07
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_07

Broadcom has a version of the Tomahawk 5 slide we have shown before in this presentation with the components of CPO (co-packaged optics.)

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_08
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_08

The Tomahawk 4 “Humboldt” was the first generation system.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_10
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_10

That looks a lot like the OCP 2022 CPO demo for Broadcom.

Broadcom Co Packaged Optics Silicon Photonics OCP Summit 2022 5
Broadcom Co Packaged Optics Silicon Photonics OCP Summit 2022 5

This is the Tomahawk 4 implementation of Silicon Photonics.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_11
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_11

The new version is Tomahawk 5 Bailly. This is a 51.2T Ethernet switch in the same class as the Marvell Teralynx 10 51.2T 64-port 800GbE Switch we recently showed.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_12
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_12

In the new version, Broadcom has improved packaging. The FOWLP is used in things like mobile chips so it created a more stable platform for the optical interconnect.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_13
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_13

Here are the steps for creating the chips with engines.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_14
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_14

Here is the cross-section of the optical engine. Broadcom says that this is a more scalable way to manufacture and integrate the optical engines.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_15
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_15

Broadcom showed this idea of having 128 ports of 400G optics on the table a few months ago versus the Bially. Those optical modules include light sources, so it would have been more accurate if Broadcom included the light sources.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_16
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_16

Broadcom says that an 800G module will take 13-15W of power. With CPO, and removing things like DSP complexity, this is down to under 4.8W.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_17
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_17

One of the significant challenges is not just getting optical attached but also operating without errors.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_18
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_18

Here, Broadcom is saying that it is further optimizing the solution. These are steps to show vendors that the quality is up to what one gets with traditional pluggable optics.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_19
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_19

In 51.2T switches, the optical network pluggable use a lot of power. Using co-packaged optics lowers the total power. As a result, one saves around 30% power.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_20
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_20

The next step is using similar technology to combine compute ASICs with the co-packaged optics. Here we can see the CoWoS packaged with HBM, a compute ASIC, and the optical chiplet.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_22
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_22

The GPU attach looks a bit more advanced than the chip above with more HBM and more compute tiles. Still the idea is that one can get 64 links off of the chip to connect directly to switches.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_23
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_23

Moving the optics away from the xPU is also important because it moves optical engines further from the hot compute. This may not seem like a big deal, but it matters to ensure optics operate reliably.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_24
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_24

Today it is a 64x 100G device and two devices per for 12.8T. Where this gets crazy is going from 12.8T of density to 102.4T. That is an almost crazy amount of bandwidth.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_25
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_25

Broadcom is showing the use of bi-directional optics for high-radix networks. The transmit and receive are on different wavelengths.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_27
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_27

That bi-directional optics approach lowers the cost of fiber which is why it is attractive.

Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_28
Broadcom AI ASIC With Optical Connect Hot Chips 2024_Page_28

There is a lot here.

Final Words

Optical attach must happen at some point. Intel showed its Silicon Photonics Connector in 2022, but it is still using copper interconnects on its chips, even though there were plans to replace HBM stacks with optical connections via Lightbender next year in 2025 (Intel stopped Lightbender.) Broadcom does a lot of optical networking just by virtue of its networking business and is shipping co-packaged optics switches now. Moving to optical I/O is going to be a big deal for the AI space as companies strive to build bigger packages.

This is easily one of the coolest presentations this year.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.