Here we will run the NVIDIA RTX 6000 Ada through Unigine-related benchmarks.
Unigine Superposition
The latest Benchmark from Unigine is Superposition, which measures your graphics cards’ extreme performance and stability.
For this Benchmark, we set the resolution to 4K or 3840×2160 to push the graphics card as hard as possible. Again, generationally, this is not a top-end card, but it is faster than two previous-generation cards.
Unigine Heaven
With Unigine Heaven, we see the professional cards pull ahead of their consumer counterparts.
Next, we will look at the NVIDIA RTX 5000 Ada power and temperature tests and then give our final words.
I thought gaming GPUs didn’t use blowers because of an enforced market segmentation by Nvidia to prevent those GPUs from being used in the data center.
Yes. The dual slot blower is officially banned from nvidia for top-end cards. However we still have vendors making them and offering it as “cheap alternatives to dc cards”. This is something Nvidia does not want OEMs to make, but unable to ban OEM from making because the potential market is just so high.
The article doesn’t mention it, but I see these have been crippled when it comes to dual precision compute, just like the previous generations.
Ada, Ampere, Turing – no DP capable cards (or at least not severely cut down to 1/64 of SP performance).
Volta (GV100) is the last DP compute card released, and that’s becoming a bit outdated.
Once upon a time, these Quadros (or whatever they are named today) were engineer’s cards. You had to have a high-end one to accelerate engineering simulations (FE, CFD and similar). I guess AI stole the show, and nobody is going to cater to that small market anymore.
What’s the proper way to build a DP crunching workstation nowadays, anyway?
A key reason not discussed in the review for why some users will almost have to use this card vs a much cheaper and faster 4090 is the ECC Memory this card has. For some applications and uses, that is required, also for liability reasons.
Apart from all that, I prefer blower cards if available; unfortunately, current generation consumer cards in that design are almost impossible to find.
Actually you can enable ECC on 4090 the same way on RTX workstation cards. These cards both lose a portion of the total VRAM capacity if ECC is enabled, 4090 just has half of the total capacity.
@TurboFEM: That’s because both NVIDIA and AMD have abandoned FP64 in mainstream architectures as its market share is not worth the cost of silicon to implement it at full speed. Gaming doesn’t need it, and neither does AI/ML.
On NVIDIA side the H100 can perform one FP64 operation every 2 cycles, while the Ada can do it every only every 4 cycles. AMD has implemented native FP64 since CDNA 2, and further improved it in CDNA 3.
So basically for FP64 you need to go for the highest end compute accelerators.
@eastcoast_pete: There’s also driver support and qualifications that are critical for certain uses. Using mainstream cards and mainstream drivers is out of the question for them.
@Gnattu: NVIDIA specifically forbids using mainstream cards in commercial compute (via CUDA and driver EULAs), and actively goes after companies who, for example, rent them as public clouds. While you can try to use them internally, your legal department won’t be happy if they ever find out.
I’m glad to see that others have already mentioned blower GPUs didn’t fall out of favor with consumers, Nvidia enforced that consumer AIBs couldn’t use blowers to ensure that the cheaper RTX3090/4090 wouldn’t be used in workstations instead of their astronomically priced workstation GPUs. I see it’s very popular right now by blogs of all types to gloss over Nvidia’s hostile behavior towards consumers but it’s a damn shame.
Thanks John. That is good info as I am weighing upgrading from A6000 cards.
Running large language models is becoming increasingly common. Suggest having a benchmark for that in the future. For example running the 8x7b mixtral model is common these days.
@Kyle Actually I know some companies get caught. The nvidia geforce driver has its own telemetry so Nvidia knows what you are doing if you don’t cut this connection. The result? Companies now start to disable public Internet access for these nodes and you have to distribute work through a gateway so that the telemetry never reaches Nvidia. I know it is prohibited by Nvidia, but the amount of money we are talking about here is unlikely to be limited by an EULA.
@Gnattu: Oh sure you can work around this issue for internal use. The problem is when you try to sell it to the public as a cloud offering, for example. You can’t really hide the fact you’re using a consumer GPU then – your clients will be able to tell. The issue is whether those clients will care.
EULAs in general are a murky topic, but most “serious” companies will not even try to get into the grey zones.