Skip to content

Here’s How NVIDIA’s Blackwell Ultra GB300 AI Racks Are Dominating Long-Context DeepSeek Workloads, Delivering Impressive Gains Versus GB200

A person stands next to a large NVIDIA data center server rack with multiple GPUs and visible branding.

NVIDIA’s GB300 NVL72 AI racks have been tested across DeepSeek’s latest open source models, and through fine-tuning and optimized inference, the results are indeed promising. NVIDIA’s Blackwell Ultra Scores Up to a 1.5x Lead Over GB200 NVL72 In Latency-Sensitive Workloads With GB300, NVIDIA’s primary focus has been on delivering optimal long-context performance in order to capitalize on the agentic AI wave. In a recent post, we discussed how Blackwell Ultra delivers a 50x increase in throughput per megawatt compared to Hopper GPUs through its extreme co-design approach. Now, the Large Model Systems Organization (LMSYS) has tested GB300 NVL72 for long-context […]

Read full article at https://wccftech.com/heres-how-nvidia-blackwell-ultra-is-dominating-long-context-deepseek-workloads/