MLPerf Inference tests see the new Azure ND GB300 v6 VMs achieve token performance that ‘fundamentally alters the calculus of ...
The deal will give Microsoft access to compute infrastructure built with Nvidia's GB300 GPUs, which will be deployed over ...
Microsoft sets AI inference speed record with Azure ND GB300 v6 VMs, achieving 1.1M tokens/sec using Nvidia GB300 GPUs.