
NVIDIA and Google Cut AI Inference Costs
Google and NVIDIA announced new hardware at Google Cloud Next aimed at reducing AI inference costs at scale. A5X instances running on Vera Rubin NVL72 systems deliver 10 times lower cost per token along with higher throughput. The announcement also covers confidential computing, agent platforms, and manufacturing tools, with OpenAI, CrowdStrike, and others among the users.
