Anthropic's Claude family of AI models is now generally available on Microsoft Azure, running on Nvidia's Blackwell Ultra GB300 GPU systems — the startup's first deployment on Nvidia hardware and a milestone in the three-way partnership announced last November.
"This deployment is designed to improve inference performance and efficiency while reducing the total cost of ownership for enterprise AI workloads," Anthropic said in a statement Monday. The models run on Nvidia GB300 NVL72 systems with Quantum-X800 InfiniBand networking, enabling customers to deploy autonomous agents that can operate across different business functions.
The initial lineup includes Claude Opus 4.8 and Claude Haiku 4, with Anthropic saying it will continue expanding model availability on Azure. Microsoft handles billing, authentication and governance through its Foundry platform, lowering the integration barrier for enterprises already in the Azure ecosystem. The GB300 NVL72 systems pair a 72-core ARM CPU with the Blackwell Ultra GPU, a configuration Nvidia also uses in its recently announced DGX Station desktop — a $90,000 to $100,000 workstation with 748GB of unified memory capable of running 70-billion-parameter models locally.
The technical emphasis falls on autonomous agent workloads. Through Nvidia-verified agent skills, enterprises can equip Claude agents with domain-specific capabilities — effectively embedding AI agents into core operational workflows rather than treating them as standalone tools. Nvidia's Secure Agent Workspace Reference Design provides infrastructure-level controls for identity, networking, credentials and runtime policies, a design that targets regulated industries such as finance, healthcare and legal services where data compliance requirements are most stringent.
The commercial launch converts a November 2025 framework agreement among Microsoft, Nvidia and Anthropic into a deliverable product. For Nvidia, the deployment validates the Blackwell Ultra GB300 as an enterprise inference platform at a time when hyperscalers are racing to secure GPU capacity. Microsoft gains an exclusive enterprise distribution channel for Claude on Azure, strengthening its position against Amazon Web Services and Google Cloud in the AI agent market. For Anthropic, the Azure partnership provides a distribution path that competes directly with OpenAI's Microsoft relationship — though OpenAI remains the dominant workload on Azure's AI infrastructure.
The enterprise agent market is the prize. As companies shift from experimenting with large language models to deploying production systems that automate complex business tasks, the infrastructure layer that supports those agents becomes a strategic bottleneck. Nvidia's GB300 systems, with their unified memory architecture and high-bandwidth networking, are designed to handle the inference demands of multi-agent architectures where specialized sub-agents coordinate across departments. Bit Origin Ltd, an emerging AI infrastructure firm, recently acquired 16 Nvidia Blackwell B300 servers for about $11 million, expecting them to generate roughly $360,000 in monthly revenue — a data point that illustrates the revenue potential of Blackwell-based infrastructure.
Nvidia shares have gained more than 140% over the past 12 months, trading at about 35 times forward earnings, as enterprise AI spending continues to accelerate. The Blackwell Ultra GB300 represents the company's latest attempt to defend its estimated 80% share of the AI accelerator market against in-house chips from Amazon, Google and AMD. For investors, the question is whether Nvidia can maintain pricing power as hyperscalers develop custom alternatives — or whether the shift to agentic AI workloads creates enough incremental demand to absorb both Nvidia's supply and its competitors' output.
This article is for informational purposes only and does not constitute investment advice.