NVIDIA Introduces AI Foundry Service to Accelerate Custom Generative AI Applications on Microsoft Azure

NVIDIA has launched an AI foundry service that aims to enhance the development and optimization of custom generative AI applications for enterprises and startups deploying on Microsoft Azure. The service combines NVIDIA AI Foundation Models, NVIDIA NeMo framework and tools, and NVIDIA DGX Cloud AI supercomputing services to provide an end-to-end solution for creating custom generative AI models. These models can be deployed using NVIDIA AI Enterprise software to power various generative AI applications, including intelligent search, summarization, and content generation. Industry leaders such as SAP SE, Amdocs, and Getty Images are already utilizing the service to build their own custom models.

"Enterprises need custom models to perform specialized skills trained on the proprietary DNA of their company—their data," said Jensen Huang, founder and CEO of NVIDIA. "NVIDIA's AI foundry service combines our generative AI model technologies, LLM training expertise, and giant-scale AI factory. We built this in Microsoft Azure so enterprises worldwide can connect their custom model with Microsoft's world-leading cloud services."

"Our partnership with NVIDIA spans every layer of the Copilot stack—from silicon to software—as we innovate together for this new age of AI," said Satya Nadella, chairman and CEO of Microsoft. "With NVIDIA's generative AI foundry service on Microsoft Azure, we're providing new capabilities for enterprises and startups to build and deploy AI applications on our cloud."

Industry Leaders Building Tailored, Timely LLMs

NVIDIA's AI foundry service can be used to customize models for generative AI-powered applications across various industries, including enterprise software, telecommunications, and media. Enterprises can utilize retrieval-augmented generation (RAG) to connect their models with their enterprise data and gain new insights.

SAP, as the first customer of NVIDIA DGX Cloud on Microsoft Azure, plans to use the service and optimized RAG workflow to customize and deploy Joule, its new natural language generative AI copilot. Amdocs, a leading provider of software and services to communications and media companies, is also optimizing models for the Amdocs amAIz framework to accelerate the adoption of generative AI applications and services for telcos globally.

Curated, Optimized Models for Custom Generative AI

Customers using the NVIDIA foundry service can choose from a range of NVIDIA AI Foundation models, including the new Nemotron-3 8B models. These models, hosted in the Azure AI model catalog, have been optimized with 8 billion parameters and offer multilingual capabilities for building custom enterprise generative AI applications.

NVIDIA DGX Cloud AI supercomputing is now available on Azure Marketplace, allowing customers to rent instances and scale up to thousands of NVIDIA Tensor Core GPUs. It comes with NVIDIA AI Enterprise software, including NeMo, to speed up LLM customization. Additionally, NVIDIA AI Enterprise software is integrated into Azure Machine Learning, providing Azure customers with access to NVIDIA's secure and supported AI and data science software.

NVIDIA AI Enterprise is also available on Azure Marketplace, offering businesses worldwide a wide range of options for production-ready AI development and deployment of custom generative AI applications.