Details
NIM microservices provide pre-optimized AI model containers that bundle inference engines such as NVIDIA TensorRT and TensorRT-LLM with industry-standard APIs, runtime dependencies, and enterprise-grade support. Developers can access NIM endpoints for over 40 AI models from NVIDIA and partners—including Meta Llama 3, Google Gemma, Microsoft Phi-3, and Mistral—through the NVIDIA Developer Program for free research and development, or in production via the NVIDIA AI Enterprise software platform. NVIDIA says NIM reduces model deployment times from weeks to minutes, and it has been embedded into platforms including Amazon SageMaker and Microsoft Azure AI, with over 150 ecosystem partners overall.
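Since NIM exposes industry-standard APIs, hosted endpoints can typically be called with an OpenAI-style chat-completions request. The sketch below illustrates that pattern; the endpoint URL (`integrate.api.nvidia.com`), the model identifier, and the `build_chat_request` helper are assumptions for illustration, not confirmed by this entry.

```python
import json
from urllib import request

# Assumed hosted NIM endpoint (OpenAI-compatible chat completions path).
API_URL = "https://integrate.api.nvidia.com/v1/chat/completions"


def build_chat_request(model: str, prompt: str, api_key: str):
    """Hypothetical helper: assemble URL, headers, and JSON body for a
    chat-completions call against a NIM endpoint."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 256,
    }
    return API_URL, headers, body


if __name__ == "__main__":
    # Model name below is an assumed catalog identifier.
    url, headers, body = build_chat_request(
        "meta/llama3-8b-instruct",
        "Summarize what NIM microservices do.",
        "YOUR_API_KEY",
    )
    req = request.Request(url, data=json.dumps(body).encode(), headers=headers)
    # resp = request.urlopen(req)  # uncomment with a valid API key to send
```

Because the request shape follows the OpenAI chat-completions convention, existing OpenAI-compatible client libraries can usually be pointed at the same endpoint by swapping the base URL and key.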