NVIDIA NemoClaw is an open source stack that simplifies running OpenClaw always-on assistants more safely, with a single command. It installs the NVIDIA OpenShell runtime, part of NVIDIA Agent Toolkit, a secure environment for running autonomous agents, and open source models like NVIDIA Nemotron.
NVIDIA OpenShell is an open source runtime to build and deploy autonomous, self-evolving agents more safely. OpenShell sits between your agent and your infrastructure to govern how the agent executes, what the agent can see and do, and where inference goes. It enables claws to run in isolated sandboxes, with fine-grained control over privacy and security.
A flexible, component based, data center scale inference serving framework designed to meet the demands of complex use cases including those of Generative AI.
NVIDIA NeMo is a modular software suite of APIs and libraries that help developers manage the AI agent lifecycle—building, deploying, and optimizing AI agents at scale.
Part of NVIDIA AI Enterprise, NVIDIA NIM microservices are a set of easy-to-use microservices for accelerating the deployment of foundation models on any cloud or data center and helps keep your data secure. NIM microservices have production-grade runtimes including on-going security updates.
NVIDIA AI Enterprise is an end-to-end platform for developing, deploying, and managing AI applications. It includes AI frameworks, NIM microservices, and SDKs in the Application Layer and GPU drivers, Kubernetes operators, and cluster management tools in the Infrastructure Layer — each with independent release branches, lifecycle policies, and enterprise support.
NVIDIA Omniverse is a cloud-native, multi-GPU, real-time simulation and collaboration platform for 3D production pipelines based on Pixar's Universal Scene Description (USD) and NVIDIA RTX.
NVIDIA virtual GPU (vGPU) software is a graphics virtualization platform that extends the power of NVIDIA GPU technology to virtual desktops and apps, offering improved security, productivity, and cost-efficiency.
Built from the ground up for enterprise AI, the NVIDIA DGX platform incorporates the best of NVIDIA software, infrastructure, and expertise in a modern, unified AI development and training solution.
The NVIDIA JetPack SDK, which is the most comprehensive solution for building AI applications, along with L4T and L4T Multimedia, provides the Linux kernel, bootloader, NVIDIA drivers, flashing utilities, sample filesystem, and more for the Jetson platform.
NVIDIA TensorRT is an SDK for high-performance deep learning inference. It is designed to work in a complementary fashion with training frameworks such as TensorFlow, PyTorch, and MXNet. It focuses specifically on running an already-trained network quickly and efficiently on NVIDIA hardware.
CUTLASS is a collection of CUDA C++ template abstractions for implementing high-performance matrix-matrix multiplication (GEMM) and related computations at all levels and scales within CUDA.
The integration of NVIDIA RAPIDS into the Cloudera Data Platform (CDP) provides transparent GPU acceleration of data analytics workloads using Apache Spark. This documentation describes the integration and suggested reference architectures for deployment.
nvCOMP is a high performance GPU enabled data compression library. Includes both open-source and non-OS components. The nvCOMP library provides fast lossless data compression and decompression using a GPU. It features generic compression interfaces to enable developers to use high-performance GPU compressors in their applications.
The NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting, profiling and optimizing teams of AI agents. With it, developers can easily accelerate and evaluate enterprise-ready agentic AI systems.
NVIDIA AI Aerial™ is a portfolio of accelerated computing platforms, software, and services for designing, simulating, and operating wireless networks. Aerial contains hardened RAN software libraries for telcos, cloud service providers (CSPs), and enterprises building commercial 5G networks. Academic and industry researchers can access Aerial on cloud or on-premises setups for advanced wireless and AI/machine learning (ML) research for 6G.
NVIDIA AI Enterprise is an end-to-end platform for developing, deploying, and managing AI applications. It includes AI frameworks, NIM microservices, and SDKs in the Application Layer and GPU drivers, Kubernetes operators, and cluster management tools in the Infrastructure Layer — each with independent release branches, lifecycle policies, and enterprise support.
NVIDIA AI for Media SDKs provide programmatic control of AI features that enhance audio, video, and augmented reality (AR) effects for video conferencing and telepresence.
An AI grid is a set of geographically distributed and interconnected AI infrastructure that works as a unified intelligence platform. This platform enables secure placement of workloads where they run best, balancing performance, cost, and latency.
The essential companion for PC gamers and creators. Keep your PC up to date with the latest NVIDIA drivers and technology. Optimize games and applications with a new unified GPU control center, capture your favorite moments with powerful recording tools through the in-game overlay, and discover the latest NVIDIA tools and software.
The NVIDIA Attestation Suite enhances Confidential Computing by providing robust mechanisms to ensure the integrity and security of devices and platforms. The suite includes NVIDIA Remote Attestation Service (NRAS), the Reference Integrity Manifest (RIM) Service, and the NDIS OCSP Responder.
NVIDIA Base Command Manager streamlines cluster provisioning, workload management, and infrastructure monitoring. It provides all the tools you need to deploy and manage an AI data center.
NVIDIA Base Command Platform is a world-class infrastructure solution for businesses and their data scientists who need a premium AI development experience.
NVIDIA BaseOS delivers robust, production-ready operating environments optimized for AI, machine learning, and data analytics workloads. It includes tailored system configurations, optimized drivers, comprehensive diagnostics, and advanced monitoring tools. BaseOS is supported on many Linux distributions.
An AI/ML development platform that allows you to run, build, train, and deploy ML models on the cloud. Brev allows you to start small on a CPU instance and effortlessly scale to larger GPU clusters for any workload.
NVIDIA Bright Cluster Manager offers fast deployment and end-to-end management for heterogeneous HPC and AI server clusters at the edge, in the data center and in multi/hybrid-cloud environments. It automates provisioning and administration for clusters ranging in size from a single node to hundreds of thousands, supports CPU-based and NVIDIA GPU-accelerated systems, and orchestration with Kubernetes.
NVIDIA’s program that enables enterprises to confidently deploy hardware solutions that optimally run accelerated workloads—from desktop to data center to edge.
NVIDIA® Clara™ is an open, scalable computing platform that enables developers to build and deploy medical imaging applications into hybrid (embedded, on-premises, or cloud) computing environments to create intelligent instruments and automate healthcare workflows.
Compute Sanitizer is a functional correctness checking suite included in the CUDA toolkit. This suite contains multiple tools that can perform different type of checks. The memcheck tool is capable of precisely detecting and attributing out of bounds and misaligned memory access errors in CUDA applications. The tool can also report hardware exceptions encountered by the GPU. The racecheck tool can report shared memory data access hazards that can cause data races. The initcheck tool can report cases where the GPU performs uninitialized accesses to global memory. The synccheck tool can report cases where the application is attempting invalid usages of synchronization primitives. This document describes the usage of these tools.
Cosmos Curator and post-training services are fully managed AI services for video curation and model customization, enabling enterprises to efficiently process, fine-tune, and deploy video and world foundation models.
The NVIDIA® CUDA® Toolkit provides a comprehensive development environment for C and C++ developers building GPU-accelerated applications. With the CUDA Toolkit, you can develop, optimize, and deploy your applications on GPU-accelerated embedded systems, desktop workstations, enterprise data centers, cloud-based platforms and HPC supercomputers.
The NVIDIA CUDA® Deep Neural Network (cuDNN) is a GPU-accelerated library of primitives for deep neural networks. cuDNN provides highly tuned implementations for standard routines such as forward and backward convolution, attention, matmul, pooling, and normalization.
The CUDA Profiling Tools Interface (CUPTI) enables the creation of profiling tools that target CUDA applications. CUPTI Python provides Python APIs for creation of profiling tools that target CUDA Python applications.
cuPyNumeric is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the NumPy API on top of the Legion runtime. Using cuPyNumeric you do things like run the final example of the Python CFD course completely unmodified on 2048 A100 GPUs in a DGX SuperPOD and achieve good weak scaling.
CUTLASS is a collection of CUDA C++ template abstractions for implementing high-performance matrix-matrix multiplication (GEMM) and related computations at all levels and scales within CUDA.
NVIDIA cuVS is an open-source library for GPU-accelerated vector search and data clustering that enables higher throughput search, lower latency, and faster index build times.
The NVIDIA Data Loading Library (DALI) is a collection of highly optimized building blocks, and an execution engine, for accelerating the pre-processing of input data for deep learning applications. DALI provides both the performance and the flexibility for accelerating different data pipelines as a single library. This single library can then be easily integrated into different deep learning training and inference applications.
NVIDIA CPUs are purpose-built for the modern data center and delivers high bandwidth, deterministic latency, and exceptional energy efficiency. The NVIDIA® Grace™ CPU powers tightly coupled, memory-coherent systems with NVIDIA Blackwell and NVIDIA Hopper™ GPUs and efficient CPU-only platforms. Featuring 72 Arm® v9 cores, LPDDR5X memory, and the Scalable Coherency Fabric, the Grace CPU delivers up to twice the energy efficiency of traditional CPUs. Built on the Arm SystemReady SR, the CPU supports Arm-compatible operating systems, peripherals, and NVIDIA’s full AI and HPC software ecosystem.
NVIDIA Data Center GPU drivers are used in Data Center GPU enterprise deployments for AI, HPC, and accelerated computing workloads. Documentation includes release notes, supported platforms, and cluster setup and deployment.
Deep Graph Library (DGL) is a framework-neutral, easy-to-use, and scalable Python library used for implementing and training Graph Neural Networks (GNN). Being framework-neutral, DGL is easily integrated into an existing PyTorch, TensorFlow, or an Apache MXNet workflow.
GPUs accelerate machine learning operations by performing calculations in parallel. Many operations, especially those representable as matrix multipliers will see good acceleration right out of the box. Even better performance can be achieved by tweaking operation parameters to efficiently use GPU resources. The performance documents present the tips that we think are most widely useful.
Built from the ground up for enterprise AI, the NVIDIA DGX platform incorporates the best of NVIDIA software, infrastructure, and expertise in a modern, unified AI development and training solution. Every aspect of the DGX platform is infused with NVIDIA AI expertise, featuring world-class software, record-breaking NVIDIA-accelerated infrastructure in the cloud or on-premises, and direct access to NVIDIA DGXPerts to speed the ROI of AI for every enterprise.
Deployment and management guides for NVIDIA DGX SuperPOD, an AI data center infrastructure platform that enables IT to deliver performance—without compromise—for every user and workload. DGX SuperPOD offers leadership-class accelerated infrastructure and agile, scalable performance for the most challenging AI and high-performance computing (HPC) workloads, with industry-proven results.
The NVIDIA Deep Learning GPU Training System (DIGITS) can be used to rapidly train highly accurate deep neural networks (DNNs) for image classification, segmentation, and object-detection tasks. DIGITS simplifies common deep learning tasks such as managing data, designing and training neural networks on multi-GPU systems, monitoring performance in real time with advanced visualizations, and selecting the best-performing model from the results browser for deployment.