video-sdk-samples
Sample applications that demonstrate usage of NVIDIA Video SDK APIs for GPU-accelerated video encoding/decoding.
nvidia-docker
Build and run Docker containers leveraging NVIDIA GPUs
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
FastPhotoStyle
Style transfer, deep learning, feature transform
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
vid2vid
PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
Megatron-LM
Ongoing research training transformer models at scale
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
pix2pixHD
Synthesizing and manipulating 2048x1024 images with conditional GANs
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
FasterTransformer
Transformer related optimization, including BERT, GPT
cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
cutlass
CUDA Templates for Linear Algebra Subroutines
DIGITS
Deep Learning GPU Training System
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
flownet2-pytorch
PyTorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
nccl
Optimized primitives for collective multi-GPU communication
libcudacxx
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
k8s-device-plugin
NVIDIA device plugin for Kubernetes
waveglow
A Flow-based Generative Network for Speech Synthesis
trt-llm-rag-windows
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
semantic-segmentation
NVIDIA Semantic Segmentation monorepo
DeepRecommender
Deep learning for recommender systems
Stable-Diffusion-WebUI-TensorRT
TensorRT Extension for Stable Diffusion Web UI
cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
warp
A Python framework for high performance GPU simulation and graphics
OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
VideoProcessingFramework
A set of Python bindings to C++ libraries that provide full hardware acceleration for video decoding, encoding, and GPU-accelerated color space and pixel format conversions
nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
Q2RTX
NVIDIA’s implementation of RTX ray-tracing in Quake II
open-gpu-doc
Documentation of NVIDIA chip/hardware interfaces
stdexec
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
deepops
Tools for building GPU clusters
partialconv
A New Padding Scheme: Partial Convolution based Padding
CUDALibrarySamples
CUDA Library Samples
gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
aistore
AIStore: scalable storage for AI applications
sentiment-discovery
Unsupervised Language Modeling at scale for robust sentiment classification
nvidia-container-runtime
NVIDIA container runtime
gpu-monitoring-tools
Tools for monitoring NVIDIA GPUs on Linux
retinanet-examples
Fast and accurate object detection with end-to-end GPU optimization
flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
jetson-gpio
A Python library that enables the use of Jetson's GPIOs
gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
nv-wavenet
Reference implementation of real-time autoregressive wavenet inference
libnvidia-container
NVIDIA container runtime library
tensorflow
An Open Source Machine Learning Framework for Everyone
spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
cuda-python
CUDA Python Low-level Bindings
cccl
CUDA C++ Core Libraries
MAXINE-AR-SDK
NVIDIA AR SDK - API headers and sample applications
nvvl
A library that uses hardware acceleration to load sequences of video frames to facilitate machine learning training
gvdb-voxels
Sparse volume compute and rendering on NVIDIA GPUs
nccl-tests
NCCL Tests
modulus
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
runx
Deep Learning Experiment Management
DLSS
NVIDIA DLSS is a new and improved deep learning neural network that boosts frame rates and generates beautiful, sharp images for your games
dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
Dataset_Synthesizer
NVIDIA Deep learning Dataset Synthesizer (NDDS)
NVFlare
NVIDIA Federated Learning Application Runtime Environment
nvcomp
Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloaded from https://developer.nvidia.com/nvcomp.
jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
libglvnd
The GL Vendor-Neutral Dispatch library
enroot
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
MDL-SDK
NVIDIA Material Definition Language SDK
PyProf
A GPU performance profiling tool for PyTorch models
AMGX
Distributed multigrid linear solver library on GPU
gpu-rest-engine
A REST API for Caffe using Docker and Go
nvbench
CUDA Kernel Benchmarking Library
framework-reproducibility
Providing reproducibility in deep learning frameworks
cuCollections
hpc-container-maker
HPC Container Maker
NeMo-Framework-Launcher
NeMo Megatron launcher and tools
NvPipe
NVIDIA-accelerated zero latency video compression library for interactive remoting applications
cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
data-science-stack
NVIDIA Data Science stack tools
cuQuantum
Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples
ai-assisted-annotation-client
Client side integration example source code and libraries for AI-Assisted Annotation SDK
nvidia-settings
NVIDIA driver control panel
DCGM
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low Dimensional (F0 and Energy) Speech Attributes.
fsi-samples
A collection of open-source GPU accelerated Python tools and examples for quantitative analyst tasks, leveraging the RAPIDS AI project, Numba, cuDF, and Dask.
tensorrt-laboratory
Explore the Capabilities of the TensorRT Platform
CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
gpu-feature-discovery
GPU plugin to the node feature discovery for Kubernetes
torch-harmonics
Differentiable spherical harmonic transforms and spherical convolutions in PyTorch
egl-wayland
The EGLStream-based Wayland external platform