YoloDotNet.ExecutionProvider.Cuda
1.0.0
dotnet add package YoloDotNet.ExecutionProvider.Cuda --version 1.0.0
NuGet\Install-Package YoloDotNet.ExecutionProvider.Cuda -Version 1.0.0
<PackageReference Include="YoloDotNet.ExecutionProvider.Cuda" Version="1.0.0" />
<PackageVersion Include="YoloDotNet.ExecutionProvider.Cuda" Version="1.0.0" />
<PackageReference Include="YoloDotNet.ExecutionProvider.Cuda" />
paket add YoloDotNet.ExecutionProvider.Cuda --version 1.0.0
#r "nuget: YoloDotNet.ExecutionProvider.Cuda, 1.0.0"
#:package YoloDotNet.ExecutionProvider.Cuda@1.0.0
#addin nuget:?package=YoloDotNet.ExecutionProvider.Cuda&version=1.0.0
#tool nuget:?package=YoloDotNet.ExecutionProvider.Cuda&version=1.0.0
Information
YoloDotNet uses modular execution providers to run inference on different hardware backends. Each provider targets a specific platform or accelerator and may require additional system-level dependencies such as runtimes, drivers, or SDKs.
Installing the NuGet package alone is not always sufficient — proper setup depends on the selected provider and the target system.
This document describes the installation, requirements, and usage of the CUDA & TensorRT execution provider.
Core Library Requirement
All execution providers require the core YoloDotNet package, which contains the shared inference pipeline, models, and APIs.
NuGet Package
dotnet add package YoloDotNet
Execution Provider - CUDA and TensorRT
The CUDA & TensorRT execution provider enables GPU-accelerated inference on NVIDIA GPUs using ONNX Runtime’s CUDA backend.
Optionally, NVIDIA TensorRT can be enabled to further optimize models for maximum throughput and ultra-low latency.
⚠️ Note
This execution provider is supported on Windows and Linux only.
CUDA and TensorRT are not available on macOS.
Requirements
- CUDA Toolkit 12.x
- cuDNN 9.x
- Windows or Linux (x64)
Important
This execution provider depends on native CUDA and cuDNN libraries.
Installing the NuGet package alone is not sufficient — system-level dependencies must be installed correctly.
Installation (Windows)
CUDA
Download and install the following from NVIDIA’s official websites:
After installing cuDNN, locate the folder containing the cuDNN DLL files. This is typically:
C:\Program Files\NVIDIA\CUDNN\v9.x\bin\v12.x(Replace v9.x and v12.x with the versions installed on your system)
Add cuDNN to the System PATH
Copy the full folder path to your cuDNN
bin\v12.xfolderSearch
Edit the system environment variablesin Windows search and select it.Click
Environment Variables.Under
System variables, selectPathand click Edit.Click
Newand paste the copied cuDNN path.Click
OKto save and close all dialogs.Reboot your system.
TensorRT (optional)
TensorRT is NVIDIA’s high-performance inference engine and can significantly improve performance by optimizing models for your specific GPU.
Download the
TensorRT 10.13.3release forCUDA 12.x.Extract the archive to a folder on your system.
Locate the
libfolder inside the extracted TensorRT folder.Copy the full path to this lib folder.
Add the path to your system's
PATHenvironment variable (same process as described in the CUDA installation steps).Reboot your system.
Installation (Linux)
CUDA
Install CUDA Toolkit 12.x for your Linux distribution
Install cuDNN 9.x for your Linux distribution
Reboot your system
TensorRT (optional)
Download the
TensorRT 10.13.3release forCUDA 12.x.Follow NVIDIA’s TensorRT installation instructions for Linux.
NuGet Package
dotnet add package YoloDotNet.ExecutionProvider.Cuda
Usage Example:
using YoloDotNet;
using YoloDotNet.ExecutionProvider.Cuda;
using var yolo = new Yolo(new YoloOptions
{
ExecutionProvider = new CudaExecutionProvider(
model: "path/to/model.onnx",
// GPU device index (default: 0)
gpuId: 0,
// Optional TensorRT configuration for maximum performance
trtConfig: new TensorRt
{
Precision = TrtPrecision.FP16,
EngineCachePath = "path/to/cache/folder",
EngineCachePrefix = "MyCachePrefix"
}
),
// ...other options
});
// See the TensorRT demo project for advanced configuration options.
Notes & Recommendations
- Use CUDA only if you want simple GPU acceleration with minimal setup.
- Enable TensorRT if you need maximum performance and are comfortable managing engine caches.
- TensorRT engine generation happens once per model and configuration and is cached for subsequent runs.
- CUDA and TensorRT are not supported on macOS.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
-
net8.0
- Microsoft.ML.OnnxRuntime.Gpu (>= 1.23.2)
- YoloDotNet (>= 4.0.0)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
| Version | Downloads | Last Updated |
|---|---|---|
| 1.0.0 | 248 | 12/14/2025 |
This is the first standalone release of the CUDA execution provider for YoloDotNet following the introduction of the new modular architecture.
The CUDA execution provider enables GPU-accelerated inference using ONNX Runtime’s CUDA backend and supports optional NVIDIA TensorRT integration for maximum performance, lower latency, and optimized execution on supported NVIDIA GPUs.
This provider targets high-performance and real-time inference workloads on Windows and Linux systems and requires the CUDA Toolkit and cuDNN to be installed on the host system. It is fully compatible with the YoloDotNet core library and follows the new execution-provider-agnostic design.