WACS.WASI.NN.LlamaSharp 0.2.2

There is a newer version of this package available.
See the version list below for details.

dotnet add package WACS.WASI.NN.LlamaSharp --version 0.2.2

NuGet\Install-Package WACS.WASI.NN.LlamaSharp -Version 0.2.2

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="WACS.WASI.NN.LlamaSharp" Version="0.2.2" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="WACS.WASI.NN.LlamaSharp" Version="0.2.2" />
                    

                            Directory.Packages.props

<PackageReference Include="WACS.WASI.NN.LlamaSharp" />
                    

                            Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add WACS.WASI.NN.LlamaSharp --version 0.2.2

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: WACS.WASI.NN.LlamaSharp, 0.2.2"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package WACS.WASI.NN.LlamaSharp@0.2.2

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=WACS.WASI.NN.LlamaSharp&version=0.2.2
                    

                            Install as a Cake Addin

#tool nuget:?package=WACS.WASI.NN.LlamaSharp&version=0.2.2
                    

                            Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

WACS.WASI.NN.LlamaSharp

GGUF / llama.cpp backend for WACS.WASI.NN. Implements IBackend for graph-encoding.ggml against LLamaSharp; GPU runtimes pluggable via LlamaSharp's backend NuGets (CPU default; CUDA / Vulkan / MacMetal swaps with no source change).

Follows the WasmEdge wasi-nn GGUF convention:

Models resolved by name through wasi:nn/graph.load-by-name(name) — multi-GB GGUFs aren't passed through the canonical-ABI byte-loader path
Input / output are U8 tensors carrying UTF-8 prompt / response bytes. No tokenization or chat-template work in the guest — LlamaSharp does it all host-side

Install

dotnet add package WACS.WASI.NN.LlamaSharp

The package's bin ships its NuGet transitives + RID-specific native libs (via <EnableDynamicLoading>true</EnableDynamicLoading>), so Assembly.LoadFrom resolves everything from the LoadFromContext probe — no manual deps staging.

CLI quick start

LlamaSharp isn't bundled with WACS.Cli by default (unlike OnnxRuntime — the natives are too chunky to ride along). Pass the explicit path to the backend's bin:

mkdir -p ./models
huggingface-cli download Qwen/Qwen2.5-0.5B-Instruct-GGUF \
    qwen2.5-0.5b-instruct-q4_k_m.gguf --local-dir ./models

export WACS_WASINN_GGUF_DIR="$(pwd)/models"

# After dotnet build of this project's repo:
LLAMA=$(realpath Wacs.WASI/Wacs.WASI.NN/Wacs.WASI.NN.LlamaSharp/bin/Release/net8.0/Wacs.WASI.NN.LlamaSharp.dll)

wacs run my-llm.component.wasm --wasip2 --bind "$LLAMA"

--bind auto-pulls the WASI.NN typed surface + DI sibling onto host-packages when the identity starts with Wacs.WASI.NN.. The Preview 2 DI scope's auto-wire registers the backend in BOTH Backends[GGML] AND LoadByNameBackend; guests calling wasi:nn/graph.load-by-name(...) direct-link cleanly.

The full chain (with under-the-hood walkthrough) lives at docs/COMPONENT_CHAINING.md#gguf-inference-example-llamasharp-backend.

Embedder

Interpreter / one-line:

using Wacs.Core.Runtime;
using Wacs.WASI.NN;
using Wacs.WASI.NN.LlamaSharp;
using Wacs.WASI.NN.Types;

var registry = new Dictionary<string, string>
{
    ["qwen2.5-0.5b"] = "/path/to/qwen2.5-0.5b-instruct-q4_k_m.gguf",
};
var backend = LlamaSharpBackend.FromPaths(registry);

var runtime = new WasmRuntime();
runtime.UseWasiNN(b =>
{
    b.AddBackend(GraphEncoding.GGML, backend);
    b.Configuration.LoadByNameBackend = backend;   // route load-by-name
});

For the transpiler-direct-link / DI flow, just --bind <path> — the Preview 2 DI scope auto-discovers and wires.

GPU backend swap

Replace LLamaSharp.Backend.Cpu in the project's csproj with one of:

LLamaSharp.Backend.Cuda12 — NVIDIA
LLamaSharp.Backend.Vulkan — cross-vendor GPU
LLamaSharp.Backend.MacMetal — Apple Silicon

Then rebuild. The EnableDynamicLoading bin layout copies whichever backend NuGet's natives are pulled — no source change.

What it provides

LlamaSharpBackend : IBackend — implements LoadGraphByName(name, target) for registry-resolved GGUF files. The byte-loader path (LoadGraph(builders, target)) traps with UnsupportedOperation by design — see class docs for why
LlamaSharpBackend.FromPaths(IDictionary<string,string>) — convenience static factory for the simple "drop GGUFs in a directory" embedder flow
WasiNNLlamaSharpBindable : IBindable — parameterless adapter for --bind. Reads WACS_WASINN_GGUF_DIR, scans *.gguf, registers each under its filename-sans-extension

Backend choice

Use case	Package
GGUF / llama.cpp generative LLMs (`load-by-name` flow)	WACS.WASI.NN.LlamaSharp (this)
Standard ONNX inference	`WACS.WASI.NN.OnnxRuntime`
ONNX with ML.NET pipeline integration	`WACS.WASI.NN.MLNet`

Documentation

docs/COMPONENT_CHAINING.md — runtime requirements + GGUF example + chaining model
Wacs.WASI/Wacs.WASI.NN/README.md — backend matrix

License

Apache-2.0

Product	Compatible and additional computed target framework versions.
.NET	net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.

Product

.NET

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

net8.0
- LLamaSharp (>= 0.20.0)
- LLamaSharp.Backend.Cpu (>= 0.20.0)
- WACS.WASI.NN (>= 0.3.1)

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
0.2.4	94	5/12/2026
0.2.3	95	5/11/2026
0.2.2	93	5/10/2026
0.2.0	96	5/10/2026