Cerebras.Cloud.Sdk.Unofficial 2025.7.8-rc.6

This is a prerelease version of Cerebras.Cloud.Sdk.Unofficial.

There is a newer prerelease version of this package available.
See the version list below for details.

dotnet add package Cerebras.Cloud.Sdk.Unofficial --version 2025.7.8-rc.6

NuGet\Install-Package Cerebras.Cloud.Sdk.Unofficial -Version 2025.7.8-rc.6

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="Cerebras.Cloud.Sdk.Unofficial" Version="2025.7.8-rc.6" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package:

<PackageVersion Include="Cerebras.Cloud.Sdk.Unofficial" Version="2025.7.8-rc.6" />

Then reference the package, without a version, in the project file:

<PackageReference Include="Cerebras.Cloud.Sdk.Unofficial" />

paket add Cerebras.Cloud.Sdk.Unofficial --version 2025.7.8-rc.6

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: Cerebras.Cloud.Sdk.Unofficial, 2025.7.8-rc.6"

The #r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or the source code of the script to reference the package.

#:package Cerebras.Cloud.Sdk.Unofficial@2025.7.8-rc.6

The #:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

Install as a Cake Addin:

#addin nuget:?package=Cerebras.Cloud.Sdk.Unofficial&version=2025.7.8-rc.6&prerelease

Install as a Cake Tool:

#tool nuget:?package=Cerebras.Cloud.Sdk.Unofficial&version=2025.7.8-rc.6&prerelease

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Cerebras.Cloud.Sdk (Unofficial)

An unofficial .NET SDK for the Cerebras Cloud API, providing easy integration with Cerebras AI models in your .NET applications.

Features

Chat Completions: Full support for chat-based completions with system, user, and assistant messages
Text Completions: Traditional text completion API for prompt-based generation
Streaming Support: Real-time streaming responses for both chat and text completions
Models API: List and retrieve information about available models
Advanced Features: Tool/function calling, JSON mode, logit bias, and more
Robust Error Handling: Built-in retry logic with exponential backoff and typed exceptions
Dependency Injection: First-class support for ASP.NET Core dependency injection
Comprehensive Logging: Detailed logging for debugging and monitoring
Type-safe Models: Strongly typed request/response models with IntelliSense support

Installation

dotnet add package Cerebras.Cloud.Sdk.Unofficial

Quick Start

Basic Usage - Chat Completions

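using Cerebras.Cloud.Sdk;
using Cerebras.Cloud.Sdk.Chat;
using Microsoft.Extensions.DependencyInjection; // ServiceCollection, BuildServiceProvider, GetRequiredService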

// Option 1: Using the enhanced client (recommended)
var services = new ServiceCollection();
services.AddCerebrasClientV2(options =>
{
    options.ApiKey = "your-api-key-here"; // Or set via CEREBRAS_API_KEY environment variable
});

var serviceProvider = services.BuildServiceProvider();
var client = serviceProvider.GetRequiredService<ICerebrasClientV2>();

// Create a chat completion
var request = new ChatCompletionRequest
{
    Model = "llama-3.3-70b",
    Messages = new List<ChatMessage>
    {
        new() { Role = "system", Content = "You are a helpful assistant." },
        new() { Role = "user", Content = "What is the capital of France?" }
    },
    MaxTokens = 100,
    Temperature = 0.7
};

var response = await client.Chat.CreateAsync(request);
Console.WriteLine(response.Choices[0].Message.Content);

Text Completions

using Cerebras.Cloud.Sdk.Completions;

// Generate text completion
var request = new TextCompletionRequest
{
    Model = "llama-3.3-70b",
    Prompt = "The capital of France is",
    MaxTokens = 100,
    Temperature = 0.7
};

var response = await client.Completions.CreateAsync(request);
Console.WriteLine(response.Choices[0].Text);

Streaming Responses

// Chat streaming
var chatRequest = new ChatCompletionRequest
{
    Model = "llama-3.3-70b",
    Messages = new List<ChatMessage>
    {
        new() { Role = "user", Content = "Tell me a story" }
    },
    MaxTokens = 500,
    Stream = true
};

await foreach (var chunk in client.Chat.CreateStreamAsync(chatRequest))
{
    if (chunk.Choices[0].Delta?.Content != null)
    {
        Console.Write(chunk.Choices[0].Delta.Content);
    }
}

// Text completion streaming
var textRequest = new TextCompletionRequest
{
    Model = "llama-3.3-70b",
    Prompt = "Once upon a time",
    MaxTokens = 500,
    Stream = true
};

await foreach (var chunk in client.Completions.CreateStreamAsync(textRequest))
{
    Console.Write(chunk.Choices[0].Text);
}
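
The loops above print each chunk as it arrives. If the complete text is also needed afterwards, the pieces can be collected while streaming. The following is a minimal sketch of that pattern only: `FakeDeltas` is a hypothetical stand-in for the SDK stream (it yields plain strings rather than SDK chunk types), so the block runs without an API key.

```csharp
using System;
using System.Collections.Generic;
using System.Text;
using System.Threading.Tasks;

// Hypothetical stand-in for a stream of Delta.Content values;
// null represents a chunk that carries no text.
static async IAsyncEnumerable<string?> FakeDeltas()
{
    foreach (string? piece in new string?[] { "Once ", null, "upon ", "a time" })
    {
        await Task.Yield(); // simulate asynchronous arrival
        yield return piece;
    }
}

var fullText = new StringBuilder();
await foreach (string? delta in FakeDeltas())
{
    if (delta is null) continue;   // skip chunks without content
    Console.Write(delta);          // show text as it arrives
    fullText.Append(delta);        // and keep it for later use
}
Console.WriteLine();

string story = fullText.ToString();
```

With the real client, the same accumulation would sit inside the `await foreach` over `client.Chat.CreateStreamAsync(...)`, appending `chunk.Choices[0].Delta.Content` when it is not null.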

Dependency Injection

// In your Program.cs or Startup.cs
builder.Services.AddCerebrasClientV2(builder.Configuration);

// Or with custom configuration
builder.Services.AddCerebrasClientV2(options =>
{
    options.ApiKey = "your-api-key";
    options.DefaultModel = "llama-3.3-70b";
    options.TimeoutSeconds = 60;
});

// Then inject ICerebrasClientV2 wherever needed
public class MyService
{
    private readonly ICerebrasClientV2 _client;
    
    public MyService(ICerebrasClientV2 client)
    {
        _client = client;
    }
    
    public async Task<string> GenerateResponseAsync(string prompt)
    {
        var response = await _client.Chat.CreateAsync(new ChatCompletionRequest
        {
            Model = "llama-3.3-70b",
            Messages = new List<ChatMessage>
            {
                new() { Role = "user", Content = prompt }
            }
        });
        
        return response.Choices[0].Message.Content;
    }
}

Legacy Client Support

The original ICerebrasClient interface is still supported for backward compatibility:

// Using the legacy client
services.AddCerebrasClient(configuration);

// Inject and use
public class LegacyService
{
    private readonly ICerebrasClient _client;
    
    public LegacyService(ICerebrasClient client)
    {
        _client = client;
    }
}

Configuration

The SDK can be configured through code or via appsettings.json:

{
  "CerebrasClient": {
    "ApiKey": "your-api-key-here",
    "BaseUrl": "https://api.cerebras.ai/v1/",
    "DefaultModel": "llama-3.3-70b",
    "DefaultTemperature": 0.7,
    "DefaultMaxTokens": 1024,
    "TimeoutSeconds": 30,
    "MaxRetries": 3,
    "EnableLogging": true,
    "RateLimit": {
      "Enabled": true,
      "RequestsPerMinute": 60,
      "TokensPerMinute": 100000
    }
  }
}

Available Models

List all available models:

var models = await client.Models.ListAsync();
foreach (var model in models)
{
    Console.WriteLine($"{model.Id}: {model.Name} (Context: {model.ContextWindow} tokens)");
}

Get specific model details:

var model = await client.Models.RetrieveAsync("llama-3.3-70b");
Console.WriteLine($"Model: {model.Name}");
Console.WriteLine($"Context window: {model.ContextWindow}");
Console.WriteLine($"Description: {model.Description}");

Advanced Features

Tool/Function Calling

var request = new ChatCompletionRequest
{
    Model = "llama-3.3-70b",
    Messages = new List<ChatMessage>
    {
        new() { Role = "user", Content = "What's the weather in Paris?" }
    },
    Tools = new List<ChatTool>
    {
        new()
        {
            Type = "function",
            Function = new ChatFunction
            {
                Name = "get_weather",
                Description = "Get the current weather for a location",
                Parameters = new
                {
                    type = "object",
                    properties = new
                    {
                        location = new { type = "string", description = "City name" }
                    },
                    required = new[] { "location" }
                }
            }
        }
    }
};

var response = await client.Chat.CreateAsync(request);
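
The example above stops once the request is sent, and this README does not show how the SDK exposes tool calls on the response. The sketch below therefore covers only the application side, under the assumption that a tool name and a JSON arguments string have already been extracted from the response. The `handlers` dictionary, `toolName`, and `argumentsJson` are hypothetical names for illustration and are not part of the SDK.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

// Hypothetical handlers keyed by tool name. Each receives the parsed
// JSON arguments and returns a result string for the model.
var handlers = new Dictionary<string, Func<JsonElement, string>>
{
    ["get_weather"] = args =>
    {
        string location = args.GetProperty("location").GetString() ?? "unknown";
        // Placeholder result; a real handler would call a weather service.
        return $"Weather lookup requested for {location}";
    }
};

// Hypothetical values standing in for a tool call returned by the model.
string toolName = "get_weather";
string argumentsJson = "{\"location\": \"Paris\"}";

string toolResult;
using (JsonDocument doc = JsonDocument.Parse(argumentsJson))
{
    // Dispatch by name; unknown tools produce an error string instead of throwing.
    toolResult = handlers.TryGetValue(toolName, out var handler)
        ? handler(doc.RootElement)
        : $"Unknown tool: {toolName}";
}

Console.WriteLine(toolResult);
```

Keeping handlers in a dictionary means that adding a tool is one new entry plus the matching definition in `Tools`.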

JSON Mode

var request = new ChatCompletionRequest
{
    Model = "llama-3.3-70b",
    Messages = new List<ChatMessage>
    {
        new() { Role = "user", Content = "List 3 programming languages as JSON" }
    },
    ResponseFormat = new ResponseFormat { Type = "json_object" }
};
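
In JSON mode the message content is still delivered as a string, so the caller has to parse it. A minimal sketch using System.Text.Json (already a dependency of this package) follows. The `content` value is a made-up sample, not real model output, and the `languages` property name is an assumption about what the model might return. `ReadStringArray` is a local helper, not an SDK method.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

// Hypothetical sample standing in for response.Choices[0].Message.Content.
string content = "{\"languages\": [\"C#\", \"Python\", \"Rust\"]}";

// Local helper (not part of the SDK): read a string array from a named property.
// Returns an empty list when the property is missing or has another shape.
static List<string> ReadStringArray(string json, string propertyName)
{
    var result = new List<string>();
    using JsonDocument doc = JsonDocument.Parse(json); // throws JsonException on invalid JSON
    if (doc.RootElement.ValueKind == JsonValueKind.Object &&
        doc.RootElement.TryGetProperty(propertyName, out JsonElement array) &&
        array.ValueKind == JsonValueKind.Array)
    {
        foreach (JsonElement item in array.EnumerateArray())
        {
            if (item.ValueKind == JsonValueKind.String)
            {
                result.Add(item.GetString()!);
            }
        }
    }
    return result;
}

List<string> languages = ReadStringArray(content, "languages");
Console.WriteLine(string.Join(", ", languages));
```

Because the shape of the returned JSON is not guaranteed by the prompt alone, the helper checks value kinds before reading instead of assuming a fixed structure.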

Error Handling

The SDK provides comprehensive error handling with typed exceptions:

using System.Net; // for HttpStatusCode

try
{
    var response = await client.Chat.CreateAsync(request);
}
catch (CerebrasApiException ex)
{
    Console.WriteLine($"API Error: {ex.Message}");
    Console.WriteLine($"Status Code: {ex.StatusCode}");
    Console.WriteLine($"Error Type: {ex.ErrorType}");
    Console.WriteLine($"Error Code: {ex.ErrorCode}");
    
    // Handle specific error types
    switch (ex.StatusCode)
    {
        case HttpStatusCode.Unauthorized:
            Console.WriteLine("Invalid API key");
            break;
        case HttpStatusCode.TooManyRequests:
            Console.WriteLine("Rate limit exceeded");
            break;
        case HttpStatusCode.BadRequest:
            Console.WriteLine($"Invalid request: {ex.Message}");
            break;
    }
}
catch (TaskCanceledException)
{
    Console.WriteLine("Request timed out");
}
catch (HttpRequestException ex)
{
    Console.WriteLine($"Network error: {ex.Message}");
}

Building and Testing

Building the SDK

# Build the SDK
./scripts/build.sh

# Create NuGet package
./scripts/pack.sh

Running Tests

# Run all tests (unit + integration if API key is set)
./scripts/run-tests.sh

# Run only unit tests
dotnet test --filter "FullyQualifiedName~Unit"

# Run integration tests (requires CEREBRAS_API_KEY environment variable)
export CEREBRAS_API_KEY=your-api-key
dotnet test --filter "FullyQualifiedName~Integration"

# Run tests with coverage
dotnet test --collect:"XPlat Code Coverage" --results-directory ./TestResults
# Generate HTML report
reportgenerator -reports:./TestResults/*/coverage.cobertura.xml -targetdir:./coverage-report -reporttypes:Html

Running Examples

# Set your API key
export CEREBRAS_API_KEY=your-api-key

# Run the quick start example
./scripts/run-example.sh

# Or run specific examples directly
dotnet run --project examples/Cerebras.Cloud.Sdk.Examples/Cerebras.Cloud.Sdk.Examples.csproj

API Coverage

This SDK implements the complete Cerebras Cloud API surface:

Chat Completions (/v1/chat/completions)
- Streaming and non-streaming
- Tool/function calling
- JSON mode
- System messages
- All parameters (temperature, top_p, frequency_penalty, etc.)
Text Completions (/v1/completions)
- Streaming and non-streaming
- Multiple prompts
- Echo mode
- Suffix support
- Logit bias
Models (/v1/models)
- List available models
- Retrieve model details
Error Handling
- Typed exceptions
- Automatic retry with exponential backoff
- Request timeout configuration
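
To illustrate what "exponential backoff" means for the retry behavior listed above, here is a small sketch of how such delays are commonly computed. It is not the SDK's code: the package depends on Polly, so the actual policy (base delay, cap, jitter) may differ. `BackoffDelay` and the 1-second base and 30-second cap are assumptions chosen for illustration.

```csharp
using System;

// Hypothetical helper, not the SDK's implementation:
// delay to wait before retry number `attempt` (1-based).
static TimeSpan BackoffDelay(int attempt, TimeSpan baseDelay, TimeSpan maxDelay)
{
    if (attempt < 1) throw new ArgumentOutOfRangeException(nameof(attempt));

    // Double the delay on each attempt: base, 2*base, 4*base, ... capped at maxDelay.
    double factor = Math.Pow(2, attempt - 1);
    double ms = Math.Min(baseDelay.TotalMilliseconds * factor, maxDelay.TotalMilliseconds);
    return TimeSpan.FromMilliseconds(ms);
}

// Three retries, matching the "MaxRetries": 3 value from the sample configuration.
for (int attempt = 1; attempt <= 3; attempt++)
{
    TimeSpan delay = BackoffDelay(attempt, TimeSpan.FromSeconds(1), TimeSpan.FromSeconds(30));
    Console.WriteLine($"retry {attempt}: wait {delay.TotalSeconds}s");
}
```

With these assumed values the waits are 1, 2, and 4 seconds. Production policies usually add random jitter so that many clients do not retry at the same instant.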

Contributing

This is an unofficial SDK. Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the Apache License, Version 2.0 - see the LICENSE file for details.

Disclaimer

This is an unofficial SDK and is not affiliated with, endorsed by, or supported by Cerebras Systems Inc.

Product	Compatible and additional computed target framework versions.
.NET	net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.


net8.0
- Microsoft.Extensions.Http (>= 8.0.0)
- Microsoft.Extensions.Logging.Abstractions (>= 8.0.0)
- Microsoft.Extensions.Options.ConfigurationExtensions (>= 8.0.0)
- Microsoft.Extensions.Options.DataAnnotations (>= 8.0.0)
- Polly (>= 8.2.0)
- System.Text.Json (>= 8.0.5)

NuGet packages (1)

Showing the top 1 NuGet packages that depend on Cerebras.Cloud.Sdk.Unofficial:

Package	Downloads
Andy.Llm: Large Language Model integration library for .NET.	871

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
2025.7.13-rc.8	193	7/13/2025
2025.7.8-rc.6	126	7/8/2025

Initial release with support for chat completions, text completions, and tool calling.