DataFusionSharp 0.2.2
dotnet add package DataFusionSharp --version 0.2.2
NuGet\Install-Package DataFusionSharp -Version 0.2.2
<PackageReference Include="DataFusionSharp" Version="0.2.2" />
<PackageVersion Include="DataFusionSharp" Version="0.2.2" />
<PackageReference Include="DataFusionSharp" />
paket add DataFusionSharp --version 0.2.2
#r "nuget: DataFusionSharp, 0.2.2"
#:package DataFusionSharp@0.2.2
#addin nuget:?package=DataFusionSharp&version=0.2.2
#tool nuget:?package=DataFusionSharp&version=0.2.2
DataFusionSharp
.NET bindings for Apache DataFusion, a fast, extensible query engine built on Apache Arrow for high-performance analytical query processing.
Note: This is an independent community project and is not officially associated with or endorsed by the Apache Software Foundation or the Apache DataFusion project.
Installation
dotnet add package DataFusionSharp
Quick Start
using DataFusionSharp;
// Create runtime, which manages Tokio runtime and native resources, per application lifetime
using var runtime = DataFusionRuntime.Create();
// Create session context, which manages query execution and state, per logical session lifetime
using var context = runtime.CreateSessionContext();
// Register a CSV file as a table (supports CSV, Parquet, JSONL)
await context.RegisterCsvAsync("orders", "path/to/orders.csv");
// await context.RegisterParquetAsync("orders", "path/to/orders.parquet");
// await context.RegisterJsonAsync("orders", "path/to/orders.json");
// Execute SQL query
using var df = await context.SqlAsync( "SELECT customer_id, sum(amount) AS total FROM orders GROUP BY customer_id");
// Display results to console
await df.ShowAsync();
// Access schema
var schema = await df.GetSchemaAsync();
foreach (var field in schema.FieldsList)
... // Process schema field (name, type, etc.)
// Collect as Arrow batches
using var collectedData = await df.CollectAsync();
foreach (var batch in collectedData.Batches)
... // Process Arrow RecordBatch...
// Collect as stream of Arrow batches
using var stream = await df.ExecuteStreamAsync();
await foreach (var batch in stream)
... // Process streamed RecordBatch...
Features
- Runtime Management - Create and manage Tokio runtime with configurable threads
- Session Context - Execute SQL queries with full DataFusion SQL support
- Data Sources - Read and write CSV, Parquet, and JSON files
- DataFrame API - Count rows, get schema, collect data, and stream results
Requirements
- .NET 8.0 or later
- Supported platforms:
- Linux (x64, arm64)
- Windows (x64)
- macOS (arm64)
Documentation
For more information, examples, and source code, visit the GitHub repository.
Also see the Apache DataFusion documentation for details on SQL syntax, supported functions, and features.
License
DataFusionSharp is licensed under the Apache License 2.0. See LICENSE.txt for details.
This project contains bindings to Apache DataFusion, which is also licensed under Apache License 2.0. See NOTICE.txt for attribution details.
Acknowledgments
- Apache DataFusion - The underlying query engine
- Apache Arrow - Columnar memory format
- The Apache Software Foundation
Apache®, Apache DataFusion™, Apache Arrow™, and the Apache feather logo are trademarks of The Apache Software Foundation.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
-
net10.0
- Apache.Arrow (>= 22.1.0)
- Google.Protobuf (>= 3.33.5)
-
net8.0
- Apache.Arrow (>= 22.1.0)
- Google.Protobuf (>= 3.33.5)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.