Bendo.SimpleDataFrameBuilder
0.1.4
dotnet add package Bendo.SimpleDataFrameBuilder --version 0.1.4
NuGet\Install-Package Bendo.SimpleDataFrameBuilder -Version 0.1.4
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="Bendo.SimpleDataFrameBuilder" Version="0.1.4" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
<PackageVersion Include="Bendo.SimpleDataFrameBuilder" Version="0.1.4" />
<PackageReference Include="Bendo.SimpleDataFrameBuilder" />
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
paket add Bendo.SimpleDataFrameBuilder --version 0.1.4
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
#r "nuget: Bendo.SimpleDataFrameBuilder, 0.1.4"
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#addin nuget:?package=Bendo.SimpleDataFrameBuilder&version=0.1.4
#tool nuget:?package=Bendo.SimpleDataFrameBuilder&version=0.1.4
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
SimpleParquetWriter
A lightweight, efficient .NET library for writing Parquet files with a simple builder pattern.
Features
- Stream-like API for building dataframes row by row
- Background thread writing for improved performance
- Reasonable defaults for most use cases:
- zstandard compression
- disabled dictionary encoding for floating point numbers
- 64K rowgroups
- Strong typing for various data types
- Minimal memory footprint and allocations
- No dictionary lookups for column names (uses ordered columns)
- Uses ParquetSharp under the hood
Installation
dotnet add package Bendo.SimpleDataFrameBuilder
Quick Start
// Create a writer for a new parquet file
using var writer = new SimpleParquetWriter("data.parquet");
// Add records one by one
writer.WriteField("name", "Alice");
writer.WriteField("age", 28);
writer.WriteField("active", true);
writer.WriteField("score", 94.5);
writer.WriteField("hired_date", DateTime.UtcNow);
writer.NextRecord();
writer.WriteField("name", "Bob");
writer.WriteField("age", 32);
writer.WriteField("active", false);
writer.WriteField("score", 88.0);
writer.WriteField("hired_date", DateTime.UtcNow.AddYears(-2));
writer.NextRecord();
// More records...
// The file is written on a background thread and closed when the writer
// is disposed (dispose blocks the current thread).
Important Notes
- Column order must be the same for every row (this improves performance by avoiding dictionary lookups)
- The parquet file is written in a background thread and finalized when the writer is disposed
- The writer must be disposed to ensure all data is written correctly
- Try to avoid string construction in the
WriteField
method to reduce allocations (use string constants or reuse strings).- You can use the optional
int idx
argument to help with this, e.g.:
- You can use the optional
Example: use the optional idx
argument to avoid string construction:
for (int i = 0; i < 3; i++)
{
writer.WriteField("address_line_{0}", addresses[i], i);
}
// ...
// is equivalent to:
writer.WriteField("address_line_0", addresses[0]);
writer.WriteField("address_line_1", addresses[1]);
writer.WriteField("address_line_2", addresses[2]);
Configuration
Logging
By default, the library logs a few events to the console, but you can configure your own logging:
// Set custom logger
SimpleParquetWriter.Logger = (message) => _myLoggingFramework.Log(message);
Column Order Validation
For development or debugging, you can enable column order validation:
// Enable column order validation (default is false for performance)
SimpleParquetWriter.VerifyColumnOrder = true;
Project status
Stable and used in production by the author.
License
MIT
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. |
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.
-
net8.0
- ParquetSharp (>= 18.1.0)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.