SentencePieceTokenizer 0.1.3

dotnet add package SentencePieceTokenizer --version 0.1.3
                    
NuGet\Install-Package SentencePieceTokenizer -Version 0.1.3
                    
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="SentencePieceTokenizer" Version="0.1.3" />
                    
For projects that support PackageReference, copy this XML node into the project file to reference the package.
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
Directory.Packages.props:
<PackageVersion Include="SentencePieceTokenizer" Version="0.1.3" />

Project file:
<PackageReference Include="SentencePieceTokenizer" />

paket add SentencePieceTokenizer --version 0.1.3
                    
#r "nuget: SentencePieceTokenizer, 0.1.3"
                    
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#addin nuget:?package=SentencePieceTokenizer&version=0.1.3
                    
Install SentencePieceTokenizer as a Cake Addin
#tool nuget:?package=SentencePieceTokenizer&version=0.1.3
                    
Install SentencePieceTokenizer as a Cake Tool

SentencePieceTokenizer


Usage

The tokenizers should be thread-safe, because the underlying sentencepiece processor is thread-safe. However, this has not been extensively tested.
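
Since thread safety is expected but not extensively tested, a quick smoke test before sharing one tokenizer instance across threads may be worthwhile. The sketch below is a minimal, hypothetical helper, not part of this package: `ThreadSafetySmokeTest`, `ParallelMatchesSequential`, and the `encode` delegate are all assumed names, and `encode` stands in for whatever call on a shared tokenizer instance turns text into ids.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Threading.Tasks;

// Hypothetical helper: none of these names come from the SentencePieceTokenizer package.
public static class ThreadSafetySmokeTest
{
    // `encode` is a placeholder for the call that turns text into ids
    // on ONE shared tokenizer instance.
    public static bool ParallelMatchesSequential(
        IReadOnlyList<string> texts,
        Func<string, int[]> encode)
    {
        // Baseline: encode every text on a single thread.
        var expected = texts.Select(encode).ToArray();

        // Same inputs through the same delegate, spread over many threads.
        // Each slot of `actual` is written by exactly one iteration.
        var actual = new int[texts.Count][];
        Parallel.For(0, texts.Count, i => actual[i] = encode(texts[i]));

        // Every parallel result must equal its sequential counterpart.
        return expected
            .Zip(actual, (e, a) => e.SequenceEqual(a))
            .All(equal => equal);
    }
}
```

A `true` result only means no mismatch was observed in that run; it does not prove thread safety. A `false` result or an exception would suggest that the shared instance needs a lock or that each thread needs its own instance.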

MarianTokenizer

Tokenizes text with a sentencepiece model, but maps the resulting pieces to ids through a separate vocabulary instead of the model's own.
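
To illustrate the idea of a separate id vocabulary, here is a minimal sketch of the remapping step alone. It is an assumption-laden illustration and not the package's implementation: `VocabularyRemapSketch`, `PiecesToIds`, and the `unknownId` parameter are hypothetical names, and the pieces are assumed to have already been produced by a sentencepiece model.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical illustration: these names are NOT part of the SentencePieceTokenizer package.
public static class VocabularyRemapSketch
{
    // Maps pieces (as a sentencepiece model would emit them) to ids taken
    // from a separate vocabulary. Pieces missing from that vocabulary fall
    // back to `unknownId`.
    public static int[] PiecesToIds(
        IEnumerable<string> pieces,
        IReadOnlyDictionary<string, int> vocabulary,
        int unknownId)
    {
        return pieces
            .Select(piece => vocabulary.TryGetValue(piece, out var id) ? id : unknownId)
            .ToArray();
    }
}
```

With an assumed vocabulary of `"▁Hello"` = 10 and `"▁world"` = 11 and an `unknownId` of 1, the pieces `▁Hello`, `▁world`, `▁xyz` would map to 10, 11, 1. The ids come only from the external vocabulary, so the sentencepiece model's own piece ids play no role in the result.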


Product Compatible and additional computed target framework versions.
.NET: net9.0 is compatible. net9.0-android, net9.0-browser, net9.0-ios, net9.0-maccatalyst, net9.0-macos, net9.0-tvos, and net9.0-windows were computed.

NuGet packages (1)

Showing the top 1 NuGet packages that depend on SentencePieceTokenizer:

Package Downloads
Darcara.TextAnalysis

TextAnalysis, sentence splitting, named entity recognition, translation and more

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
0.1.3 163 1/6/2025
0.1.2 103 1/6/2025
0.1.1 121 12/29/2024