OpenccJiebaLib 1.0.1


OpenccJiebaLib


A .NET Standard 2.0 library providing a managed C# wrapper for the Rust-based OpenCC and Jieba C API, enabling efficient Chinese text conversion (Simplified/Traditional), segmentation, and keyword extraction in .NET applications.

Features

  • Chinese Text Conversion: Convert between Simplified, Traditional, and other Chinese variants using OpenCC.
  • Word Segmentation: Segment Chinese text into words using Jieba.
  • Keyword Extraction: Extract keywords using TF-IDF or TextRank algorithms.
  • Native Performance: Leverages native OpenCC/Jieba libraries for high performance.

Supported OpenCC Configurations

s2t, t2s, s2tw, tw2s, s2twp, tw2sp, s2hk, hk2s, t2tw,
t2twp, t2hk, tw2t, tw2tp, hk2t, t2jp, jp2t
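Because the configuration is passed as a plain string, a typo only surfaces at run time. A minimal sketch of a guard that checks a name against the list above before calling the library; OpenccConfigs and IsSupported are hypothetical names and not part of OpenccJiebaLib, and the sketch assumes names must match exactly in lowercase:

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not part of OpenccJiebaLib.
public static class OpenccConfigs
{
    // The 16 configuration names listed in this README.
    private static readonly HashSet<string> Supported =
        new HashSet<string>(StringComparer.Ordinal)
        {
            "s2t", "t2s", "s2tw", "tw2s", "s2twp", "tw2sp", "s2hk", "hk2s", "t2tw",
            "t2twp", "t2hk", "tw2t", "tw2tp", "hk2t", "t2jp", "jp2t"
        };

    // Assumption: config names are matched exactly (case-sensitive).
    public static bool IsSupported(string config)
    {
        return config != null && Supported.Contains(config);
    }
}
```

A caller could test OpenccConfigs.IsSupported(config) and fail early with a clear message instead of passing an unknown name to the native layer.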

Getting Started

Prerequisites

  • .NET Standard 2.0 or higher (.NET Framework, .NET Core/5+/6+, Mono, Xamarin, etc.).
  • .NET 6.0 or later recommended.
  • Native opencc_jieba_capi library (must be available to the runtime).

Installation

Option 1 — As Project Reference
  • Add a project reference to OpenccJiebaLib in your solution.
  • Manually copy the native binary to your app’s output directory (bin/<Config>/<TFM>):
    • Windows: opencc_jieba_capi.dll
    • Linux: libopencc_jieba_capi.so
    • macOS: libopencc_jieba_capi.dylib
  • Alternatively, add the native file to your project and set Copy to Output Directory to Copy always or Copy if newer.

🧪 Unit test projects (MSTest, xUnit, NUnit) also need the native binary in the test project's output folder. Use the same copy strategy as above, or add an MSBuild target that copies the native files after each build.

Option 2 — From NuGet
  • Install via NuGet:
    dotnet add package OpenccJiebaLib
    
  • The NuGet package includes platform-specific native runtimes and will automatically deploy them. No manual copying needed.

Usage

using System;
using OpenccJiebaLib;

using (var openccJieba = new OpenccJieba())
{
    // Convert Simplified → Traditional
    string traditional = openccJieba.Convert("汉字转换测试", "s2t");
    Console.WriteLine(traditional); // 漢字轉換測試

    // Segment text
    string[] words = openccJieba.JiebaCut("我来到北京清华大学", hmm: true);
    // => ["我", "来到", "北京", "清华大学"]

    // Extract keywords (TF-IDF)
    string[] keywords = openccJieba.JiebaKeywordExtractTfidf("这是一个用于关键词提取的测试文本", topK: 5);
    // 提取/ 关键词/ 测试/ 用于/ 文本

    // Extract keywords with weights (TextRank)
    var (kw, weights) = openccJieba.JiebaExtractKeywordsWeights("这是一个用于关键词提取的测试文本", 5, "textrank");
    // Sample (keyword, weight) pairs: [('提取', 12214076549.586092), ('关键词', 12213038715.272404), ('测试', 9971894336.779804), ('用于', 9968689471.76825), ('文本', 7771637141.591653)]
}

Error Handling

If initialization fails or a native call reports an error, the wrapper throws an InvalidOperationException.
If your version exposes OpenccJieba.LastError(), call it to retrieve the last native error message.
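A minimal sketch of handling that exception around a single conversion. SafeConvertExample and TryConvert are hypothetical names used only for illustration. The LastError() call is left commented out because this README marks it "if available" and its return type is an assumption here:

```csharp
using System;
using OpenccJiebaLib;

// Hypothetical wrapper, not part of OpenccJiebaLib.
public static class SafeConvertExample
{
    // Returns the converted text, or null if the wrapper reports a failure.
    public static string TryConvert(string input, string config)
    {
        try
        {
            using (var openccJieba = new OpenccJieba())
            {
                return openccJieba.Convert(input, config);
            }
        }
        catch (InvalidOperationException ex)
        {
            Console.Error.WriteLine("OpenccJieba failed: " + ex.Message);
            // Assumption: LastError() is static and returns a string.
            // Uncomment only if your version of the library provides it.
            // Console.Error.WriteLine(OpenccJieba.LastError());
            return null;
        }
    }
}
```

Returning null keeps the sketch short; production code might prefer to rethrow or return a result type that carries the error message.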

API Overview

  • Convert(string input, string config, bool punctuation = false)
  • JiebaCut(string input, bool hmm)
  • JiebaCutAndJoin(string input, bool hmm, string delimiter)
  • JiebaKeywordExtractTfidf(string input, int topK)
  • JiebaKeywordExtractTextRank(string input, int topK)
  • JiebaExtractKeywordsWeights(string input, int topK, string method)
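The Usage section does not exercise the punctuation flag, JiebaCutAndJoin, or JiebaKeywordExtractTextRank. The following is a rough sketch of calling them based only on the signatures listed above. The return types are assumptions (a joined string and a string array respectively), so var is used and no output is shown:

```csharp
using System;
using OpenccJiebaLib;

public static class ApiTour
{
    public static void Main()
    {
        using (var openccJieba = new OpenccJieba())
        {
            // Third argument is the optional punctuation flag from the signature above;
            // presumably it also converts punctuation such as quotation marks.
            string hk = openccJieba.Convert("“汉字”转换测试", "s2hk", true);
            Console.WriteLine(hk);

            // Assumption: returns the segments joined by the delimiter as one string.
            var joined = openccJieba.JiebaCutAndJoin("我来到北京清华大学", true, "/");
            Console.WriteLine(joined);

            // Assumption: returns string[] like JiebaKeywordExtractTfidf.
            var keywords = openccJieba.JiebaKeywordExtractTextRank("这是一个用于关键词提取的测试文本", 5);
            Console.WriteLine(string.Join(", ", keywords));
        }
    }
}
```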

Troubleshooting

1) DllNotFoundException / Unable to load shared library 'opencc_jieba_capi'

  • Ensure the native file exists in your app output folder or is discoverable via PATH/LD_LIBRARY_PATH.
  • If using NuGet, clean and rebuild the project; the native files are copied automatically during the build.

2) BadImageFormatException

  • The process architecture does not match the native library. Build and run your app for the same architecture as the native binary (for example, x64 rather than x86).
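For the two errors above, a small diagnostic can help narrow down the cause. This sketch uses only standard .NET APIs and the file names listed under Installation; NativeCheck is a hypothetical name. It looks only in the app's base directory, so it is most useful for the manual-copy setup; natives deployed by NuGet may live in a runtimes subfolder instead:

```csharp
using System;
using System.IO;
using System.Runtime.InteropServices;

// Hypothetical diagnostic helper, not part of OpenccJiebaLib.
public static class NativeCheck
{
    // Picks the native file name for the current OS (names from the Installation section).
    public static string ExpectedFileName()
    {
        if (RuntimeInformation.IsOSPlatform(OSPlatform.Windows)) return "opencc_jieba_capi.dll";
        if (RuntimeInformation.IsOSPlatform(OSPlatform.OSX)) return "libopencc_jieba_capi.dylib";
        return "libopencc_jieba_capi.so";
    }

    public static void Main()
    {
        string path = Path.Combine(AppContext.BaseDirectory, ExpectedFileName());
        // Compare this value with the architecture of the native binary.
        Console.WriteLine("Process architecture: " + RuntimeInformation.ProcessArchitecture);
        Console.WriteLine(path + (File.Exists(path) ? " found" : " NOT found"));
    }
}
```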

3) Platform-specific Notes

  • Linux: if the .so file is not next to the app, you may need to add its directory to LD_LIBRARY_PATH.
  • macOS: remove Gatekeeper quarantine flags for .dylib:
    xattr -dr com.apple.quarantine libopencc_jieba_capi.dylib
    

4) Crashes / Thread Safety

  • Create a separate OpenccJieba instance per thread, or synchronize access to a shared instance.
  • Dispose properly after use (using block recommended).
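One way to follow both points in parallel code is to give each worker its own instance and dispose it when the worker finishes. The sketch below does this with the standard Parallel.ForEach overload that takes local state; ParallelConvert and ConvertAll are hypothetical names, and whether a single instance could safely be shared is not something this README states:

```csharp
using System;
using System.Threading.Tasks;
using OpenccJiebaLib;

// Hypothetical helper, not part of OpenccJiebaLib.
public static class ParallelConvert
{
    public static string[] ConvertAll(string[] inputs, string config)
    {
        var results = new string[inputs.Length];
        Parallel.ForEach(
            inputs,
            // localInit: each worker task creates its own instance.
            () => new OpenccJieba(),
            // body: an instance is only ever used by the task that created it.
            (text, state, index, instance) =>
            {
                results[index] = instance.Convert(text, config);
                return instance;
            },
            // localFinally: dispose the instance when the worker task completes.
            instance => instance.Dispose());
        return results;
    }
}
```

Each element of results is written by exactly one iteration, so the output array needs no locking, and the input order is preserved.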

Tip: NuGet is easiest for handling natives. Use manual copy only when debugging custom native builds.

License

This project is licensed under the MIT License.
See LICENSE for details.

Acknowledgements


Powered by OpenCC and Jieba. C# wrapper by laisuk.



Version Downloads Last Updated
1.0.1 187 8/28/2025
1.0.0 178 8/27/2025

v1.0.1
Added snupkg symbol package