ManySpeech.K2TransducerAsr 1.0.8

.NET 6.0 .NET Core 3.1 .NET Standard 2.0 .NET Framework 4.6.1

dotnet add package ManySpeech.K2TransducerAsr --version 1.0.8

NuGet\Install-Package ManySpeech.K2TransducerAsr -Version 1.0.8

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="ManySpeech.K2TransducerAsr" Version="1.0.8" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="ManySpeech.K2TransducerAsr" Version="1.0.8" />
                    

                            Directory.Packages.props

<PackageReference Include="ManySpeech.K2TransducerAsr" />
                    

                            Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add ManySpeech.K2TransducerAsr --version 1.0.8

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: ManySpeech.K2TransducerAsr, 1.0.8"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package ManySpeech.K2TransducerAsr@1.0.8

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=ManySpeech.K2TransducerAsr&version=1.0.8
                    

                            Install as a Cake Addin

#tool nuget:?package=ManySpeech.K2TransducerAsr&version=1.0.8
                    

                            Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

K2TransducerAsr

Introduction:

K2TransducerAsr is a "speech recognition" library written in C#. It calls Microsoft.ML.OnnxRuntime at the bottom layer to decode onnx models, supporting multiple environments such as net461+, net60+, netcoreapp3.1, and netstandard2.0+, supports cross-platform compilation, and supports AOT compilation. It is simple and convenient to use.

Supported Models (ONNX)

Model Name	Type	Supported Language	Download Address
k2transducer-lstm-en-onnx-online-csukuangfj-20220903	Streaming	English	modelscope
k2transducer-lstm-zh-onnx-online-csukuangfj-20221014	Streaming	Chinese	modelscope
k2transducer-zipformer-en-onnx-online-weijizhuang-20221202	Streaming	English	modelscope
k2transducer-zipformer-en-onnx-online-zengwei-20230517	Streaming	English	modelscope
k2transducer-zipformer-multi-zh-hans-onnx-online-20231212	Streaming	Chinese	modelscope
k2transducer-zipformer-ko-onnx-online-johnbamma-20240612	Streaming	Korean	modelscope
k2transducer-zipformer-ctc-small-zh-onnx-online-20250401	Streaming	Chinese	modelscope
k2transducer-zipformer-large-zh-onnx-online-yuekai-20250630	Streaming	Chinese	modelscope
k2transducer-zipformer-xlarge-zh-onnx-online-yuekai-20250630	Streaming	Chinese	modelscope
k2transducer-zipformer-ctc-large-zh-onnx-online-yuekai-20250630	Streaming	Chinese	modelscope
k2transducer-zipformer-ctc-xlarge-zh-onnx-online-yuekai-20250630	Streaming	Chinese	modelscope
k2transducer-conformer-en-onnx-offline-csukuangfj-20220513	Non-streaming	English	modelscope
k2transducer-conformer-zh-onnx-offline-luomingshuang-20220727	Non-streaming	Chinese	modelscope
k2transducer-zipformer-en-onnx-offline-yfyeung-20230417	Non-streaming	English	modelscope
k2transducer-zipformer-large-en-onnx-offline-zengwei-20230516	Non-streaming	English	modelscope
k2transducer-zipformer-small-en-onnx-offline-zengwei-20230516	Non-streaming	English	modelscope
k2transducer-zipformer-zh-onnx-offline-wenetspeech-20230615	Non-streaming	Chinese	modelscope
k2transducer-zipformer-zh-onnx-offline-multi-zh-hans-20230902	Non-streaming	Chinese	modelscope
k2transducer-zipformer-zh-en-onnx-offline-20231122	Non-streaming	Chinese and English	modelscope
k2transducer-zipformer-cantonese-onnx-offline-20240313	Non-streaming	Cantonese	modelscope
k2transducer-zipformer-th-onnx-offline-yfyeung-20240620	Non-streaming	Thai	modelscope
k2transducer-zipformer-ja-onnx-offline-reazonspeech-20240801	Non-streaming	Japanese	modelscope
k2transducer-zipformer-ru-onnx-offline-20240918	Non-streaming	Russian	modelscope
k2transducer-zipformer-vi-onnx-offline-20250420	Non-streaming	Vietnamese	modelscope
k2transducer-zipformer-ctc-zh-onnx-offline-20250703	Non-streaming	Chinese	modelscope github
k2transducer-zipformer-ctc-small-zh-onnx-offline-20250716	Non-streaming	Chinese	modelscope

How to Use

1. Clone the project source code

cd /path/to
git clone https://github.com/manyeyes/K2TransducerAsr.git

2. Download the models from the above list to the directory: /path/to/K2TransducerAsr/K2TransducerAsr.Examples

cd /path/to/K2TransducerAsr/K2TransducerAsr.Examples
git clone https://www.modelscope.cn/manyeyes/[model name].git

3. Load the project using vs2022 (or other IDEs)

4. Set the files in the model directory to: Copy to Output Directory → Copy if newer

5. Modify the code in the example: string modelName = [model name]

6. Run the project

Calling Method for Offline (Non-streaming) Models:

1. Add project reference

using K2TransducerAsr; using K2TransducerAsr.Model;

2. Model initialization and configuration

string applicationBase = AppDomain.CurrentDomain.BaseDirectory;
string modelName = "k2transducer-zipformer-large-en-onnx-offline-zengwei-20230516";
string encoderFilePath = applicationBase + "./" + modelName + "/encoder.int8.onnx";
string decoderFilePath = applicationBase + "./" + modelName + "/decoder.int8.onnx";
string joinerFilePath = applicationBase + "./" + modelName + "/joiner.int8.onnx";
string tokensFilePath = applicationBase + "./" + modelName + "/tokens.txt";
K2TransducerAsr.OfflineRecognizer offlineRecognizer = new K2TransducerAsr.OfflineRecognizer(encoderFilePath, decoderFilePath, joinerFilePath, tokensFilePath, threadsNum: 2);

3. Call

List<float[]> samples = new List<float[]>();
//The code for converting wav files to samples is omitted here...
//Single recognition
foreach (var sample in samples)
{
    OfflineStream stream = offlineRecognizer.CreateOfflineStream();
    stream.AddSamples(sample);
    OfflineRecognizerResultEntity result = offlineRecognizer.GetResult(stream);
    Console.WriteLine(result.text);
}
//Batch recognition
List<OfflineStream> streams = new List<OfflineStream>();
foreach (var sample in samples)
{
    OfflineStream stream = offlineRecognizer.CreateOfflineStream();
    stream.AddSamples(sample);
    streams.Add(stream);
}
List<OfflineRecognizerResultEntity> results = offlineRecognizer.GetResults(streams);
foreach (OfflineRecognizerResultEntity result in results)
{
    Console.WriteLine(result.text);
}

4. Output results:

Single recognition

 after early nightfall the yellow lamps would light up here and there the squalid quarter of the brothels

 god as a direct consequence of the sin which man thus punished had given her a lovely child whose place was on that same dishonoured bosom to connect her parent for ever with the race and descent of mortals and to be finally a blessed soul in heaven

elapsed_milliseconds:1062.28125
total_duration:23340
rtf:0.045513335475578405
end!

Batch recognition

 after early nightfall the yellow lamps would light up here and there the squalid quarter of the brothels

 god as a direct consequence of the sin which man thus punished had given her a lovely child whose place was on that same dishonoured bosom to connect her parent for ever with the race and descent of mortals and to be finally a blessed soul in heaven

elapsed_milliseconds:1268.6875
total_duration:23340
rtf:0.05435679091688089
end!

Calling Method for Real-time (Streaming) Models:

1. Add project reference

using K2TransducerAsr; using K2TransducerAsr.Model;

2. Model initialization and configuration

string applicationBase = AppDomain.CurrentDomain.BaseDirectory;
string modelName = "k2transducer-zipformer-multi-zh-hans-onnx-online-20231212";
string encoderFilePath = applicationBase + "./" + modelName + "/encoder.int8.onnx";
string decoderFilePath = applicationBase + "./" + modelName + "/decoder.int8.onnx";
string joinerFilePath = applicationBase + "./" + modelName + "/joiner.int8.onnx";
string tokensFilePath = applicationBase + "./" + modelName + "/tokens.txt";
K2TransducerAsr.OnlineRecognizer onlineRecognizer = new K2TransducerAsr.OnlineRecognizer(encoderFilePath, decoderFilePath, joinerFilePath, tokensFilePath, threadsNum: 2);

3. Call

List<List<float[]>> samplesList = new List<List<float[]>>();
//The code for converting wav files to samples is omitted here...
//The following is示意 code for batch processing:
//Batch processing
List<K2TransducerAsr.OnlineStream> onlineStreams = new List<K2TransducerAsr.OnlineStream>();
List<bool> isEndpoints = new List<bool>();
List<bool> isEnds = new List<bool>();
for (int num = 0; num < samplesList.Count; num++)
{
    K2TransducerAsr.OnlineStream stream = onlineRecognizer.CreateOnlineStream();
    onlineStreams.Add(stream);
    isEndpoints.Add(false);
    isEnds.Add(false);
}
while (true)
{
    //......(Some details are omitted here, please refer to the example code for details)
	List<K2TransducerAsr.OnlineRecognizerResultEntity> results_batch = onlineRecognizer.GetResults(streams);
	foreach (K2TransducerAsr.OnlineRecognizerResultEntity result in results_batch)
	{
		Console.WriteLine(result.text);
	}
	//......(Some details are omitted here, please refer to the example code for details)
}
//Single processing
for (int j = 0; j < samplesList.Count; j++)
{
    K2TransducerAsr.OnlineStream stream = onlineRecognizer.CreateOnlineStream();
    foreach (float[] samplesItem in samplesList[j])
    {
        stream.AddSamples(samplesItem);
        OnlineRecognizerResultEntity result_on = onlineRecognizer.GetResult(stream);
        Console.WriteLine(result_on.text);
    }
}
//Please refer to the example (K2TransducerAsr.Examples) code for details

4. Output results

Test results of Chinese model:

OnlineRecognizer:
batchSize:1



这是
这是第一种
这是第一种第二
这是第一种第二种
这是第一种第二种叫
这是第一种第二种叫
这是第一种第二种叫
这是第一种第二种叫呃
这是第一种第二种叫呃与
这是第一种第二种叫呃与 always
这是第一种第二种叫呃与 always always
这是第一种第二种叫呃与 always always什么
这是第一种第二种叫呃与 always always什么意思
是
是不是
是不是
是不是平凡
是不是平凡的啊
是不是平凡的啊不认
是不是平凡的啊不认识
是不是平凡的啊不认识记下来
是不是平凡的啊不认识记下来 f
是不是平凡的啊不认识记下来 frequent
是不是平凡的啊不认识记下来 frequently
是不是平凡的啊不认识记下来 frequently频
是不是平凡的啊不认识记下来 frequently频繁
是不是平凡的啊不认识记下来 frequently频繁的
是不是平凡的啊不认识记下来 frequently频繁的
elapsed_milliseconds:2070.546875
total_duration:9790
rtf:0.21149610572012256
end!

Test results of English model:





 after

 after early

 after early

 after early nightfa

 after early nightfall the ye

 after early nightfall the yellow la

 after early nightfall the yellow lamps

 after early nightfall the yellow lamps would light

 after early nightfall the yellow lamps would light up

 after early nightfall the yellow lamps would light up here

 after early nightfall the yellow lamps would light up here and

 after early nightfall the yellow lamps would light up here and there

 after early nightfall the yellow lamps would light up here and there the squa

 after early nightfall the yellow lamps would light up here and there the squalid

 after early nightfall the yellow lamps would light up here and there the squalid quar

 after early nightfall the yellow lamps would light up here and there the squalid quarter of

 after early nightfall the yellow lamps would light up here and there the squalid quarter of the bro

 after early nightfall the yellow lamps would light up here and there the squalid quarter of the brothel

 after early nightfall the yellow lamps would light up here and there the squalid quarter of the brothels

elapsed_milliseconds:1088.890625
total_duration:6625
rtf:0.16436084905660378
end!

Voice activity detection to solve the problem of reasonable segmentation of long audio. Project address: AliFsmnVad
Text punctuation prediction to solve the problem that the recognition results have no punctuation. Project address: AliCTTransformerPunc

Other Instructions:

Test case: K2TransducerAsr.Examples. Test CPU: Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz 2.59 GHz Supported platforms: Windows 7 SP1 or later, macOS 10.13 (High Sierra) or later, iOS, etc., Linux distributions (specific dependencies are required, see the list of Linux distributions supported by .NET 6 for details), Android (Android 5.0 (API 21) or later).

References

[1] https://github.com/k2-fsa/icefall

[2] https://github.com/naudio/NAudio

Product	Compatible and additional computed target framework versions.
.NET	net5.0 was computed. net5.0-windows was computed. net6.0 is compatible. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 is compatible. net8.0-android was computed. net8.0-android34.0 is compatible. net8.0-browser was computed. net8.0-ios was computed. net8.0-ios18.0 is compatible. net8.0-maccatalyst was computed. net8.0-maccatalyst18.0 is compatible. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net8.0-windows10.0.19041 is compatible. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.
.NET Core	netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 is compatible.
.NET Standard	netstandard2.0 is compatible. netstandard2.1 is compatible.
.NET Framework	net461 is compatible. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 is compatible. net48 is compatible. net481 was computed.
MonoAndroid	monoandroid was computed.
MonoMac	monomac was computed.
MonoTouch	monotouch was computed.
Tizen	tizen40 was computed. tizen60 was computed.
Xamarin.iOS	xamarinios was computed.
Xamarin.Mac	xamarinmac was computed.
Xamarin.TVOS	xamarintvos was computed.
Xamarin.WatchOS	xamarinwatchos was computed.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

.NETCoreApp 3.1
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
.NETFramework 4.6.1
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
.NETFramework 4.7.2
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
.NETFramework 4.8
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
.NETStandard 2.0
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
.NETStandard 2.1
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
net6.0
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
net8.0
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
net8.0-android34.0
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
net8.0-ios18.0
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
net8.0-maccatalyst18.0
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)
net8.0-windows10.0.19041
- ManySpeech.SpeechFeatures (>= 1.1.7)
- Microsoft.ML.OnnxRuntime (>= 1.22.1)

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
1.0.8	79	8/23/2025
1.0.7	68	8/23/2025
1.0.6	146	8/13/2025
1.0.5	118	8/9/2025
1.0.4	125	8/9/2025
1.0.3	216	8/7/2025
1.0.2	109	7/29/2025
1.0.1	291	6/10/2025

ManySpeech.K2TransducerAsr 1.0.8

K2TransducerAsr

Introduction:

Supported Models (ONNX)

How to Use

1. Clone the project source code

2. Download the models from the above list to the directory: /path/to/K2TransducerAsr/K2TransducerAsr.Examples

3. Load the project using vs2022 (or other IDEs)

4. Set the files in the model directory to: Copy to Output Directory → Copy if newer

5. Modify the code in the example: string modelName = [model name]

6. Run the project

Calling Method for Offline (Non-streaming) Models:

1. Add project reference

2. Model initialization and configuration

3. Call

4. Output results:

Calling Method for Real-time (Streaming) Models:

1. Add project reference

2. Model initialization and configuration

3. Call

4. Output results

Related Projects:

Other Instructions:

References

.NETCoreApp 3.1

.NETFramework 4.6.1

.NETFramework 4.7.2

.NETFramework 4.8

.NETStandard 2.0

.NETStandard 2.1

net6.0

net8.0

net8.0-android34.0

net8.0-ios18.0

net8.0-maccatalyst18.0

net8.0-windows10.0.19041

NuGet packages

GitHub repositories