FastTextProcess 1.0.2
dotnet add package FastTextProcess --version 1.0.2
NuGet\Install-Package FastTextProcess -Version 1.0.2
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="FastTextProcess" Version="1.0.2" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add FastTextProcess --version 1.0.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
#r "nuget: FastTextProcess, 1.0.2"
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install FastTextProcess as a Cake Addin #addin nuget:?package=FastTextProcess&version=1.0.2 // Install FastTextProcess as a Cake Tool #tool nuget:?package=FastTextProcess&version=1.0.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
FastTextProcess
Natural Language Processing feature extraction multilingual toolset. (in the development phase now)
- Axaprj.FastTextProcess Text to embedded vectors conversion high-performance framework (Facebook FastText engine and dictionaries based).
- Axaprj.Textc.Vect Natural Language features extraction framework (Takenet.Textc based)
- Axaprj.WordToVecDB DB access layer of embedded vectors processing tools.
code
VS 2017, .NET Core, SQLite, XUnit
toolchain and data (currently used)
- FastText pre-trained word vectors model extract into
DataArcDir
- FastText pre-trained word vectors dictionary extract into
DataArcDir
- Large Movie Review Dataset v1.0 extract into
DataArcDir
- own FastText VS 2017 CMake compatibility fix
- own cnn-text-classification-tf preprocessed data loader
- own Takenet.Textc .Net Core port
start of usage
- setup
FastTextProcess.Tests.json
{
"DataArcDir": "c:/data.arc/",
"FastTextBin": "c:/data.arc/fasttext.exe"
}
- Create pretrained vectors DB FastTextProcess.Tests.FastTextRoutines
ProcCreateDbEn()
- Append Train and Tests AclImdb Data FastTextProcess.Tests.FastTextRoutines
ProcAclImdbTrain()
ProcAclImdbTest()
- Looking for result into
$DataArcDir/AclImdb_proc.db
advanced and experimental
FastTextRoutinesCyr - multilingual processing: languages detection and Ru, Uk, En texts processor. Aligned word vectors used.
reference
[1] T. Mikolov, E. Grave, P. Bojanowski, C. Puhrsch, A. Joulin. Advances in Pre-Training Distributed Word Representations @inproceedings{mikolov2018advances, title={Advances in Pre-Training Distributed Word Representations}, author={Mikolov, Tomas and Grave, Edouard and Bojanowski, Piotr and Puhrsch, Christian and Joulin, Armand}, booktitle={Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018)}, year={2018}}
[2] A. Joulin, P. Bojanowski, T. Mikolov, H. Jegou, E. Grave, Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion @InProceedings{joulin2018loss, title={Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion}, author={Joulin, Armand and Bojanowski, Piotr and Mikolov, Tomas and J\'egou, Herv\'e and Grave, Edouard}, booktitle={Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing}, year={2018}}
[3] P. Bojanowski*, E. Grave*, A. Joulin, T. Mikolov, Enriching Word Vectors with Subword Information @article{bojanowski2017enriching, title={Enriching Word Vectors with Subword Information}, author={Bojanowski, Piotr and Grave, Edouard and Joulin, Armand and Mikolov, Tomas}, journal={Transactions of the Association for Computational Linguistics}, volume={5}, year={2017}, issn={2307-387X}, pages={135--146}}
author
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.
-
.NETStandard 2.0
- Axaprj.WordToVecDB (>= 1.0.2)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.