FieldCure.DocumentParsers.Imaging
1.0.0
dotnet add package FieldCure.DocumentParsers.Imaging --version 1.0.0
NuGet\Install-Package FieldCure.DocumentParsers.Imaging -Version 1.0.0
<PackageReference Include="FieldCure.DocumentParsers.Imaging" Version="1.0.0" />
<PackageVersion Include="FieldCure.DocumentParsers.Imaging" Version="1.0.0" />
<PackageReference Include="FieldCure.DocumentParsers.Imaging" />
paket add FieldCure.DocumentParsers.Imaging --version 1.0.0
#r "nuget: FieldCure.DocumentParsers.Imaging, 1.0.0"
#:package FieldCure.DocumentParsers.Imaging@1.0.0
#addin nuget:?package=FieldCure.DocumentParsers.Imaging&version=1.0.0
#tool nuget:?package=FieldCure.DocumentParsers.Imaging&version=1.0.0
FieldCure.DocumentParsers.Imaging
PDF page image rendering — extension for FieldCure.DocumentParsers.
Core DocumentParsers v2.0 ships with pure-managed PDF text extraction (PdfPig). This package adds page image rendering via PDFtoImage (PDFium native) — useful for vision models, thumbnails, or feeding OCR engines.
Install
dotnet add package FieldCure.DocumentParsers.Imaging
Quick Start
using FieldCure.DocumentParsers;
using FieldCure.DocumentParsers.Imaging;
// Registers PdfImageRenderer, upgrading the factory's .pdf entry
// from text-only PdfParser to a full IMediaDocumentParser.
DocumentParserFactoryImagingExtensions.AddImagingSupport();
var parser = (IMediaDocumentParser)DocumentParserFactory.GetParser(".pdf")!;
// Text extraction (same pipeline as the core package — no regression).
var text = parser.ExtractText(File.ReadAllBytes("document.pdf"));
// Page rendering (new capability).
var images = parser.ExtractImages(File.ReadAllBytes("document.pdf"), dpi: 150);
foreach (var img in images)
File.WriteAllBytes($"{img.Label}.png", img.Data);
Native Dependency
PDFium binaries are bundled via the PDFtoImage package (Windows/Linux/macOS).
For pure managed deployments or environments where native PDFium cannot load,
use the core FieldCure.DocumentParsers package directly — text extraction
does not require this package.
Related Packages
- FieldCure.DocumentParsers — Core text extraction (DOCX/HWPX/XLSX/PPTX/HTML/PDF)
- FieldCure.DocumentParsers.Ocr — Tesseract OCR fallback for scanned PDFs
License
MIT — Copyright (c) 2026 FieldCure Co., Ltd.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
-
net10.0
- FieldCure.DocumentParsers (>= 2.0.0)
- PDFtoImage (>= 5.2.0)
-
net8.0
- FieldCure.DocumentParsers (>= 2.0.0)
- PDFtoImage (>= 5.2.0)
NuGet packages (1)
Showing the top 1 NuGet packages that depend on FieldCure.DocumentParsers.Imaging:
| Package | Downloads |
|---|---|
|
FieldCure.DocumentParsers.Ocr
Tesseract OCR fallback for scanned PDFs on Windows. Plugs into FieldCure.DocumentParsers as an IDocumentParser for .pdf that renders pages with PDFium and recognizes text with Tesseract when PdfPig yields no content. |
GitHub repositories
This package is not used by any popular GitHub repositories.
| Version | Downloads | Last Updated |
|---|---|---|
| 1.0.0 | 250 | 4/20/2026 |
v1.0.0 — Initial release. Splits page image rendering out of the legacy FieldCure.DocumentParsers.Pdf package. `PdfImageRenderer : IMediaDocumentParser` renders PDF pages via PDFium (PDFtoImage); `ExtractText` delegates to the core `PdfParser` so registration is additive. Use `DocumentParserFactoryImagingExtensions.AddImagingSupport()`.