WebReaper.Puppeteer
9.0.0
dotnet add package WebReaper.Puppeteer --version 9.0.0
NuGet\Install-Package WebReaper.Puppeteer -Version 9.0.0
<PackageReference Include="WebReaper.Puppeteer" Version="9.0.0" />
<PackageVersion Include="WebReaper.Puppeteer" Version="9.0.0" />
<PackageReference Include="WebReaper.Puppeteer" />
paket add WebReaper.Puppeteer --version 9.0.0
#r "nuget: WebReaper.Puppeteer, 9.0.0"
#:package WebReaper.Puppeteer@9.0.0
#addin nuget:?package=WebReaper.Puppeteer&version=9.0.0
#tool nuget:?package=WebReaper.Puppeteer&version=9.0.0
WebReaper.Puppeteer
Headless-browser (Puppeteer/Chromium) page-load transport for WebReaper, for scraping JavaScript-rendered pages.
Satellite package (ADR-0009): the headless-browser transport is kept out of
the WebReaper core so the core stays dependency-light and Native-AOT-clean.
The core is HTTP-only by default; install this package and call
.WithPuppeteerPageLoader() to scrape Dynamic pages (GetWithBrowser /
FollowWithBrowser / PaginateWithBrowser) — without it a Dynamic load
throws an actionable message. The first Dynamic run downloads Chromium.
Install
dotnet add package WebReaper.Puppeteer
Pulls WebReaper (the core) as a dependency.
Usage
Adds WithPuppeteerPageLoader() to ScraperEngineBuilder:
using WebReaper.Builders;
using WebReaper.Puppeteer;
var engine = await new ScraperEngineBuilder()
.WithPuppeteerPageLoader()
.GetWithBrowser("https://example.com/blog")
.FollowWithBrowser(".post-link")
.Parse(new() { new("title", "h1"), new("text", "article") })
.BuildAsync();
await engine.RunAsync();
License
GPL-3.0-or-later. Part of the WebReaper project.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
-
net10.0
- Microsoft.Extensions.Logging.Abstractions (>= 10.0.8)
- PuppeteerExtraSharp (>= 3.1.1)
- PuppeteerSharp (>= 24.42.0)
- WebReaper (>= 9.0.0)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
9.0.0: lockstep republish against the core 9.0.0 (ADR-0023: the core public surface is now the documented contract; Tier-2 implementation types internalised). No functional change to this package. 8.0.0: lockstep republish against the core 8.0.0 major (ADR-0022); no functional change to this package. 7.0.0: initial release. Headless-browser (Puppeteer/Chromium) page-load transport (ScraperEngineBuilder.WithPuppeteerPageLoader) extracted from the WebReaper core per ADR-0009 so the core stays dependency-light, Native-AOT-clean and HTTP-only by default (core no longer references PuppeteerSharp/PuppeteerExtraSharp or the Chromium provisioning path). Requires WebReaper 9.0.0.