docs: Add Scrapling guide #938

Open
vdusek wants to merge 3 commits into master from docs/scrapling-guide

@vdusek
Contributor

@vdusek vdusek commented Jun 5, 2026

Adds a new guide for using the Scrapling adaptive web scraping library in Apify Actors, following the structure of the existing scraping-library guides (BeautifulSoup, Playwright, Scrapy).

What's included

  • docs/03_guides/09_scrapling.mdx — the guide: introduction & features, choosing a fetcher (HTTP vs. browser-based), a runnable example Actor, Apify Proxy integration, and a section on running browser-based fetchers (DynamicFetcher/StealthyFetcher) with the required scrapling install step in the Dockerfile.
  • docs/03_guides/code/09_scrapling.py — the runnable example: a recursive title scraper using Scrapling's async HTTP AsyncFetcher through Apify Proxy.
  • Link to the new guide added to the quick-start "Guides" list.
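The crawl logic behind a "recursive title scraper" can be sketched without any third-party dependency. The block below is a minimal illustration, not the code in `docs/03_guides/code/09_scrapling.py`: it uses only the Python standard library, it walks links breadth-first up to a depth limit, and `fetch_html` is a hypothetical async callable (URL in, HTML string out) standing in for whatever fetcher the Actor uses, such as Scrapling's `AsyncFetcher` routed through Apify Proxy. The names `TitleAndLinkParser`, `scrape_titles`, and `max_depth` are assumptions made for this sketch.

```python
import asyncio
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class TitleAndLinkParser(HTMLParser):
    """Collects the <title> text and every <a href> value from one HTML page."""

    def __init__(self) -> None:
        super().__init__()
        self.title = ''
        self.links: list[str] = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == 'title':
            self._in_title = True
        elif tag == 'a':
            href = dict(attrs).get('href')
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == 'title':
            self._in_title = False

    def handle_data(self, data):
        # Title text may arrive in several chunks, so accumulate it.
        if self._in_title:
            self.title += data


async def scrape_titles(start_url, fetch_html, max_depth=1):
    """Crawl breadth-first from start_url and return {'url', 'title'} dicts.

    fetch_html is a hypothetical async callable (url -> HTML string); in a
    real Actor it would wrap the chosen fetcher and proxy configuration.
    """
    queue = deque([(start_url, 0)])
    seen = {start_url}
    results = []
    while queue:
        url, depth = queue.popleft()
        parser = TitleAndLinkParser()
        parser.feed(await fetch_html(url))
        results.append({'url': url, 'title': parser.title.strip()})
        if depth >= max_depth:
            continue  # Do not enqueue links beyond the depth limit.
        for href in parser.links:
            # Resolve relative links and drop #fragments before deduplicating.
            link, _ = urldefrag(urljoin(url, href))
            if link.startswith(('http://', 'https://')) and link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return results
```

Passing the fetcher in as a parameter keeps the traversal testable offline: a stub that returns canned HTML can replace the network call, and `asyncio.run(scrape_titles(url, stub))` then exercises the deduplication and depth handling on its own.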

Verification

  • The example Actor was run locally (apify run) and on the Apify platform (build + run SUCCEEDED), producing correct dataset output with Apify Proxy.
  • The browser-based path from the guide (Apify Playwright base image + scrapling install) was also built and run on the platform to confirm the Dockerfile works — DynamicFetcher extracted data through a real browser.
  • ruff check docs/ and ty check pass.

Numbered 09 to sit after the in-progress uv guide (#932), which takes 08.

@vdusek vdusek added the adhoc and t-tooling labels Jun 5, 2026
@vdusek vdusek self-assigned this Jun 5, 2026
@github-actions github-actions Bot added this to the 142nd sprint - Tooling team milestone Jun 5, 2026
@codecov

codecov Bot commented Jun 5, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 86.98%. Comparing base (24a6edb) to head (2a41a3f).
⚠️ Report is 3 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #938      +/-   ##
==========================================
+ Coverage   86.94%   86.98%   +0.03%     
==========================================
  Files          48       48              
  Lines        2942     2942              
==========================================
+ Hits         2558     2559       +1     
+ Misses        384      383       -1     
Flag          Coverage Δ
e2e           37.76% <ø> (+0.20%) ⬆️
integration   59.14% <ø> (+0.23%) ⬆️
unit          75.69% <ø> (ø)

@vdusek vdusek marked this pull request as ready for review June 5, 2026 09:54
@vdusek vdusek requested a review from szaganek as a code owner June 5, 2026 09:54
@vdusek vdusek requested a review from Mantisus June 5, 2026 09:54

2 participants