Apify Web Data Scraper

I need an Apify actor that crawls a single website and delivers two things for every page. You can use Puppeteer, Playwright or any other Apify helper library that keeps the run stable and fast. Here’s how I see the workflow: • I’ll share the target domain, URL pattern, and the exact text blocks I care about. • You create or fork an Apify actor in JavaScript/TypeScript, configure the request queues, handle pagination where needed, and store results in a dataset. • The final dataset should export cleanly to JSON and CSV, and the image URLs should be downloadable in bulk (a simple link list or an Apify key-value export is fine). • When the crawl completes, I want a brief README so I can rerun it myself later without touching the code. Acceptance criteria – 100 % of the target pages processed without duplication – Text values stripped of markup and encoded in UTF-8 – All image links valid (HTTP 200) at runtime – Actor runs inside the free Apify memory limit or documents any upgrade need Drop a short note about your previous Apify actors or similar crawlers and how quickly you can turn this around; I’m ready to start as soon as you are.

Python

Регистрация