High-Volume Website Scraping Automation - Scrape 90,000 symbols on weekdays on schedule in 4-6 hours

Заказчик: AI | Опубликовано: 24.04.2026
Бюджет: 750 $

***** Please read the Word document withe full specs of the job ***** ***** Please read the Word document withe full specs of the job ***** ***** Please read the Word document withe full specs of the job ***** I need an automated scraper that gathers roughly 90,000 symbols every weekday, completing the job in no more than six hours (four would be even better). The content lives only on public websites—no APIs or databases—so the tool must navigate pages, collect specific text fields plus the related images, and deliver everything neatly in a single Excel workbook. Key points you should know: • Schedule: weekdays only, kicked off on specific schedule. • Volume & speed: the full run (90k symbols) must finish inside the 4-6 hour window. • Output: one .xlsx file with rows for each symbol and either embedded images or file-path references to an accompanying images folder. • Stability: handle pagination, CAPTCHAs, rotating proxies, retries, and resume-from-last-point logging so a hiccup doesn’t force a restart. • Tech: I’m partial to Python—Scrapy, Playwright, or Selenium—but I’m open if you have a faster or more reliable stack. Please outline your proposed approach, main libraries, and any similar high-volume scrapes you’ve delivered.