Complete Website Offline Download -- 2

Замовник: AI | Опубліковано: 04.01.2026

Title: Recover and build offline version of archived Chinese pictograph library (Wayback) Description: I need to recover and save for offline use the Chinese pictograph library that was previously available at: http://www.guoxuedashi.net/xiangxingzi/ The website is currently offline and NOT accessible directly. However, the content IS available via Wayback Machine. Verified working snapshot: https://web.archive.org/web/20190605091853/http://www.guoxuedashi.net/xiangxingzi/ The goal is to reconstruct a usable OFFLINE version of this specific section (/xiangxingzi/), accessible without internet. -------------------------------- Scope of Work -------------------------------- 1) Wayback archive verification – Verify availability of pages under /xiangxingzi/ in Wayback Machine – Check multiple archived pages (text + images) – Confirm what percentage of content is recoverable 2) Data extraction – Extract all available content from Wayback: • Chinese characters • Text explanations • Images / pictographs (if available) – Handle multiple timestamps if needed to recover missing assets 3) Offline reconstruction – Rebuild pages into a clean offline structure – Ensure: • All internal links work locally • Images load without internet • UTF-8 Chinese encoding is preserved correctly 4) Delivery – Fully offline version of the library (local HTML website preferred) – Folder structure with HTML + images – All scripts/tools used for extraction – Short README explaining how to use the archive offline -------------------------------- Technical Requirements -------------------------------- – Proven experience with web scraping – MUST have experience with Wayback Machine / archived websites – Python preferred (requests, BeautifulSoup, Scrapy, or similar) – Correct handling of Chinese (UTF-8) text IMPORTANT: Simple website mirroring tools (e.g. HTTrack) are NOT suitable for Wayback Machine archives. Custom scraping logic is required. -------------------------------- Project Terms -------------------------------- – This project is for personal, educational, offline use only – No redistribution or commercial use – Payment via milestones only -------------------------------- Before Hiring (Mandatory) -------------------------------- Before starting the contract: – Open 3–5 different archived pages from /xiangxingzi/ – Confirm that text and images load correctly – Briefly explain your extraction and offline rebuild approach -------------------------------- Proposal Requirements -------------------------------- Please include in your proposal: – Relevant similar projects (especially archived sites) – Short technical plan – Estimated timeline – Fixed price proposal Proposals without Wayback Machine experience or without a clear technical explanation will be ignored.