Python PDF Data Entry Script

Client: AI | Published: 17.01.2026

I have a stack of PDF files that need to be turned into clean, structured data right away. The goal is simple: every relevant field in each PDF must end up in an Excel or CSV file, with 100% accuracy, and the whole process must be driven by Python so I can rerun it whenever new documents arrive.

You may choose whichever Python tools suit you best (pdfplumber, PyPDF2, Tabula, camelot-py, pandas, or even a custom regular-expression approach), as long as the final script is well-commented, easy for me to configure, and runs on Windows. If certain pages or tables require light manual tweaks, document that in the code so I know exactly where any human intervention is expected.

Deliverables (all required):
• A fully working Python script (.py) with clear setup instructions
• One sample run showing raw PDFs in → neatly formatted CSV/Excel out
• Brief "read-me" explaining dependencies, usage, and how to add future PDFs

I need this ASAP, so please reply with your estimated turnaround, the libraries you plan to use, and a quick line on how you guarantee data accuracy.
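For orientation, the kind of rerunnable pipeline described above might look roughly like the sketch below. It is only an illustration under stated assumptions: the field names and regular expressions in FIELD_PATTERNS (invoice_number, date, total) are hypothetical, since the posting does not say which fields the PDFs contain, and pdfplumber is just one of the libraries listed. Fields that cannot be found are left empty and recorded in a needs_review column, which is one possible way to show where human intervention is expected.

```python
import csv
import re
from pathlib import Path

# Hypothetical field patterns: the real ones depend on the actual PDF layout.
FIELD_PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s+No\.?\s*:\s*(\S+)", re.I),
    "date": re.compile(r"Date\s*:\s*(\d{2}[./-]\d{2}[./-]\d{4})", re.I),
    "total": re.compile(r"Total\s*:\s*([\d.,]+)", re.I),
}


def read_pdf_text(pdf_path):
    """Return the text of all pages via pdfplumber (pip install pdfplumber)."""
    import pdfplumber  # imported lazily so the parsing helpers work without it

    with pdfplumber.open(pdf_path) as pdf:
        # extract_text() can return None for pages without a text layer
        return "\n".join(page.extract_text() or "" for page in pdf.pages)


def parse_fields(text):
    """Apply each pattern; return the row plus the names of fields not found."""
    row, missing = {}, []
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        row[name] = match.group(1) if match else ""
        if not match:
            missing.append(name)
    return row, missing


def write_csv(rows, out_path):
    """Write rows to CSV with a fixed column order."""
    columns = ["source_file", *FIELD_PATTERNS, "needs_review"]
    # utf-8-sig adds a BOM so Excel on Windows detects the encoding
    with open(out_path, "w", newline="", encoding="utf-8-sig") as fh:
        writer = csv.DictWriter(fh, fieldnames=columns)
        writer.writeheader()
        writer.writerows(rows)


def process_folder(pdf_dir, out_path):
    """Parse every *.pdf in pdf_dir and write one CSV row per file."""
    rows = []
    for pdf_path in sorted(Path(pdf_dir).glob("*.pdf")):
        row, missing = parse_fields(read_pdf_text(pdf_path))
        row["source_file"] = pdf_path.name
        # Non-empty needs_review marks rows a person should check by hand
        row["needs_review"] = ";".join(missing)
        rows.append(row)
    write_csv(rows, out_path)
    return rows
```

A run could then be a single call such as process_folder(r"C:\pdfs\incoming", "output.csv"), where both paths are placeholders. Adding future PDFs would mean dropping them into the input folder and rerunning; a new document layout would need its own entries in FIELD_PATTERNS.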