I’m building a browser-based SaaS that lets anyone generate high-quality written, visual and short-form video content from one place—think Jasper AI or ChatGPT, but truly multi-modal. The application has to: • Accept a single prompt and, depending on the user’s choice, return polished text (articles, social posts, product copy), ready-to-use images, or even 30-second video snippets. • Rely on GPT-4 or a comparable large-language model for text, Stable Diffusion / DALL-E for images, and a lightweight generative-video API (Runway, Pika, or similar) for clips. • Provide a clean, responsive front-end (React or Vue preferred) with a rich-text editor, media preview pane and project history. • Run a secure back-end (Node/Express or Django) with JWT-based auth, usage metering, Stripe-ready subscription tiers and an admin dashboard for model monitoring, prompt analytics and user management. • Ship containerised (Docker) and deployable to AWS or GCP with CI/CD in place. Deliverables will be: 1. Source code for front-end, back-end and infrastructure scripts. 2. API documentation and a brief developer handbook. 3. A live staging URL demonstrating text, image and video generation working end-to-end. I’ll provide API keys and design mock-ups; you concentrate on architecture, integration and clean code. If you have prior experience building AI content tools or SaaS dashboards, that will help us move fast.