Advanced Content Summarizer
A full-stack pipeline that turns PDFs, images, or raw text into laid-out PDFs with context-aware summaries and actionable next-step CTAs.
- Dual-LLM extraction with image fidelity. PyMuPDF + PaddleOCR + MiniCPM-V parse the source; Qwen 2.5 summarizes; Claude handles CTAs and layout reasoning; Gemini regenerates image regions; WeasyPrint renders the final PDF. Unique-ID tracking keeps every image bound to its original context. Lands at $0.05 to $0.15 per document.
- Async infrastructure for a single consumer GPU. FastAPI + Dramatiq + Redis + SSE coordinate long-running jobs while sequential GPU swaps fit MiniCPM-V and Qwen 2.5 onto a 12GB RTX 2060. No cloud GPU, no model downgrades, no compromises on quality.
- One workstation, fully containerized. Docker Compose, Traefik, and the NVIDIA Container Toolkit make the whole stack reproducible. Clone, compose up, ship.