Full product lifecycle: GPU media pipeline, payments, reliability
Swapzy
A face-swap video generator: pick a GIF, add a face, get a shareable clip. It is the only GPU-backed generative-media pipeline in the portfolio.
- Role: Sole engineer, designer, and operator
- Year: 2026
- Status: Live
- Python / FastAPI
- React / Vite
- FaceFusion + ComfyUI (GPU)
- AWS EC2 GPU Auto Scaling + SQS
- RDS PostgreSQL / ElastiCache
- Cognito / WAF / CloudFront
- Stripe
- AWS CDK / GitHub Actions OIDC
7m 25s
RDS restore drill (recovery time)
2026-04-18 · Chaos drill against a 30-minute RTO target
Challenge
Face-swap video is a generative-media problem, not a CRUD app. Each job needs a GPU, takes seconds to minutes, can fail, and costs real money per second of compute. Build it naively and it either melts the budget sitting idle or makes users wait behind a cold queue. The product had to feel instant to a first-time visitor while staying economically sane at near-zero traffic.
Process
The pipeline is fully asynchronous: an upload triggers an S3 event notification to SQS, a GPU worker picks up the job and runs FaceFusion (with ComfyUI as a second engine for stylized swaps), and the result is delivered through CloudFront. The API runs on a multi-AZ Auto Scaling Group. The GPU worker fleet runs on Spot instances and scales on queue depth, with a minimum of one warm worker so that a new user’s first job never waits behind a cold start. That floor is a deliberate cost-versus-latency tradeoff, made with the math written down.
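As a rough illustration of the two mechanics described above, the sketch below parses an S3 event notification out of an SQS message body and computes a worker count from queue depth with a floor of one warm worker. The function names, the jobs-per-worker figure, and the fleet ceiling are assumptions made for the example, not values taken from Swapzy's configuration.

```python
import json
import math
from urllib.parse import unquote_plus


def uploads_from_sqs_body(body: str) -> list[tuple[str, str]]:
    """Extract (bucket, key) pairs from an S3 event notification delivered via SQS."""
    event = json.loads(body)
    uploads = []
    # S3 test events carry no "Records" key, so fall back to an empty list.
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # S3 URL-encodes object keys in notifications (a space arrives as "+").
        key = unquote_plus(record["s3"]["object"]["key"])
        uploads.append((bucket, key))
    return uploads


def desired_workers(
    queue_depth: int,
    jobs_per_worker: int = 5,  # hypothetical backlog one worker can absorb
    min_workers: int = 1,      # the warm-worker floor described in the text
    max_workers: int = 4,      # hypothetical fleet ceiling
) -> int:
    """Scale on queue depth, clamped between the warm floor and the ceiling."""
    needed = math.ceil(queue_depth / jobs_per_worker)
    return max(min_workers, min(max_workers, needed))
```

With these illustrative defaults, an empty queue still yields one worker and a backlog of 12 jobs asks for three. In a real deployment an Auto Scaling policy would evaluate a rule of this shape; the function only shows the arithmetic.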
Security and reliability were treated as first-class concerns. GitHub Actions deploys via federated OIDC with no stored keys and tag-scoped SSM access; the runtime database user is DML-only, separate from the migration owner; and an append-only, hash-chained audit log is anchored hourly to S3 Object Lock in WORM mode. The system carries 21+ CloudWatch alarms and was validated through a chaos-drill campaign, including a database restore measured at 7 minutes 25 seconds against a 30-minute target, roughly a quarter of the budget. A Stripe-billed Pro tier and an admin-gated entitlement system are built and verified end-to-end on production.
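A hash-chained audit log can be sketched in a few lines. The version below is a minimal in-memory illustration, assuming SHA-256 and a canonical JSON encoding; the field names, the genesis value, and the storage layout are hypothetical and are not drawn from Swapzy's implementation.

```python
import hashlib
import json

# Hypothetical starting value for the chain before any entry exists.
GENESIS = "0" * 64


def entry_hash(prev_hash: str, payload: dict) -> str:
    """Hash the previous entry's hash together with a canonical payload encoding."""
    body = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + body).encode("utf-8")).hexdigest()


def append_entry(log: list[dict], payload: dict) -> dict:
    """Append a new entry that commits to everything written before it."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    entry = {
        "payload": payload,
        "prev_hash": prev_hash,
        "hash": entry_hash(prev_hash, payload),
    }
    log.append(entry)
    return entry


def verify_chain(log: list[dict]) -> bool:
    """Recompute every link; any edited or reordered entry breaks the chain."""
    prev_hash = GENESIS
    for entry in log:
        if entry["prev_hash"] != prev_hash:
            return False
        if entry["hash"] != entry_hash(prev_hash, entry["payload"]):
            return False
        prev_hash = entry["hash"]
    return True
```

Under this scheme, the hash of the latest entry is the value one would write to the Object Lock bucket on each anchoring run. A later verification that reproduces the anchored hash shows that no earlier entry was altered after the anchor was written.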
Result
Live in an audience-scoped soft launch (friends-and-family beta) at swapzy.co. The generative pipeline, payments, and reliability engineering are all built and proven; the product is deliberately held in restricted beta rather than opened to the public. It is the one project in the portfolio running a GPU generative-media workload as its core, with the distinct architecture and economics that implies.