Under development

What it is

FastAPI AI Kit is the production scaffolding I rebuilt one too many times: async FastAPI endpoints, streaming LLM responses, authentication middleware, rate limiting, background job handling, structured logging, Redis caching, and a containerized deploy, all wired together from the start. This is the version I actually want to start from, packaged once and made reusable.

Why I built it

I kept copy-pasting the same FastAPI setup into every AI project I built. The streaming endpoint configuration. The auth middleware. The rate limiting layer. The structured logging setup. The Docker config. It was always the same foundation, and I was always rebuilding it from zero. At some point that stops being setup and starts being waste. So I packaged the version I actually wanted, with streaming, observability, and caching already wired in, and turned it into a starter kit anyone can use to skip the boilerplate.

What's inside

Streaming LLM endpoints out of the box, ready to connect to any model API
Auth, rate limits, and structured logging built in from day one, not added later
Background job handling for async AI workloads that shouldn’t block the request
Containerized and deploy-ready with Docker, from local to production
Redis caching layer for frequently repeated queries, built in from the start
Currently in active development, shipping soon
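
To give a rough idea of what a streaming LLM endpoint involves, here is a minimal sketch that uses only the Python standard library. It is not the kit's actual code. The `fake_model_tokens` generator is a hypothetical stand-in for a real model API client, and both the Server-Sent Events framing and the `[DONE]` sentinel are assumptions chosen for illustration.

```python
import asyncio
import json
from typing import AsyncIterator, List


async def fake_model_tokens(prompt: str) -> AsyncIterator[str]:
    """Hypothetical stand-in for a model client: yields one token at a time."""
    for token in prompt.split():
        await asyncio.sleep(0)  # hand control back to the event loop, as a network read would
        yield token


async def sse_stream(prompt: str) -> AsyncIterator[str]:
    """Wrap each token in a Server-Sent Events frame, then signal completion."""
    async for token in fake_model_tokens(prompt):
        payload = json.dumps({"token": token})
        yield f"data: {payload}\n\n"
    # Assumed end-of-stream marker; the client must agree on this convention.
    yield "data: [DONE]\n\n"


async def collect(prompt: str) -> List[str]:
    """Drain the stream into a list so the frames can be inspected."""
    return [frame async for frame in sse_stream(prompt)]


frames = asyncio.run(collect("hello streaming world"))
```

In a FastAPI route, a generator like `sse_stream` would typically be passed to `StreamingResponse` with `media_type="text/event-stream"`, so the client receives tokens as they arrive instead of waiting for the full completion.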