Build a production-ready AI voice platform (ElevenLabs clone) from scratch using Fastify, Next.js, Drizzle ORM, Kokoro TTS, and Docker, complete with voice cloning, text-to-speech, and a full React frontend.

Source Code: https://github.com/fiston-user/iilevenn

• Voice CRUD with public library & search
• Audio sample uploads to S3-compatible storage
• Async TTS job queue with RabbitMQ
• Real AI text-to-speech with Kokoro TTS
• Voice recording with live waveform
• TTS studio with voice picker & audio player
• JWT auth with API keys & refresh tokens
• Full Next.js frontend with dark mode

------------------------------
TECH STACK
• Fastify 5 (API Services)
• Next.js 16 (App Router + Tailwind + shadcn/ui)
• Python FastAPI (Kokoro TTS Engine)
• Drizzle ORM + PostgreSQL 16
• RabbitMQ (Job Queue)
• MinIO (S3 Object Storage)
• Redis 7 (Cache)
• Traefik v3 (API Gateway)
• Docker & Docker Compose
• WaveSurfer.js (Audio Playback)

------------------------------
WHAT YOU'LL LEARN
• Designing a microservices architecture from scratch
• Building REST APIs with Fastify & Zod validation
• Database modeling with Drizzle ORM & migrations
• JWT authentication with refresh token rotation
• API key auth with SHA-256 hashing
• File uploads to MinIO with presigned URLs
• Async job processing with RabbitMQ workers
• Integrating a Python TTS engine (Kokoro)
• Recording audio with MediaRecorder & WAV encoding
• Building a full React frontend with Next.js
• Docker infrastructure with 6+ services

------------------------------
TIMESTAMPS
00:00:00 Intro & Demo
00:02:15 Project Bootstrap - pnpm monorepo & project structure
00:10:07 Docker Infrastructure - PostgreSQL, Redis, RabbitMQ, MinIO, Traefik
00:20:17 Database & Schema - Drizzle ORM, 6 tables, migrations & seed
00:39:39 Auth Service - Register, login, JWT & refresh token rotation
01:17:41 API Keys & Dual Auth - xi_ keys, SHA-256 hashing
01:37:43 Voice CRUD & Library - Create, read, update, delete with search & filters
02:09:09 Voice Sample Uploads - MinIO presigned URLs & ownership enforcement
02:35:13 TTS Job Queue - RabbitMQ worker & async job processing
03:08:13 Real TTS Engine - Python FastAPI + Kokoro text-to-speech
03:55:39 Frontend Setup - Next.js, Tailwind, shadcn/ui, sidebar & dark mode
04:17:36 Auth Pages & Context - Login, register, protected routes & user menu
04:58:59 Dashboard & Voice Library - Hero dashboard, search, filters & gradient avatars
05:19:07 My Voices & Management - Voice detail, create/edit forms & sample uploads
06:03:33 Voice Recorder - MediaRecorder, live waveform & WAV encoding
06:33:12 TTS Studio - Voice picker, job polling & audio player

------------------------------
RESOURCES
Fastify: https://fastify.dev
Drizzle ORM: https://orm.drizzle.team
Kokoro TTS: https://github.com/hexgrad/kokoro
shadcn/ui: https://ui.shadcn.com
WaveSurfer.js: https://wavesurfer.xyz

------------------------------
CONNECT WITH ME
Portfolio: https://www.fiston.net
Discord: https://discord.com/invite/cMNSUQvYRt
Contact: fiston.turner@outlook.com

Drop a comment if you have questions.
Subscribe for more full-stack & backend engineering builds.

#elevenlabs #ai #voicecloning #tts #fastify #nextjs #fullstack #webdevelopment #docker #typescript #backend #microservices #python