PROJECT TITLE
AI Coaches - Creator Monetization
I led the end-to-end design, development, and deployment of AI Coaches — a creator monetization platform built as a Whop App.
My responsibilities included product design, full-stack engineering, and AI architecture. I built the multi-format knowledge processing pipeline (PDFs, YouTube, websites, images with OCR), the RAG-powered chat engine with Google Vertex AI and Pinecone, and the real-time voice input system using Deepgram WebSocket streaming.
I also designed the monetization layer with Whop API integration for subscription and one-time payment handling, built a 3-tier context caching system (in-memory → Redis → PostgreSQL), and implemented group chat, multi-coach conversations, theming, and comprehensive creator analytics.
AI Coaches is a creator monetization platform — think "NotebookLM meets Gumroad" — that enables creators to transform their knowledge into AI-powered coaches their audience pays to interact with.
Creators upload their content (documents, videos, websites) and the platform builds intelligent AI assistants that understand and communicate their specific expertise. Each coach supports customizable personas, flexible pricing models (free, subscription, one-time), voice input, file attachments, and full theming.
Built as a Whop App, it integrates directly into Whop's creator ecosystem for seamless monetization, with features including group chats, multi-coach conversations, BYOK (bring your own API key), and detailed revenue and engagement analytics.
Content creators have valuable knowledge but limited ways to provide 24/7 interactive support to their audience. Building custom AI assistants traditionally requires significant technical expertise. <br/><br/> Monetizing AI-powered services requires separate infrastructure for payments, hosting, and scaling — creating barriers that prevent most creators from offering AI-driven experiences.
a. User Challenges: Creators struggle to scale their expertise beyond live sessions and pre-recorded content. Their audience wants on-demand, personalized answers but creators can't be available 24/7. Existing AI tools are generic and don't reflect the creator's unique knowledge and voice.
b. Business Challenges: Building a multi-tenant platform that processes diverse knowledge formats into searchable vector embeddings, delivers real-time AI chat with source citations, handles payment flows through Whop's ecosystem, and scales cost-effectively with per-creator usage tracking and quota management.






Course Creator Persona
Alex, an online course creator with 50,000+ students, wants to offer a premium AI coach trained on his course material that students can chat with for instant answers — generating recurring revenue while reducing his support workload.

Community Leader Persona
Maria, a Whop community owner running a fitness coaching group, needs an AI coach that answers nutrition and workout questions based on her published guides and videos — available 24/7 to her paying members without her manual involvement.

Frontend
Built with Next.js 15 and TypeScript, featuring Tailwind CSS 4, Radix UI components, Recharts for analytics, SiriWave for voice visualization, and dnd-kit for drag-and-drop interactions.
Backend
Next.js API routes (serverless) with Prisma ORM, handling chat streaming via SSE, knowledge processing, and Whop webhook integration for payment events.
Database
PostgreSQL via Supabase with Prisma ORM for coaches, conversations, messages, knowledge sources, processing jobs, analytics, and payment tracking across 15 interconnected models.
AI Integration
Google Vertex AI (Gemini 2.5 Flash Lite) as primary LLM with BYOK support for OpenAI, Anthropic, and Google models. Vertex AI Text Embeddings for vector generation and Pinecone for semantic search.
Voice & OCR
Deepgram WebSocket API for real-time voice transcription with SiriWave audio visualization. AWS Textract for OCR on uploaded images (PNG, JPG, TIFF).
Monetization
Whop API integration (GraphQL + REST) for automated product/plan creation, subscription management, webhook-driven access control, and per-member message quota tracking.
Infrastructure
AWS S3 for file storage with presigned URLs, Redis/Valkey (Upstash) for warm caching, AWS Lambda for async processing, Sentry for error monitoring, and PostHog for product analytics.
a. Multi-Format Knowledge Processing Pipeline
b. RAG Chat Engine with 3-Tier Caching
c. Creator Monetization & Whop Integration
d. Advanced Chat Features