
30+ Specialized RAG Tutoring Agents
An AI-powered e-learning backend, featuring a multi-agent, multi-modal AI tutor capable of teaching over 30 distinct subjects in multiple languages.
Defining the core problem and identified pain points that necessitated this technical intervention.
Educational platforms lack scalable, interactive support. It's difficult to provide 24/7, on-demand, expert-level tutoring for complex subjects, especially in multiple languages, leaving students with static content.
The architectural and implementation strategy developed to resolve the challenge.
Endpoints for role-based access control (Students, Instructors, Admins), course creation, blog management, student progress tracking, video lectures, auto-graded quizzes, and certificates.
Users interact via text, file upload, or a full voice-in/voice-out conversation. The voice system uses OpenAI's streaming TTS and Whisper transcription for real-time audio.
A central orchestrator receives all new queries and uses an LLM call to classify the user's intent and route it to the correct specialist agent.
Each agent (e.g., 'MathBot', 'ScienceBot', 'PythonBot', 'PublicSpeakingBot') is a RAG-powered LangGraph chain that connects to its own vectorized knowledge base (trained from a dedicated S3 bucket) for expert, context-aware answers.
My specific roles, responsibilities, and the technical value I added to the project lifecycle.
Architected the entire backend application, including the FastAPI API and MongoDB database schema for courses, users, and progress tracking.
Designed and built the core AI Tutor system, featuring a central LLM-based query classifier and an orchestrator to route requests to 30+ specialized agents.
Developed the standardized RAG agent template used by all bots, integrating chat history, vectorized document retrieval, and dynamic Matplotlib plot generation.
Engineered the multi-modal API endpoints for both streaming text (`/invoke-worker`) and streaming voice (`/invoke-voice-worker`).
Integrated OpenAI's Whisper (transcription) and streaming TTS (speech synthesis) to create a seamless, real-time conversational audio experience.
Implemented the complete user authentication and security system (JWT/OAuth2) for role-based access control across Students, Instructors, and Admins.
Deployed 30+ specialized RAG agents covering math, science, programming, public speaking, and more — each with its own vetted knowledge base.
AI tutor absorbed 90% of repeat student queries previously handled by human teaching assistants.
Students receive instant, expert-level answers in multiple languages at any hour — no scheduling required.
Supports text chat, document/file upload, and full real-time voice-in/voice-out conversation via Whisper and TTS.
Central LLM orchestrator classifies intent and routes to the correct specialist agent in under 2 seconds.
Full role-based access control supporting Students, Instructors, and Admins with distinct permission sets.