Project Case Study

RAG CHATBOT Braxton Animal Works

PROJECT 01 Braxton Assistant AI-powered customer support chatbot for a local small business, built on a zero-budget RAG architecture and deployed to production. RAG · LlamaIndex · ChromaDB · Groq · Streamlit · Google Sheets The Problem Braxton's team was spending significant time answering the same customer questions...

RAG Chatbot / Braxton Animal Works

PROJECT 01

Braxton Assistant

AI-powered customer support chatbot for a local small business, built on a zero-budget RAG architecture and deployed to production.

RAG · LlamaIndex · ChromaDB · Groq · Streamlit · Google Sheets

The Problem

Braxton's team was spending significant time answering the same customer questions repeatedly — about hours, services, pricing, and policies. There was no self-serve way for customers to get answers outside of business hours, and no existing tech team or budget to bring in a solution.

The challenge for Summit was to build something genuinely useful that cost nothing to run, was easy for the client to manage without technical knowledge, and could be deployed and maintained by a single high school student.

The Solution

I designed and built a Retrieval-Augmented Generation (RAG) chatbot that answers customer questions using Braxton's own business documentation as its knowledge base. When a customer asks a question, the system retrieves the most relevant chunks of content from the business's documents and passes them to a language model to generate an accurate, grounded answer.

The entire system runs on free infrastructure. There are no ongoing costs to the client. An admin analytics dashboard gives the business owner visibility into what customers are asking, how the bot is performing, and any questions it couldn't answer — feeding a continuous improvement loop.

ARCHITECTURE DIAGRAM

Key Features

RAG pipeline with document-grounded answers — the bot only responds based on Braxton's actual content, preventing hallucinations

Admin analytics dashboard showing query volume, top questions, and unanswered queries

Google Sheets integration logging every conversation for the client to review without any technical access

Zero-cost infrastructure using entirely free-tier services — no ongoing cost to the client

Streamlit chat UI deployable via Streamlit Community Cloud with a shareable URL

Challenges & Solutions

Making RAG work on zero budget

Most RAG implementations rely on paid embedding APIs (OpenAI, Cohere) and hosted vector databases. I had to find a fully free alternative stack that still produced accurate retrieval. The solution was combining HuggingFace's free sentence-transformer models with a locally hosted ChromaDB instance — which worked reliably and with latency acceptable for a chat interface.

Building for a non-technical client

The client needed to be able to see what the chatbot was doing without touching any code. The Google Sheets logging and Streamlit admin dashboard were designed specifically around this constraint — everything the client needs to know is in a spreadsheet they already understand.

Deployment with no DevOps

Getting a Streamlit app reliably accessible to customers without a server or domain required using Streamlit Community Cloud, which provides free hosting for public apps. The deployment process was fully automated so updates push live without manual steps.

Results

Successfully deployed and live — customers can query the assistant 24/7

Client has full visibility into usage through the analytics dashboard and Sheets log

Zero ongoing infrastructure cost to the client or to Summit

Demonstrated that enterprise-grade RAG architecture is achievable on a zero budget

What I Learned

This project pushed me to find creative solutions within hard constraints — no money, no team, a client with no technical background. The biggest insight was that the user experience for the business owner matters just as much as the chatbot itself. Building the admin layer took as much thought as the RAG pipeline, and it's what made the project feel complete rather than like a demo.

I also learned that Groq's free-tier inference is genuinely fast enough for production — response times were consistently under two seconds, which is competitive with paid alternatives.