- Set up the complete infrastructure stack for a question-answering chatbot over your private data in just a few minutes!
- Your stack will be powered by self-hosted, open-source Large Language Models and Retrieval Augmented Generation, running on Kubernetes clusters in the cloud.
- Cluster Setup Summary
- Install Infrastructure Tools
- Install Model Serve Stack
- Model Serving
- Retrieval Augmented Generation using FAISS
- Creation of the Vector Store
- Install the RAG & LLM querying service
- Send a question to your LLM
- Uninstall
Jump to the complete install doc, available here.
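
To give a feel for what the "Creation of the Vector Store" and "Send a question to your LLM" steps do conceptually, here is a minimal sketch of the retrieval half of Retrieval Augmented Generation. It is not this project's code: the toy bag-of-words `embed` function stands in for a real embedding model, the exact brute-force search in `retrieve` stands in for a FAISS index, and every function name and sample document below is hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding (assumption: stands in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_store(chunks):
    """Embed every chunk once; this list plays the role of the vector store."""
    return [(chunk, embed(chunk)) for chunk in chunks]


def retrieve(store, question, k=2):
    """Return the k chunks most similar to the question (exact brute-force search)."""
    q = embed(question)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]


def build_prompt(store, question, k=2):
    """Prepend the retrieved chunks to the question before it is sent to the LLM."""
    context = "\n".join(retrieve(store, question, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical sample documents, for illustration only.
docs = [
    "The cluster runs on Kubernetes.",
    "FAISS stores the document vectors.",
    "Uninstall removes every component.",
]
store = build_store(docs)
print(build_prompt(store, "Where are the document vectors stored?", k=1))
```

In a real deployment the store is built once from your private documents and queried for every incoming question; only the retrieved chunks, not the whole corpus, are passed to the model as context.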