RAG-Simplified: A DIY Approach to Retrieval-Augmented Generation

Introduction

This repository contains a simple and customizable implementation of Retrieval-Augmented Generation (RAG) using SQLModel, PostgreSQL with PGVector, and Cohere's embedding model. It accompanies the Medium blog post "Search with RAG: A Simple DIY Approach" offering a step-by-step guide to building your own RAG system.

Requirements

Python 3.7 or higher
PostgreSQL with PGVector extension
Cohere API key (sign up at Cohere to obtain one)

Setup Instructions

Clone the repository:

git clone [email protected]:helmanofer/simple_rag.git

Install the required Python packages:

cd simple_rag
pip install -r requirements.txt

Set up your local PostgreSQL database using docker compose:
```
docker compose up -d
```
Update the .env file with your Cohere API key:
```
COHERE_KEY=Your_Cohere_Key
```
You're all set! Now you can start experimenting with the provided code and adapting it to your specific use case.

Usage

The repository includes the following key components:

db_models.py: Defines the DTO models using SQLModel, including the Document class for storing text data and embeddings.
embeddings.py: Contains the co_embed function, which utilizes Cohere's multilingual embedding model to generate embeddings for a list of texts.
search.py: Implements the search functionality, allowing you to query the database and retrieve relevant documents based on cosine distance.
index.py: Implements the indexing functionality, allowing you to store documents in the database

Customization

The beauty of this DIY approach is its simplicity and flexibility. You can easily adapt the code to your specific needs:

Adjust the embedding model used in the co_embed function to match your requirements.
Modify the Document class in document.py to include additional fields or adjust the embedding dimension.
Experiment with different tokenization approaches or text preprocessing techniques to suit your data.

Contributing

Contributions are welcome! If you have suggestions, improvements, or bug fixes, feel free to open a pull request or create an issue. Please ensure that your contributions adhere to the project's code style and include appropriate documentation.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Happy searching, and stay tuned for more exciting NLP adventures!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.env		.env
README.md		README.md
db_models.py		db_models.py
docker-compose.yml		docker-compose.yml
embeddings.py		embeddings.py
index.py		index.py
rabin.txt		rabin.txt
requirements.txt		requirements.txt
search.py		search.py
settings.py		settings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG-Simplified: A DIY Approach to Retrieval-Augmented Generation

Introduction

Requirements

Setup Instructions

Usage

Customization

Contributing

License

About

Uh oh!

Releases

Packages

Languages

wlwwt/simple-rag

Folders and files

Latest commit

History

Repository files navigation

RAG-Simplified: A DIY Approach to Retrieval-Augmented Generation

Introduction

Requirements

Setup Instructions

Usage

Customization

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages