DocuSearch - AI-Driven Document Search and Retrieval System

DocuSearch is an AI-powered document search and retrieval system that lets users find the document chunks most relevant to their queries. It combines pre-trained NLP models with vector similarity search to retrieve information quickly and accurately from a large collection of documents.
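To make the idea of similarity search concrete, here is a minimal sketch in plain Python. It assumes that queries and document chunks have already been converted to fixed-length embedding vectors by some pre-trained model, and it ranks chunks by cosine similarity. The function names `cosine_similarity` and `top_k_chunks` are hypothetical and chosen for illustration; a real deployment would likely delegate this ranking to the vector database rather than scan vectors in Python.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_chunks(
    query_vec: Sequence[float],
    chunk_vecs: Sequence[Sequence[float]],
    k: int = 3,
) -> List[Tuple[int, float]]:
    """Return (chunk_index, score) pairs for the k most similar chunks, best first."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(chunk_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    # Toy 2-dimensional "embeddings"; real embeddings have hundreds of dimensions.
    chunks = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(top_k_chunks([1.0, 0.0], chunks, k=2))
```

This brute-force scan is linear in the number of chunks, which is acceptable for a demonstration but is the part a dedicated vector index is meant to replace.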

Features

- User-friendly web interface for submitting document retrieval queries.
- AI-powered processing of user queries using Hugging Face Transformers.
- Efficient document retrieval using Milvus, an open-source vector database.
- Support for chunking large documents to optimize retrieval performance.
- Scalable deployment with GPU acceleration on a Nutanix Kubernetes Engine (NKE) cluster.
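The chunking feature can be sketched as follows. This is a minimal illustration, not the project's actual implementation: it assumes chunks are measured in whitespace-separated words and that consecutive chunks overlap by a few words so that sentences cut at a boundary still appear whole in one chunk. The function name `chunk_text` and the default sizes are hypothetical.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 200, overlap: int = 20) -> List[str]:
    """Split text into chunks of at most chunk_size words, each sharing
    `overlap` words with the previous chunk."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[str] = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current window has reached the end of the text,
        # so no trailing chunk made only of overlap words is emitted.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    for chunk in chunk_text("a b c d e f g h i j", chunk_size=4, overlap=1):
        print(chunk)
```

Smaller chunks tend to give more precise matches at the cost of more vectors to store and search; the right size depends on the documents and the embedding model, so the defaults here are placeholders only.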

Tech Stack

- Frontend: Next.js, Axios
- Backend API: Flask, Hugging Face Transformers, PyTorch
- Document Retrieval Engine: Milvus, Python SDK for Milvus
- Container Orchestration: Kubernetes, Docker
- GPU Acceleration: NVIDIA P40 GPUs
- Database: PostgreSQL or MongoDB
- Load Balancer: Nginx or HAProxy
- Cloud Platform: Nutanix Cloud Platform (NCP), AWS/GCP/Azure (optional)
- Monitoring and Logging: Prometheus, Grafana, ELK Stack
- Testing: pytest or unittest
- Documentation: Sphinx or MkDocs

Prerequisites

- Docker installed on the system for containerization.
- NVIDIA GPU drivers and the CUDA toolkit for GPU acceleration (if using NVIDIA P40 GPUs).
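A quick way to check these prerequisites is to look for the relevant command-line tools on the `PATH`. The sketch below assumes that `docker` indicates a Docker installation, `nvidia-smi` indicates the NVIDIA driver, and `nvcc` indicates the CUDA toolkit; it only checks that the executables exist, not that their versions are suitable. The function name `check_prerequisites` is hypothetical.

```python
import shutil
from typing import Dict, Iterable


def check_prerequisites(
    tools: Iterable[str] = ("docker", "nvidia-smi", "nvcc"),
) -> Dict[str, bool]:
    """Map each tool name to True if an executable with that name is on PATH."""
    return {tool: shutil.which(tool) is not None for tool in tools}


if __name__ == "__main__":
    for tool, found in check_prerequisites().items():
        print(f"{tool}: {'found' if found else 'missing'}")
```

The GPU tools are only needed when running with GPU acceleration, so a missing `nvidia-smi` or `nvcc` is not necessarily an error on a CPU-only machine.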

