Commit 0a4e74f

authored Mar 18, 2025
Merge pull request #26 from pamelafox/ragtweaks
Tweaks after RAG presentation
2 parents c1408e1 + 9d3f489 commit 0a4e74f

5 files changed

+147,711 -129,207 lines changed
 

rag_documents_ingestion.py

+1 -1
@@ -44,7 +44,7 @@
 
 # Split the text into smaller chunks
 text_splitter = RecursiveCharacterTextSplitter.from_tiktoken_encoder(
-    model_name="gpt-4o", chunk_size=500, chunk_overlap=0
+    model_name="gpt-4o", chunk_size=500, chunk_overlap=125
 )
 texts = text_splitter.create_documents([md_text])
 file_chunks = [{"id": f"{filename}-{(i + 1)}", "text": text.page_content} for i, text in enumerate(texts)]
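
To illustrate what changing `chunk_overlap` from 0 to 125 does, here is a minimal sketch of fixed-size chunking with overlap over a list of token IDs. It is a simplified sliding window, not the recursive separator-based logic that `RecursiveCharacterTextSplitter` actually uses, and the function name `chunk_with_overlap` and the synthetic token list are assumptions made for illustration only.

```python
def chunk_with_overlap(tokens, chunk_size=500, chunk_overlap=125):
    """Split a token list into windows of chunk_size that share chunk_overlap tokens.

    Hypothetical helper for illustration; not part of LangChain.
    """
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap  # how far each window advances
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last window already reaches the end of the input
    return chunks


# Synthetic "document" of 1200 token IDs (assumed size, for demonstration).
tokens = list(range(1200))

no_overlap = chunk_with_overlap(tokens, chunk_size=500, chunk_overlap=0)
with_overlap = chunk_with_overlap(tokens, chunk_size=500, chunk_overlap=125)

# With overlap, the tail of one chunk is repeated at the head of the next,
# so a sentence cut at a chunk boundary still appears whole in one of them.
shared = with_overlap[0][-125:] == with_overlap[1][:125]
print(len(no_overlap), len(with_overlap), shared)
```

In this sketch, an overlap of 125 on a chunk size of 500 means each chunk repeats the last quarter of the previous one. The cost is more tokens stored and embedded in total; the likely benefit is that text near a chunk boundary keeps its surrounding context.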
