Skip to content

Conversation

@sani903
Copy link
Collaborator

@sani903 sani903 commented Dec 1, 2025

Added embedding search tool through Agent SDK MCP.

  • Indexing for each instance during training and deletion upon completion
  • Using "all-MiniLM-L6-v2" for faster indexing, can be switched to larger embedding model like "jinaai/jina-embeddings-v2-base-code" or "jinaai/jina-code-embeddings-0.5b"
  • CPU indexing during training for lower GPU usage
  • Modified prompts to support both terminal and embedding search tool. Additional instructions for final output.
  • Eval scripts for testing performance with new tool (some bugs remaining)
  • READMEs not updated yet

@neubig neubig marked this pull request as draft December 31, 2025 02:01
@neubig
Copy link
Contributor

neubig commented Dec 31, 2025

@sani903 I'm converting this to a draft but convert it back and request review if you think it's ready to review and merge in!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants