Replies: 4 comments
-
Yes, @qingwu11. I also wondered the same thing when I saw how this id is created. I think that only the |
Beta Was this translation helpful? Give feedback.
-
In our testing we actually did find better search by embedding the SQL query particularly if it contains specific terminology that may not be contained in the question. However, to @erickrribeiro 's point -- I think that the ID should only be based on the question |
Beta Was this translation helpful? Give feedback.
-
Thanks @zainhoda for considering the ID suggestion based on the question. Continuing with the subject, I separated three advantages of using the ID in this way:
|
Beta Was this translation helpful? Give feedback.
-
@erickrribeiro Is training these ddl, sql and document files mainly for offline vector storage operations? Do these files need to be defined according to their own databases? Is each input query question converted into a vector and then the similarity calculated with the three file vectors in the vector library? Then combine it into prompt and send it to llm? Is this the workflow? |
Beta Was this translation helpful? Give feedback.
-
Why is "add_question_sql" storing vectors as question+sql? SQL isn't natural language, so it wouldn't affect the semantic matching of input queries?
Beta Was this translation helpful? Give feedback.
All reactions