You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
when adding training data (350 vector) to vanna ai model, it is consuming 100%+ of the cpu (12th Gen Intel(R) i7 - 12650H) and 32 GB RAM and the kernel will "die".
i need to decrease the number of vectors to 70-80 only.
@mkhansa, I'm not familiar with Vanna AI and the problem they are solving. At a glance, it seems it is a RAG application aimed at answering SQL-related questions. Their train workflow seems to be using an LLM to create embeddings from docs, schemas, DDLs etc. Their use of Chroma is also quite straightforward. Without a deeper understanding of what their training workflow does beyond adding embeddings for the documentation in Chroma, I cannot say what could be causing this issue.
What happened?
when adding training data (350 vector) to vanna ai model, it is consuming 100%+ of the cpu (12th Gen Intel(R) i7 - 12650H) and 32 GB RAM and the kernel will "die".
i need to decrease the number of vectors to 70-80 only.
python code:
class MyVanna(ChromaDB_VectorStore, GoogleGeminiChat):
def init(self, config=None):
ChromaDB_VectorStore.init(self, config={
"path": "../path/VannaAI_path"
})
GoogleGeminiChat.init(self, config={'api_key': 'XXXXXX, "temperature":0, 'model': "gemini-1.5-pro"})
vn = MyVanna()
.....
with open('../training_data/doc_training_data.json', 'r') as f:
documentation_list = json.load(f)["documentation"]
for rule in documentation_list:
print(rule)
vn.train(documentation=rule) (here, the notebook crashed)
Versions
chromadb==0.5.3 , Python 3.12.2, windows
Relevant log output
No response
The text was updated successfully, but these errors were encountered: