I’m building a sequential crew with distinct tasks/tools: [DIR and FILE search about topic 1, then PDF search about topic 2].
I’m using sentence-transformer models for embedding, configured like this:
emb_conf = dict(
    embedder=dict(
        provider="huggingface",
        config=dict(model=EMBEDDING_MODEL_NAME),
    )
)
and GPT-4o as the LLM.
I noticed issues when the same embedding model is used for both the PDF search tool and the DIR/FILE search tools: the task does not complete, or the RAG mixes up information from the PDF and from the folder, etc.
The problem goes away when I use distinct embedding models for the two tool sets.
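For reference, here is a minimal sketch of that workaround, assuming crewai_tools’ DirectorySearchTool and PDFSearchTool (which accept a `config` dict); the two model names and the file/directory paths are illustrative, not my actual setup:

```python
from crewai_tools import DirectorySearchTool, PDFSearchTool

# Distinct embedding model per tool set (model names are examples only).
dir_emb_conf = dict(
    embedder=dict(
        provider="huggingface",
        config=dict(model="sentence-transformers/all-MiniLM-L6-v2"),
    )
)
pdf_emb_conf = dict(
    embedder=dict(
        provider="huggingface",
        config=dict(model="sentence-transformers/all-mpnet-base-v2"),
    )
)

# Hypothetical paths for the two topics.
dir_search = DirectorySearchTool(directory="./topic1_docs", config=dir_emb_conf)
pdf_search = PDFSearchTool(pdf="./topic2.pdf", config=pdf_emb_conf)
```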
Has anyone run into a similar issue? Is it specific to sentence-transformer models? Did I miss something?
Thanks in advance for your response.