Trying to run a Llama model downloaded locally on my server using Hugging Face Hub

Hi, I am trying to build a RAG pipeline for PDFs. I'm using Llama 3.1 as my LLM, which has been downloaded locally onto my server with an L4 GPU. How do I use that model with CrewAI?
PS: I want everything to run locally.

Did you manage to get this working?

I suggest you look at Ollama and LM Studio. These should work out of the box with LiteLLM, which is what CrewAI uses under the hood for LLM calls.
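For example, with Ollama serving your local Llama 3.1, you can point CrewAI's `LLM` class at it. A minimal sketch, assuming Ollama is running on its default port (11434) and you've already pulled the model with `ollama pull llama3.1`:

```python
from crewai import Agent, Crew, Task, LLM

# Point CrewAI (via LiteLLM) at the local Ollama server.
# "ollama/" is LiteLLM's provider prefix; the base_url is
# Ollama's default endpoint, so nothing leaves your machine.
local_llm = LLM(
    model="ollama/llama3.1",
    base_url="http://localhost:11434",
)

# A hypothetical agent for the PDF RAG use case described above.
analyst = Agent(
    role="PDF analyst",
    goal="Answer questions about the ingested PDF",
    backstory="An analyst that runs entirely on local infrastructure.",
    llm=local_llm,
)

task = Task(
    description="Summarize the key points of the uploaded PDF.",
    expected_output="A short bullet-point summary.",
    agent=analyst,
)

crew = Crew(agents=[analyst], tasks=[task])
result = crew.kickoff()
print(result)
```

LM Studio works similarly since it exposes an OpenAI-compatible endpoint; you'd swap in its local URL as the `base_url` instead.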