:Poll What is your preferred LLM to use in your crews?

matt · September 8, 2024, 4:50pm

Hi everyone,

Being able to track what LLMs are preferred by the crewAI users allow us to understand where our focus should potentially be. Hence there is a poll!

Preferred LLM

GPT4o
GPT4o-Mini
Gemini 1.5 Pro
Claude Sonnet 3.5
Llama 3.5 8B
Llama 3.5 70B
Llama 3.5 405B
Other

0 voters

zinyando · September 9, 2024, 6:45am

I assume you mean Llama 3.1 and not 3.5

matt · September 9, 2024, 6:47am

Great catch but unfortunately we cannot change after it has been up for 5 minutes

moto · September 11, 2024, 11:38pm

This is additional work but I think it would be informative to ask about llm’s for what purpose. I use gpt4omini
for general inference a lot but have to use gpt4o for a planning llm because I can never get a decent plan out of gpt4omini. For vision rag, etc I always use gemini 1.5 flash because it has a 1M token context and is of course multimodal and not too expensive. Even there though I use gemini 1.5 pro to do planning llm work as it does a better job.

Dabnis · September 12, 2024, 9:09am

When considering such as hierachichal/planned processes I have found that Phi3-medium & to some extent Phi3-mini have far greater reasoning skills than other LLMs.
While still relatively new to CrewAI I find that choosing the right LLM for the Agent within a multi agent system as apposed to defaulting to one provides far better results. E.G: manger_llm == Phi3, general agent default to GTP4omini, etc.

jklre · September 13, 2024, 3:59am

I use multiple. Each best per task in the same crew

Amitava_Ghosh · May 4, 2025, 10:58am

For paid ones, task planning, claude sonnet and deepseek both work. And for tool calling and structured output, qwen2.5-coder 7b has been working out. We are trying out, using passing how an user prompt would look like, which can act as trigger to identify an action on our platform.

Even in that, qwen2.5-coder acting both as planner and executor llm has worked out for a limited test case. Pretty sure, for such a case, identifying an user intent to an org specific action would be better done by openai, and from there on, the classification can be with claude or qwen2.5-coder.

We are setting up a test framework, so will be able to experiment better.

Topic		Replies	Views
"Fallback" LLM Configuration LLMs gpt4o	5	38	June 10, 2025
A question regarding using open source LLMs through Ollama with CrewAI CrewAI Community Support	12	1262	September 18, 2024
How to use the gpt-4o-mini LLM? CrewAI Community Support	5	601	September 13, 2024
Please explain Manager LLM like I am 25 CrewAI Community Support crewai	6	153	May 6, 2025
Using Phi3:medium (128k) as a manger LLM in Hierachical process issues CrewAI Community Support	9	195	September 12, 2024

:Poll What is your preferred LLM to use in your crews?

Related topics