CrewAI crashing with local llama.cpp

Hello there

I’m trying to use a local llama.cpp server with CrewAI, but I keep getting the following error from the llama.cpp server, which then causes the CrewAI run to crash:

got exception: {"code":500,"message":"Cannot have 2 or more assistant messages at the end of the list.","type":"server_error"}
srv  log_server_r: request: POST /v1/chat/completions 127.0.0.1 500
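
For context, here is roughly how I’m wiring CrewAI to the server. This is a minimal, simplified sketch, not my exact code: it assumes llama-server is running locally with its default OpenAI-compatible endpoint on port 8080, and the model name, agent, and task are just placeholders.

from crewai import Agent, Task, Crew, LLM

# llama.cpp's llama-server exposes an OpenAI-compatible API under /v1
# (port 8080 is the default; model name and URL are placeholders for my setup)
llm = LLM(
    model="openai/gemma-3",               # "openai/" prefix routes through the OpenAI-compatible provider
    base_url="http://localhost:8080/v1",
    api_key="sk-no-key-required",         # llama-server ignores the key unless started with --api-key
)

researcher = Agent(
    role="Researcher",
    goal="Answer questions concisely",
    backstory="A helpful assistant.",
    llm=llm,
)

task = Task(
    description="Summarize the topic in a few sentences.",
    expected_output="A short paragraph.",
    agent=researcher,
)

crew = Crew(agents=[researcher], tasks=[task])
result = crew.kickoff()
print(result)

The crash happens during kickoff(), when CrewAI sends the chat completion request shown in the log above.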

I’ve tried hosting Gemma 3, DeepSeek, and Qwen, and I get the same error with all of them.

Is there a solution to this issue? I also tried using Ollama, but encountered a different problem, possibly related to LiteLLM.

Thanks!