Understanding Hierarchical Multi-Agent Systems with Minute-Level Response Times

Greetings,

Our team recently conducted a proof of concept with CrewAI using a simple hierarchical architecture. In this setup, a manager LLM/agent dispatches queries to specialist agents, with all specialists using only function calls that require structured inputs.
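For context, the crew is wired up roughly as follows. This is a simplified sketch of CrewAI's hierarchical process, not our exact configuration: the role names, the single specialist, and the "gpt-4o" manager model are placeholders.

```python
from crewai import Agent, Crew, Process, Task

# One specialist agent; in our POC each specialist has 1-3 function tools
# (tools omitted here for brevity).
billing_agent = Agent(
    role="Billing specialist",
    goal="Answer billing questions via structured function calls",
    backstory="Handles billing-related intents only.",
)

task = Task(
    description="Answer the user's question.",
    expected_output="A short, direct answer.",
    agent=billing_agent,
)

# Hierarchical process: a manager LLM decides which specialist handles
# each query. Note that every routing hop is at least one extra LLM call.
crew = Crew(
    agents=[billing_agent],
    tasks=[task],
    process=Process.hierarchical,
    manager_llm="gpt-4o",  # placeholder model name
)

# crew.kickoff() would run the pipeline end to end.
```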

While this approach seems straightforward, we encountered minute-level latency, which falls well short of our target of seconds-level response times. This puzzles us, as CrewAI is considered one of the more established agent frameworks, with numerous production applications.

My questions are:

  • Have others encountered similar latency challenges, especially in function call-oriented applications?
  • What response times have other developers typically seen in their agent-based applications?
  • What might be causing these long processing times in our specific setup?

Agent response times depend on multiple factors, including how much work your agents are actually doing per request.
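If you want to see where the minutes are going, one option is to time each step yourself before blaming the framework. A generic Python sketch (not a CrewAI API; the `timed` decorator, `TIMINGS` dict, and `mock_tool` are made-up names for illustration):

```python
import time
from functools import wraps

# Accumulates wall-clock durations per function, so slow steps stand out.
TIMINGS: dict[str, float] = {}

def timed(fn):
    """Record how long each call to fn takes."""
    @wraps(fn)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return fn(*args, **kwargs)
        finally:
            TIMINGS[fn.__name__] = time.perf_counter() - start
    return wrapper

@timed
def mock_tool():
    time.sleep(0.05)  # stand-in for a real tool or LLM call
    return "ok"

mock_tool()
print({name: f"{secs:.2f}s" for name, secs in TIMINGS.items()})
```

Wrapping each tool (and, if possible, each LLM call) this way usually shows whether the time is spent in model inference, in tool execution, or in the extra routing hops a manager agent adds.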

How many tools are your agents calling? Are they calling an API?

Curious about the use case that requires second-level response times: are you using it in a chatbot?

How many tools are your agents calling? Are they calling an API?

For simplicity in our POC, each specialist agent has access to only one to three function tools, and the manager LLM/agent dispatches queries to at most 10 specialists.

Rather than making actual API calls, we implemented mock functions, using Pydantic for parameter-schema validation. At this stage we're evaluating the system's ability to understand intent and parse parameters, not the accuracy of its responses.
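A mock tool in this style might look like the following. This is a minimal sketch of the pattern, not our actual code: the `WeatherQuery` schema and `get_weather` name are invented for illustration (Pydantic v2 API).

```python
from pydantic import BaseModel, ValidationError

class WeatherQuery(BaseModel):
    """Parameter schema the LLM's function-call arguments must satisfy."""
    city: str
    days: int = 1

def get_weather(raw_args: dict) -> str:
    """Mock tool: validate parsed arguments, return a canned answer."""
    try:
        query = WeatherQuery.model_validate(raw_args)
    except ValidationError as exc:
        return f"invalid arguments: {exc.error_count()} error(s)"
    return f"(mock) forecast for {query.city}, next {query.days} day(s)"

print(get_weather({"city": "Berlin", "days": 3}))
print(get_weather({"days": "not-a-number"}))
```

Since the tool body is a no-op, any residual latency comes from the LLM calls themselves (intent understanding plus parameter extraction), not from tool execution.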

Curious about the use case that requires second-level response times: are you using it in a chatbot?

Yes, we are currently using it in a chatbot.