Patronus agent evaluation API

CrewAI has incorporated the Patronus evaluation system into its platform, but the documentation on the CrewAI docs page is very sparse. Has anyone used this platform in CrewAI and, if so, do you have a simple example of running it with your crew so the agents automatically evaluate their outputs using this new tool?

Thx

Hey @Moe, have you found any answer to this?

Hey Evan, I got the following link from Patronus:

Maybe this can help.

The Patronus docs are outdated - refer to Client Challenge or their GitHub. Once I saw this, using it was a breeze.
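For anyone else landing here, this is roughly what calling the Patronus evaluate endpoint directly looked like for me. Treat the endpoint URL, the `X-API-KEY` header name, and the request field names (`evaluators`, `evaluated_model_input`, `evaluated_model_output`) as assumptions pieced together from their docs and GitHub examples; verify against the current API schema before relying on them.

```python
import json
import os
import urllib.request

PATRONUS_EVALUATE_URL = "https://api.patronus.ai/v1/evaluate"  # assumed endpoint


def build_evaluation_request(task_input: str, task_output: str,
                             evaluator: str = "judge",
                             criteria: str = "patronus:is-concise") -> dict:
    """Build the JSON body for a single evaluation call.

    Field names here are assumptions based on Patronus' public examples;
    check their GitHub for the current schema before relying on them.
    """
    return {
        "evaluators": [{"evaluator": evaluator, "criteria": criteria}],
        "evaluated_model_input": task_input,
        "evaluated_model_output": task_output,
    }


def evaluate(task_input: str, task_output: str) -> dict:
    """POST the request to Patronus and return the parsed JSON response."""
    body = json.dumps(build_evaluation_request(task_input, task_output)).encode()
    req = urllib.request.Request(
        PATRONUS_EVALUATE_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "X-API-KEY": os.environ["PATRONUS_API_KEY"],  # assumed header name
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Requires a valid PATRONUS_API_KEY in the environment.
    print(evaluate("Summarize the report.", "The report covers Q3 revenue."))
```

You can call this from a CrewAI task callback with the task's input and output, which is how I sidestepped the sparse tool docs entirely.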

Do you have an example implementation using CrewAI agents with tasks? I still don’t quite get how to implement it all properly.

For example, the CrewAI docs show where the Patronus tool is used. But how can the evaluation's `pass_`, score, and explanation fields be extracted in code once the whole thing has run, rather than having to log into the platform to get all that?
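In case it helps, once you have the evaluation response as JSON (from the SDK or the raw API), pulling those fields out is just dictionary access. The response shape below (a `results` list with an `evaluation_result` dict holding `pass_`, `score_raw`, and `explanation`) is an assumption based on the field names mentioned in this thread, so print one real response from your account and adjust the keys to match.

```python
def summarize_results(response: dict) -> list[dict]:
    """Flatten a Patronus evaluate response into per-evaluator summaries.

    Assumes each entry under "results" carries an "evaluation_result" dict
    with "pass_", "score_raw", and "explanation" keys -- this shape is an
    assumption, so inspect one real response and adjust the keys.
    """
    summaries = []
    for entry in response.get("results", []):
        result = entry.get("evaluation_result", {})
        summaries.append({
            "evaluator": entry.get("evaluator_id"),
            "passed": result.get("pass_"),
            "score": result.get("score_raw"),
            "explanation": result.get("explanation"),
        })
    return summaries


# Hypothetical response, shaped after the fields discussed above.
sample = {
    "results": [
        {
            "evaluator_id": "judge",
            "evaluation_result": {
                "pass_": True,
                "score_raw": 0.92,
                "explanation": "The answer is concise and on topic.",
            },
        }
    ]
}
print(summarize_results(sample))
```

With this you can log or store the verdicts in your own pipeline instead of reading them off the platform UI.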

Also, is there a list of evaluators and their associated criteria anywhere? The documentation on the Patronus site is very poor!