Hey everyone!
I’ve been digging through the CrewAI source code and docs to understand the training and evaluation features, but I’m a bit confused about how they’re meant to be used.
From what I can see, CrewAI lets you:
-
Collect human feedback using
crew.train()
-
Save training data to JSON files
-
Log some basic evaluation info with
crew.test()
But I’ve got a few questions:
-
What’s the end goal of collecting the training data? I don’t see how it’s actually used to make the agents better
-
Are there any actual metrics for measuring how well a crew is doing?
-
What’s the best way to use the human feedback to improve my crews?
I’ve looked through the code (TaskEvaluator, CrewEvaluator, etc.) but it seems like it’s mostly set up to collect data and log results without doing much with them afterward.
The docs are also pretty vague.
Thanks in advance!