Has anyone here figured out how to accurately track what each customer is actually costing you in LLM and agent execution spend?
I’ve been talking to a handful of founders building agent-powered SaaS on CrewAI and similar frameworks, and this keeps coming up. At small scale it’s manageable, but somewhere around 20-30 customers things seem to get messy fast.
The challenge I keep hearing about is that most observability tools give you visibility at the LLM call level, but not at the “which customer triggered this entire agent run” level. So when you have multi-step crews with tool calls, retries, and sub-agents, stitching that back to a specific customer for billing or margin analysis gets painful.
Curious what people here are actually doing in practice:
-
Are you attributing per-customer AI costs at all, or is it just baked into a flat margin assumption?
-
If you are tracking it, what does your setup look like? Something stitched together, a specific tool, custom instrumentation?
-
Has inaccurate cost attribution ever actually burned you, like a customer segment turning out to be way more expensive than you priced for?
Not looking to sell anything here. I’m genuinely trying to understand whether this is a real operational headache for people shipping agent products, or whether most teams have figured out a reasonable approach I haven’t seen yet.
Would appreciate any honest takes, even if your answer is “we just don’t bother and it’s fine.”