Agent Final Answer is modified before output?

I am experiencing the same issue described in Parsed final answer in AgentFinish / TaskOutput often incomplete · Issue #788 · crewAIInc/crewAI · GitHub.

The issue was closed without resolution, so I am seeking help here.

I am building on the example: crewAI-examples/write_a_book_with_flows at main · crewAIInc/crewAI-examples · GitHub

I can see from the verbose log that my agent constructed the expected output as its Final Answer, which is a markdown-formatted chapter, but the actual task output has content stripped out of the markdown.

# Gather each chapter-writing crew's output once all async kickoffs finish.
chapters = await asyncio.gather(*tasks)
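For context, that line comes from a flow step roughly like the sketch below, based on the write_a_book_with_flows example (the import path, class name, and field names are my paraphrase and may not match the repo exactly):

import asyncio

from write_book_chapter_crew import WriteBookChapterCrew  # hypothetical import path

async def write_chapters(outline):
    # Kick off one chapter-writing crew per outline entry, concurrently.
    tasks = [
        WriteBookChapterCrew()
        .crew()
        .kickoff_async(
            inputs={
                "chapter_title": entry.title,
                "chapter_description": entry.description,
            }
        )
        for entry in outline
    ]

    # Each result is a CrewOutput; the (possibly stripped) markdown is in .raw.
    chapters = await asyncio.gather(*tasks)
    return [chapter.raw for chapter in chapters]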

Does anyone know how to prevent this from happening? Is this an output-formatting issue or a prompting issue (e.g., something in tasks.yaml)?

expected_output: >
  A markdown-formatted chapter of around 5,000 words that covers the
  provided chapter title and outline description.

It is frustrating because I can verify the desired “Final Answer” in the agent’s verbose log, but the task just keeps returning content with pieces of the markdown missing.

Any help will be appreciated.

@Jeff_Chen If you’re using a tool for the task, you can force the tool output to be returned as the task result by setting the result_as_answer parameter to True. As stated in the docs:

This parameter ensures that the tool output is captured and returned as the task result, without any modifications by the agent.

Here’s an example from the docs:

from crewai.agent import Agent
from my_tool import MyCustomTool

# result_as_answer=True makes the tool's raw output be captured as the task
# result, without any rewriting by the agent.
coding_agent = Agent(
    role="Data Scientist",
    goal="Produce amazing reports on AI",
    backstory="You work with data and AI",
    tools=[MyCustomTool(result_as_answer=True)],
)

Hi @rokbenko,

I am not using a tool at the moment, but I was contemplating whether I could define a simple tool that just passes the Final Answer through from the agent, with result_as_answer=True, to fix this problem. It’s a little kludgy.

I did compare the outputs of gpt-4o-mini vs. gpt-4o (2024-08-06). The latter got it about 95% correct: one markdown tag was not output correctly, but the rest came through intact.

This suggests the problem is, at least in part, the instruction-following capability of the LLM behind the agent?

This was exactly my thought too! :sweat_smile:

Absolutely! The choice of LLM can significantly impact the quality of the final answer. In a past project, I switched from Anthropic’s Haiku to Sonnet, and the results improved dramatically, even though the agent and task YAML configurations stayed exactly the same.

If using a cheaper (i.e., less capable) LLM saves a lot of money and doesn’t really lower the quality of the final answer (since the agent only strips some parts of the markdown), it might be worth creating a simple tool that just passes the final answer through and setting its result_as_answer parameter to True, roughly as sketched below. You’d just need to make sure the agent always uses this tool.
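Here’s a rough sketch of what I mean, assuming the BaseTool interface from crewai.tools (the tool name, schema, and agent fields below are just illustrative, not taken from your project):

from crewai import Agent
from crewai.tools import BaseTool
from pydantic import BaseModel, Field

class PassthroughInput(BaseModel):
    content: str = Field(..., description="The complete final markdown chapter, verbatim.")

class FinalAnswerPassthrough(BaseTool):
    name: str = "Final Answer Passthrough"
    description: str = (
        "Pass the complete final markdown chapter to this tool. "
        "It returns the content unchanged."
    )
    args_schema: type[BaseModel] = PassthroughInput

    def _run(self, content: str) -> str:
        # Return the chapter untouched; with result_as_answer=True this value
        # becomes the task output without further rewriting by the agent.
        return content

writer_agent = Agent(
    role="Chapter Writer",
    goal="Write complete, markdown-formatted chapters",
    backstory="You write long-form content and never truncate it.",
    tools=[FinalAnswerPassthrough(result_as_answer=True)],
)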

I tried the approach of passing the content to a “null custom tool” and then forcing the result as the answer, but the agent passed in its self-reflection about the content rather than the actual content.

This is most frustrating, but at least the usable output is in the verbose log as the “Final Answer”; I am just having to harvest it by hand at the moment.
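To avoid copying it out of the log by hand, I am also thinking about capturing it with a step_callback on the agent, something like the rough sketch below (the attribute names on the step object are guesses on my part, since they seem to differ between crewAI versions):

from crewai import Agent

captured_answers = []

def capture_final_answer(step):
    # Keep only steps that look like a final answer; duck-type the attributes
    # because the step object's shape varies across crewAI versions.
    if type(step).__name__ == "AgentFinish":
        raw = getattr(step, "output", None) or getattr(step, "text", None)
        if raw:
            captured_answers.append(raw)

writer_agent = Agent(
    role="Chapter Writer",
    goal="Write complete, markdown-formatted chapters",
    backstory="You write long-form content.",
    step_callback=capture_final_answer,
)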

The output of gpt-4o turns out not to be consistent: the first output I examined was very close, but then chunks started getting dropped between the “Final Answer” and the actual agent output. This is very strange. I am experimenting with my task’s expected_output definition but have not gotten the result I’m after yet.