Demonstrating the `AgentEval` framework using the task of solving math problems as an example
AgentEval: a multi-agent system for assessing utility of LLM-powered applications
AgentEval: a multi-agent system for assessing utility of LLM-powered applications