AI Agents: From Architecture to Production · Lesson 7
Evaluating Agents
Trajectory eval, tool-use eval, final output eval, LLM-as-judge. The ragas framework and custom evaluators.
Trajectory eval, tool-use eval, final output eval, LLM-as-judge. The ragas framework and custom evaluators.