Prompt Evaluations: Anthropic's Official Course · Lesson 4
Code-Graded Classification Evals
Evaluating a multi-category classification prompt. Set-based grading for multiple correct categories. Improved prompt with few-shot examples: from 85% to 100% accuracy.
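To make the set-based grading idea concrete, here is a minimal sketch and not the lesson's actual code. It assumes each test case stores the set of acceptable categories and that a prediction counts as correct when it matches any one of them. The function names `grade` and `accuracy`, the category labels, and the sample data are all hypothetical.

```python
from typing import Iterable, Set


def grade(predicted: str, acceptable: Set[str]) -> bool:
    """Return True if the predicted label is one of the acceptable labels.

    Assumption: labels are compared case-insensitively after trimming whitespace.
    """
    return predicted.strip().lower() in acceptable


def accuracy(predictions: Iterable[str], golden: Iterable[Set[str]]) -> float:
    """Fraction of predictions that fall inside their acceptable set."""
    results = [grade(p, g) for p, g in zip(predictions, golden)]
    return sum(results) / len(results)


# Hypothetical golden answers: the second case accepts either of two categories.
golden = [{"billing"}, {"billing", "account"}, {"technical"}, {"shipping"}]

# Hypothetical model outputs for the same four inputs.
predictions = ["Billing", "account", "shipping", "shipping"]

score = accuracy(predictions, golden)
print(f"Accuracy: {score:.0%}")
```

Storing the golden answers as sets keeps the grader a single membership check, so an input with several valid categories does not need special handling.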