39. Evaluate outputs before users do
Generative AI evaluation must judge usefulness, truthfulness, safety, style, latency, and cost. This chapter covers human review, rubrics, test sets, model-graded evaluation, benchmark limits, A/B tests, and regression testing.