22. Safety alignment and refusal behavior
Handle refusals, unsafe requests, sensitive topics, and policy boundaries with more than a vague ethics checklist. You will cover safety datasets, red-team prompts, harmlessness tuning, over-refusal, and behavior testing.