Tutorial at IEEE AI Test 2025 on deploying evaluations for generative AI systems with real-world rigor. Covering system context, measurement and monitoring, agent behavior testing, and evaluation frameworks. Co-taught with Heather Frase, PhD and Sarah Luger, PhD.