How to evaluate agents in production | DailyDevLists