From Vibes to Validation: How To Evaluate LLMs and Agents | DailyDevLists