Loading video player...
AI agents are hard to deploy, and their actions are often opaque. In this joint AWS × Weights & Biases session, we’ll demonstrate how to use Amazon Bedrock AgentCore to easily deploy agent prototypes to production in a scalable and safe manner. We’ll also show you how to use it with W&B Weave to unify real-time tracing and performance views across development and production, providing you with clear visibility into agent behavior and decision-making from a single view. Finally, we’ll close the loop: turn production traces filtered by user/expert feedback into evaluation datasets, use Weave evals to measure what matters, and promote improved versions with governance and lineage. Walk away with a mental model for building trustworthy, auditable agents—from your prototype to a repeatable “observe → evaluate → improve → release” flywheel on AWS.