Session Details: Google Cloud Next 2026


Day 3 – April 24, 2026

Breakouts

BRK3-023 • Agents, Gemini, Gemini Enterprise Agent Platform • Advanced Technical

Engineer the agent-quality flywheel: Using Gemini Enterprise Agent Platform evaluations to optimize agents

South Seas B

11:00 AM - 11:45 AM

Treating agent quality as a rigorous engineering discipline is the only way to scale. Stop guessing and start measuring. Join this session for a deep dive into the state-of-the-art techniques we use at Google to build our own agents, and learn how to make them part of your process. We’ll demonstrate how you can adopt the “Quality Flywheel” methodology, which includes: bootstrapping effective offline evaluation with synthetic test generation; using LLM-as-a-judge autoraters and trajectory evaluations; performing user and environment simulation; identifying systemic failures in production with multi-turn autoraters and loss-pattern clustering; aligning test coverage with actual usage; and using automated optimization capabilities to scientifically refine performance. Ramp up with confidence.

