Session Details: Google Cloud Next 2026

phase 7 playlists session info modal

Day 3 – April 24, 2026

Breakouts

BRK2-116 • Kubernetes, Cloud Runtimes • Technical

Beyond fine-tuning: The next frontier of training and RL on GKE

Surf C

11:00 AM - 11:45 AM

Large-scale reinforcement learning (RL) creates infrastructure bottlenecks in the sampling and training cycle. Join this session to explore Mistral AI’s RL strategies and Anyscale’s high-performance Ray on Google Kubernetes Engine (GKE). We’ll analyze GKE primitives for faster RL loop times, focusing on sampling, weight transfer, and sandboxing for isolation. Leave with recommendations for cluster resilience against hardware failures and preemptions, validated through RL on smaller models and large-scale mixture-of-experts (MoE) architectures.

Session Details

Beyond fine-tuning: The next frontier of training and RL on GKE

Related Sessions