Breakouts
BRK2-060 • Serverless, Open Models • Technical
Build AI architectures with custom models on Cloud Run
location_on
Mandalay Bay H
schedule
4:00 PM - 4:45 PM
Modern AI architectures use specialized and open models to deliver high-performance features with greater efficiency. Join this session to explore how the Cloud Run serverless GPU architecture provides a scalable, cost-effective runtime, without the complexity of having to manage clusters. We’ll demonstrate a repeatable workflow for hosting custom models on serverless GPU instances, and show you how to harness the power of the NVIDIA RTX PRO 6000 GPU to scale your AI with the simplicity and pay-per-use efficiency of Cloud Run. We’ll also cover model fine-tuning with Cloud Run jobs.
Read more