Architecting Multi-tenant Data-center Networks for GPU Customers

Generative AI is revolutionizing cloud data centers, pushing the limits of what is possible in computing. While the industry already knows how to virtualize the regular data-center networks, virtualizing the GPU networks in a cloud introduces new challenges. In our talk we will share Google’s architecture, how we create cutting-edge cloud data centers tailored for GenAI workloads, and the experience with our choice of GPU NIC and its SDK to ensure exceptional performance, scalability, efficiency, security, operability and seamless integration with existing systems.

To help personalize content, tailor and measure ads, and provide a safer experience, we use cookies. By clicking or navigating the site, you agree to allow our collection of information on and off Facebook through cookies. Learn more, including about available controls: Cookies Policy