September 27, 2023

Large Language Models for Automatic Cloud Incident Management

Topic: Systems and Networking

Rujia Wang

Microsoft

TYPE: Videos

YEAR: 2023

Building reliable hyper-scale cloud services can be challenging. We need to quickly detect, analyze and mitigate incidents, which largely rely on human effort today. Recent breakthroughs in Large-Language Models (LLMs) have motivated us to explore their potential for automated incident diagnosis. By leveraging LLMs, we aim to accelerate the incident resolution process, leading to improved service reliability and better customer experience. For the first time, we have demonstrated the effectiveness of LLMs in improving cloud reliability. In this talk, we will share our findings, research innovations, and visions in this space.

SUBSCRIBE TO @SCALE

← Back

Large Language Models for Automatic Cloud Incident Management

Rujia Wang

TYPE: Videos

YEAR: 2023

SUBSCRIBE TO @SCALE

Thank you for your response. ✨

RECENT POSTS

RELATED POSTS