We then changed gears to a topic that is relevant to many: how to manage resources within a large-scale compute cluster in a way that optimizes system resources, all while balancing the competing needs of the various processing jobs. Microsoft’s Big Data team has embraced Apache YARN and is building around YARN as the resource manager in its clusters. In this talk, Sriram Rao described the team’s work in building a YARN-based, scale-out architecture. The system is self-configuring and tolerates failures of subcomponents while providing continued availability. This work has been contributed back to Apache YARN and ships with various Hadoop distributions.
- WATCH NOW
- 2024 EVENTS
- PAST EVENTS
- 2023
- 2022
- February
- RTC @Scale 2022
- March
- Systems @Scale Spring 2022
- April
- Product @Scale Spring 2022
- May
- Data @Scale Spring 2022
- June
- Systems @Scale Summer 2022
- Networking @Scale Summer 2022
- August
- Reliability @Scale Summer 2022
- September
- AI @Scale 2022
- November
- Networking @Scale Fall 2022
- Video @Scale Fall 2022
- December
- Systems @Scale Winter 2022
- 2021
- 2020
- 2019
- 2018
- 2017
- 2016
- 2015
- Blog & Video Archive
- Speaker Submissions