Networks (graphs) of people’s social and content interactions are a rich source of data for machine learning algorithms. Traditional machine learning algorithms do not naturally take graph-structured data as input, so unsupervised methods such as graph embeddings are used to turn graph data into features that can be used for machine learning tasks. However, modern interaction graphs, particularly in industrial applications, contain billions of nodes and trillions of edges, a scale that exceeds the capability of typical embedding systems. In this talk, I will describe the techniques that PyTorch-BigGraph (PBG) uses to scale graph embedding methods to graphs of this size. I will also discuss new work on applying the PBG philosophy to achieve further scaling on GPUs, and how we are combining graph embeddings with graph neural network models on these extremely large graphs.