Boston Data @Scale 2018 – Leveraging Sampling to Reduce Data Warehouse Resource Consumption
Gabriela Jacques Da Silva and Donghui Zhang, software engineers at Facebook, discuss how they use sampling to compute analytical dashboards in cases where the approximation produces negligible visual differences in the graphs. They cover the challenges this poses for approximate computation, such as the need to account for uncertainty propagation when calculating aggregated metrics. They also show the resulting reduction in resource consumption, in both compute and storage.
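As a rough illustration of what uncertainty propagation can mean for an aggregated metric, the sketch below estimates a total from a uniform row sample and then carries the estimation error into a ratio of two such totals. It is not taken from the talk: the function names, the Bernoulli (per-row coin flip) sampling scheme, the Horvitz-Thompson estimator, and the first-order (delta method) formula that treats the two estimates as independent are all assumptions chosen for simplicity.

```python
import math
import random


def bernoulli_sample(values, rate, rng):
    """Keep each row independently with probability `rate` (assumed sampling scheme)."""
    return [v for v in values if rng.random() < rate]


def estimate_sum(sampled_values, rate):
    """Scale a sampled sum up to the full table and estimate its variance.

    Horvitz-Thompson estimator for Bernoulli sampling:
      total    = sum(x) / rate
      variance = (1 - rate) / rate**2 * sum(x**2)
    """
    total = sum(sampled_values) / rate
    variance = (1.0 - rate) / rate**2 * sum(v * v for v in sampled_values)
    return total, variance


def ratio_with_uncertainty(num, num_var, den, den_var):
    """Propagate uncertainty into num / den.

    First-order approximation, assuming the two estimates are independent:
      (std / ratio)**2 ~= num_var / num**2 + den_var / den**2
    """
    ratio = num / den
    rel_var = num_var / num**2 + den_var / den**2
    return ratio, abs(ratio) * math.sqrt(rel_var)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    # Hypothetical data: two independent tables with 10,000 rows each.
    revenue = [rng.uniform(1.0, 100.0) for _ in range(10_000)]
    sessions = [rng.uniform(1.0, 10.0) for _ in range(10_000)]

    rate = 0.1  # read roughly 10% of the rows
    rev_total, rev_var = estimate_sum(bernoulli_sample(revenue, rate, rng), rate)
    ses_total, ses_var = estimate_sum(bernoulli_sample(sessions, rate, rng), rate)

    ratio, ratio_std = ratio_with_uncertainty(rev_total, rev_var, ses_total, ses_var)
    print(f"estimated revenue per session: {ratio:.3f} +/- {ratio_std:.3f}")
    print(f"exact revenue per session:     {sum(revenue) / sum(sessions):.3f}")
```

If the numerator and denominator come from the same sampled rows, the independence assumption no longer holds and a covariance term would be needed, which is one reason propagation deserves care when metrics are built from several aggregates.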