TOPIC: Mobile, Video and Web

Video @Scale 2021

NOVEMBER 03, 2021 @ 9:00 AM PDT - 12:35 PM PDT
NOVEMBER 04, 2021 @ 9:30 AM PDT - 12:30 PM PDT

Video @Scale is an invitation-only technical conference for engineers that develop or manage large-scale video systems serving millions of people. The development of large-scale video systems includes complex, unprecedented engineering challenges. The @Scale community focuses on bringing people together to discuss these challenges and collaborate on the development of new solutions.


Event times below are displayed in PT.

November 3

November 4

09:00 AM - 09:35 AM

Hear from Paresh Rajwat, the Vice President and Head of Product for Facebook Audio, Video, and Music.

Speaker Paresh Rajwat, Facebook
09:35 AM - 10:05 AM
Measuring the Reliability of Live Video Infrastructure

Serving live video with high reliability is challenging, not only from the perspective of deploying improvements on top of a distributed system but also from the perspective of defining correct measurements to capture reliability gaps that matter to users. Facebook’s Live platform spans from the ingest endpoints where creators upload their streams, to services in charge of transcoding and generating multiple video renditions, to services executing delivery on the egress stack and the CDN endpoints that serve broadcasts to millions of viewers. All of these pipelines have to work in concert with real-time guarantees, and even a single failure can be severe for our users. In this talk, Petar will show how we evolved our thinking about key reliability metrics over time, and how we derive actionable insights to make Facebook Live as reliable as possible from the user’s perspective.

Speaker Petar Zuljevic, Facebook
10:05 AM - 10:35 AM
Performance of Low-Latency DASH/CMAF and Low-Latency HLS Systems

Reducing end-to-end streaming latency is critical for HTTP-based live video streaming. There are currently two new technologies in this domain: Low-Latency HTTP Live Streaming (LL-HLS) and Low-Latency Dynamic Adaptive Streaming over HTTP (LL-DASH). Several streaming players (Apple's AVPlayer, Google's Shaka Player, HLS.js, DASH.js, etc.) and encoding and packaging tools (Apple's HTTP Live Streaming Tools, FFmpeg, GPAC, etc.) have recently added support for these formats. There are also several live LL-HLS and LL-DASH reference/demo streams available, showcasing the capabilities of these technologies and helping the implementation community to adopt them. However, how well such systems perform in real-world deployment scenarios is not well known. Specifically, not much is known about the performance of such low-latency systems when delivering to mobile devices connected over rapidly changing and often highly loaded cellular networks. In this talk, Yuriy will review some results obtained in this direction by the Brightcove Research team. Our study consists of a series of live LL-HLS and LL-DASH streaming experiments. For each experiment, we use the same video content, encoder, and encoding profiles, and identical network conditions emulated by replaying traces of existing cellular networks (Verizon 4G LTE, T-Mobile 4G LTE, etc.). Several key performance metrics are captured and reported in each experiment: the average streaming bitrate, the overall amount of media data downloaded, the intensity of data requests, streaming latency, playback speed variations, buffering, stream switching statistics, etc. These results are subsequently analyzed and used to characterize some typical limits of LL-HLS and LL-DASH-based systems and the differences between them.

Speaker Yuriy Reznik, Brightcove
10:35 AM - 11:05 AM
Highly Available Live Encoding Using Go and Kubernetes

Two years ago, iStreamPlanet set out to build a cloud-native software transcoder with the reliability and feature set to support some of the highest-profile live channels and events in the world. Some of our goals included: four or more nines of uptime; as little C/C++ code as possible; the ability to run thousands of live channels without human oversight; support for advanced features like SCTE-35 signaling and segmentation, hitless merging of redundant video sources, and Dolby Digital Plus surround sound encoding; and the ability to update the software on running channels without customer impact. We’ll explain how we leveraged the power of Go, Kubernetes, and a mix of commercial and OSS components to make that vision a reality. The result powers the OTT distribution of over 1,000 live linear TV channels running 24/7, and has been used to deliver the streams for March Madness 2021, the Tokyo Olympic Games, the UEFA Champions League, and more.
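
To put the uptime goal in concrete terms, the short sketch below converts an availability target expressed in nines into a yearly downtime budget. It is illustrative arithmetic only; the function name and the choice of a 365-day year are assumptions made here, not details from the talk.

```python
# Illustrative arithmetic only: yearly downtime budget for an availability target.
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, assuming a non-leap year


def allowed_downtime_minutes(nines: int) -> float:
    """Return the yearly downtime budget, in minutes, for N nines of availability."""
    unavailability = 10 ** (-nines)  # e.g. four nines -> 0.0001
    return MINUTES_PER_YEAR * unavailability


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(f"{n} nines: {allowed_downtime_minutes(n):.2f} minutes per year")
```

Under these assumptions, four nines leaves roughly 52.56 minutes of downtime per year, which suggests why updating software on running channels without customer impact appears among the goals.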

Speaker Justin Mason, iStreamPlanet
11:05 AM - 11:35 AM
Smart Crop and Smart Preview via Video Understanding

Understanding video content has been a focus for video-sharing platforms. It is one of the most important driving forces for the growth in distribution, discovery, user experience and monetization. Instream video understanding is the technology area where we analyze and utilize finer granularity video signals in the spatial and the temporal domains. The fine-grained spatial and temporal signals can be used for consumer facing products or used as signals for downstream models and pipelines. For example, in the spatial domain, we identify the salient regions inside each frame, which enables a system to automatically reframe a horizontal (landscape) video into a vertical (portrait) one. In the temporal domain, we identify the highlight score of each frame, which enables us to identify the highlight moments inside a video and create a video trailer.

Speaker David Liu, Facebook
Speaker Priyam Chatterjee, Facebook
11:35 AM - 12:05 PM
Media Provenance as a Prevention for Malicious Synthetic Media

Malicious synthetic media (both deepfakes and cheapfakes) are rising in prevalence and importance. End users are rapidly losing trust in media, and their ability to tell authentic media from inauthentic has greatly diminished. This talk will cover the dangers of synthetic media in the video ecosystem, approaches to combating malicious synthetic media, and an overview of how media provenance techniques, as developed by the Coalition for Content Provenance and Authenticity (C2PA), can be used to provide media trust signals to end users.

Speaker Andrew Jenks, Microsoft
12:05 PM - 12:35 PM
Behind the Curtains: A/B Tests in Video Land

A/B testing on video isn’t just about tweaking recommendations or picking the perfect thumbnail. Every aspect of video benefits from rapid experimentation, including the infrastructure: streaming algorithms, codecs, bitrates, caching strategies, and network congestion control algorithms. Join us for a behind-the-scenes tour from practitioners on how experimentation helps us learn what users want. We’ll share experiment results that confounded us, offer insight into the infrastructure that enables experimentation for everything, and bust common myths that cause people to think experimentation isn’t for them. If you care about making video better, there’ll be something for you!

Speaker Vineeth Madhusudanan, Statsig

November 4

09:30 AM - 10:00 AM
ASIC-RDX and Compute-Compression Efficiency Optimization for UGC Video Processing

Facebook and other user-generated content (UGC) platforms encode videos at “billion-scale” and deliver them worldwide to a variety of devices (mobile, laptop, TV) across different networks. The popularity of UGC videos varies widely, from millions of views for a very popular viral video to relatively few views for a privately shared video. Improving compression efficiency is critical both to reducing data-usage costs for end users and infrastructure providers and to providing good quality of experience on low-bandwidth networks. At the same time, because of this “billion-scale,” we need to take into account the computational complexity of the encoder as well as device support for the codec. In this presentation, we show how ASIC-leveraged hybrid RDX plays a central role in maximizing compression efficiency for video delivery while meeting the constraints of codec support and available compute. We also present an optimization framework that allows us to quantify the compute impact of changes in device support for advanced codecs and in video watch-time distribution, and we point out ASIC improvements that can further improve overall compute efficiency.

Speaker Shankar Regunathan, Facebook
Speaker Srinath Reddy, Facebook
10:00 AM - 10:30 AM
Optimizing Storage Efficiency for FB Video Processing

Like the rest of the video world, Facebook video has grown significantly year over year. While we celebrate the growth rate, we are also concerned about the resource consumption needed to support that growth, which became worse during COVID. Seeing the gap between increased storage demand and supply, video infra has worked with Facebook's capacity team to invent new methods to bend the curve. We establish a source + MVE (minimum variable encoding) storage policy for every FB video. Then, through a video lifecycle management system, we ensure that the storage consumption of a video is proportional to its popularity. For example, we purge unused encodings and reduce the source from two permanent copies to one if the video becomes “cold.” With this technology, 70 percent of Facebook video's projected storage growth for 2021 has been successfully suppressed. We are exploring ways to further improve the performance and accuracy of the storage management system with a cost-benefit model.
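
The two lifecycle actions named in the abstract (purging unused encodings and dropping a cold video's source from two copies to one) can be sketched as a simple policy function. This is a minimal illustration, not Facebook's system: the `Video` record, the `"mve"` label, and the view threshold are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass, field

# Hypothetical values for illustration; the abstract does not give thresholds.
COLD_VIEW_THRESHOLD = 10      # at or below this many recent views, a video is "cold"
MINIMUM_ENCODINGS = {"mve"}   # hypothetical label for the encoding that is always kept


@dataclass
class Video:
    source_copies: int = 2                          # permanent copies of the source
    encodings: dict = field(default_factory=dict)   # encoding name -> recent views
    recent_views: int = 0


def apply_lifecycle(video: Video) -> Video:
    """Shrink a video's storage footprint according to its recent popularity."""
    # Purge encodings nobody watched recently, but always keep the minimum set.
    video.encodings = {
        name: views
        for name, views in video.encodings.items()
        if views > 0 or name in MINIMUM_ENCODINGS
    }
    # A cold video keeps a single source copy instead of two.
    if video.recent_views <= COLD_VIEW_THRESHOLD:
        video.source_copies = min(video.source_copies, 1)
    return video


if __name__ == "__main__":
    v = Video(encodings={"mve": 0, "1080p": 0, "720p": 5}, recent_views=3)
    print(apply_lifecycle(v))
```

A production policy would presumably weigh the cost of regenerating a purged encoding against the storage saved, which is the kind of cost-benefit model the abstract mentions as future work.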

Speaker Jin Zhang, Facebook
Speaker Jamie Chen, Facebook
10:30 AM - 11:00 AM
Measuring Video Quality Using AVQT

AVQT, short for Advanced Video Quality Tool, is a macOS-based command-line tool that estimates the perceptual quality of compressed videos that might contain video coding and scaling artifacts. Utilizing the AVFoundation framework, AVQT supports a wide range of video formats, codecs, resolutions, and frame rates in both the SDR and HDR domains, which results in easy and efficient workflows; for example, there is no requirement to decode to a raw pixel format. AVQT uses Metal to achieve high processing speeds by offloading heavy pixel-level computation to the GPU, typically analyzing videos faster than real-time video frame rates. In this talk, we'll cover key attributes of AVQT that make it useful across applications. We'll also demo how to use the tool and interpret the obtained scores correctly.

Speaker Pranav Sodhani, Apple
11:00 AM - 11:30 AM
Video Quality Assessment of User Generated Contents

The video quality of user-generated content (UGC) is extremely difficult to wrangle because of the high diversity of its content and quality. UGC brings new challenges to how we have traditionally measured and assessed video quality. Most videos uploaded to YouTube and other video-sharing platforms are UGC. To facilitate and encourage research in UGC compression and quality assessment, in 2019 we released a large-scale UGC dataset (YT-UGC) that contained representative raw UGC videos along with their ground-truth Mean Opinion Score (MOS), Differential MOS (DMOS), and content labels. In parallel, we have been pursuing a number of efforts to analyze and optimize UGC video quality. Recently, we built a novel deep-learning-based framework to understand the importance of content, technical quality, and compression level to perceptual quality. In this talk, we will walk through our video analysis framework and our latest DNN-based video quality metric, called YouVQ, and present some results.

Speaker Balu Adsumilli, YouTube
Speaker Yilin Wang, YouTube
11:30 AM - 12:00 PM
Scaling Your Encoding Backend Using Eve

In this talk, we will discuss the current state in terms of bitrate/quality and complexity of Two Orioles’ Eve video encoder for the VP9 & AV1 video codecs. VP9 provides meaningful quality improvements over H.264 with a mature ecosystem and good target platform support. AV1, on the other hand, provides far superior quality, but with an ecosystem that is still being actively constructed. For both codecs, Eve provides a wide range of speed presets to choose from in terms of bitrate/quality vs. computational complexity, the fastest of which at this moment approach x264 preset=medium / x265 preset=superfast in terms of complexity, but with a better bitrate vs. quality trade-off. Such speed presets allow encoding of high-volume UGC without compression being a cost barrier, thus enabling broad deployment of modern video codecs at scale.

Speaker Ronald S. Bultje, Two Orioles
12:00 PM - 12:30 PM
Highly Efficient SVT-AV1-based Solutions for VOD Applications

This presentation will highlight the latest improvements of the VOD-targeted high-latency Constant Rate Factor (CRF) and Variable Bit Rate (VBR) modes of the SVT-AV1 encoder. It will first present the latest SVT-AV1 cycles-quality tradeoffs when applied in CRF mode to shot-based convex-hull-optimized encoding for a wide range of VOD applications. It will then demonstrate the performance of VBR encoding of a bitrate-resolution ladder that had been developed as a result of an offline statistical analysis of the parameter values generated via convex-hull CRF-mode encoding of a large dataset. The presentation concludes by presenting a highly efficient solution for reducing the cycles-cost of the Dynamic Optimizer (DO) encoding framework by using the fastest-preset SVT-AV1 encoder to generate the convex-hull encoding parameter values, which are then used to produce the final bitstreams using the desired encoder.
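
For readers unfamiliar with convex-hull encoding, the sketch below shows the generic selection step: given (bitrate, quality) points measured from trial encodes at several resolutions and CRF values, keep only the points on the upper convex hull. This is a textbook monotone-chain hull written for illustration, not SVT-AV1 or Dynamic Optimizer code; the function names and the sample numbers are hypothetical.

```python
def _cross(o, a, b):
    """Cross product of vectors OA and OB; negative means a clockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def upper_convex_hull(points):
    """Return the (bitrate, quality) points on the upper convex hull.

    Points are trial-encode results; only encodes that are not beaten by a
    mix of their neighbours, and that still raise quality, are kept.
    """
    # Keep the best quality seen at each bitrate, then sort by bitrate.
    best_by_rate = {}
    for rate, quality in points:
        if rate not in best_by_rate or quality > best_by_rate[rate]:
            best_by_rate[rate] = quality
    pts = sorted(best_by_rate.items())

    hull = []
    for p in pts:
        # Drop the previous point while it lies on or below the line to p.
        while len(hull) >= 2 and _cross(hull[-2], hull[-1], p) >= 0:
            hull.pop()
        hull.append(p)

    # Discard the tail where spending more bits no longer improves quality.
    peak = max(range(len(hull)), key=lambda i: hull[i][1])
    return hull[: peak + 1]


if __name__ == "__main__":
    # Hypothetical (kbps, quality score) measurements.
    trials = [(100, 30), (200, 40), (250, 35), (300, 42),
              (400, 48), (500, 49), (600, 47)]
    print(upper_convex_hull(trials))
```

With the sample data, the encodes at 250, 300, and 600 kbps are discarded because other operating points give better quality for the bits spent.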

Speaker Nader Mahdi, Intel
Speaker Amir Naghdinezhad, Intel
Speaker Hassene Tmar, Intel



