Meta’s AI Translations Inference

Sravan Rekula

Meta

Jordi Cenzano

Meta

Amisha Jaiswal

Meta

TOPIC: Mobile, Video and Web

@SCALE SERIES: Mobile, Video and Web

TYPE: video

YEAR: 2024

TAGS:

In this talk we will show how we implemented a media processing pipeline to perform (autodub / lipsync) media inference at Meta scale.

We will focus on the challenges we faced from a media processing / scaling point of view, such as: inference latency and scheduling, voice isolation, media timing/alignment, alternate tracks delivery, instrumentation, model evaluation, etc.

SUBSCRIBE TO @SCALE

Your message has been sent

RECENT POSTS

Eliminating the Awkward Pause: Ultra-Low Latency Connect (ULLC) How Meta Deployed Super Resolution at Scale to Transform Video Quality Advances in Audio Real-time Communication for Natural and Interactive Conversational AI

RELATED POSTS

Eliminating the Awkward Pause: Ultra-Low Latency Connect (ULLC) How Meta Deployed Super Resolution at Scale to Transform Video Quality Advances in Audio Real-time Communication for Natural and Interactive Conversational AI

To help personalize content, tailor and measure ads, and provide a safer experience, we use cookies. By clicking or navigating the site, you agree to allow our collection of information on and off Facebook through cookies. Learn more, including about available controls: Cookies Policy