TOPIC: Dev Tools and Ops, Privacy, Sustainability and Performance

Performance @Scale 2019

JUNE 20, 2019 @ 8:30 AM PDT - 5:30 PM PDT
Designed for engineers working on the technical and organizational challenges of high-performance applications and services. @Scale is a series of technical conferences hosted by FB and Instagram for engineers building applications and services that scale to millions or even billions of people.


Performance @Scale is an invite-only conference for engineers working on the technical and organizational challenges of high-performance applications and services.

If you have ever wanted to learn best practices from the pros on how to detect performance anomalies, scale your web service, or speedup your mobile apps, then Performance @Scale is the place to be on Thursday, June 20th, 2019! If you have friends or colleagues who may also be interested in attending, feel free to forward them this invitation.

Performance @Scale will be held on Facebook’s campus in Menlo Park, California. Registration and breakfast starts at 8:30 a.m. The Women in Technology panel will be held at at 9:00 a.m. and talks begin at 10 a.m. Stick around after the day-long conference for Happy Hour.


Learn more about @Scale events. and follow us on Facebook for updates.


Event times below are displayed in PT.

June 20

08:30 AM - 10:00 AM
Registration and Breakfast
09:00 AM - 10:00 AM
Women in Tech Breakfast and Panel Discussion
10:00 AM - 10:30 AM
Keynote: The Facebook App Journey

To kick off the event, Surupa takes us through a tour of how performance is done in Facebook’s apps. She describes how product teams and central performance teams work together to improve app size, startup times, crash rates, and more. Through better tools and partnerships, the teams have scaled to more than 150 metrics across multiple apps and platforms.

Surupa Biswas is an Engineering Director overseeing Mobile app Performance, Reliability and Efficiency in Facebook’s Developer Infrastructure group. Since joining Facebook in 2013 she has overseen server and mobile infrastructure teams, previously leading teams building Facebook’s open source web server infrastructure - HHVM and GraphQL

10:30 AM - 11:05 AM
Performance Analysis of Facebook AI Workloads on Accelerated Platforms

In this talk, we describe our top-down methodology for uncovering inefficiencies in our production AI workloads, the tools and technologies we’ve built to support performance analysis, and the common pitfalls in optimizing accelerated code. Our tools and techniques are being used by thousands of ML engineers at Facebook on products that serve billions of users.

Kim Hazelwood is an Engineering Manager leading the AI Infra Foundation and AI Infra Research efforts at Facebook, which focus on the hardware and software platform design and efficiency for Facebook's many applied machine learning-based products and services. Prior to Facebook, Kim held positions including a tenured Associate Professor at the University of Virginia, Software Engineer at Google, and Director of Systems Research at Yahoo Labs. She received a PhD in Computer Science from Harvard University in 2004, and is the recipient of an NSF CAREER Award, the Anita Borg Early Career Award, the MIT Technology Review Top 35 Innovators under 35 Award, and the ACM SIGPLAN 10-Year Test of Time Award. She currently serves on the Board of Directors of CRA, MIT SystemsThatLearn, and EPFL EcoCloud. She has authored over 50 conference papers and one book.

Speaker Kim Hazelwood,Facebook
11:05 AM - 11:40 AM
Scaling ML models on Google's TPUs

Tensor Processing Units are Machine Learning accelerators developed at Google. A TPU v3 Pod offers over 100 PFLOPs of compute, leading to dramatic reductions in training time of Machine Learning models. In this talk, we will explore some of the scalability challenges, often not unique to TPUs, and techniques to address those challenges.

Naveen Kumar is a Software Engineer at Google. He currently leads Performance within Google Brain. Previously, Naveen worked on Google's second generation Tensor Processing Units. Prior to Google, Naveen focused on microprocessor research at Intel Labs. Naveen holds a PhD from University of Pittsburgh and enjoys outdoor life in the Bay Area.

Speaker Naveen Kumar,Google
11:40 AM - 12:15 PM
Scaling Deep Learning Workloads on GPUs

The computational size, complexity and footprint of neural network training has been doubling about every 3.5 months, according to OpenAI. As well, the amount of data used for training has been increasing, for instance as researchers are able to take advantage of unsupervised training methods as in BERT. These researchers now require multiple systems for training their models (a trend similar to scientific simulations on HPC systems in the past). This talk will discuss the techniques needed for running deep learning training at scale on GPUs, and state of the art results. The discussion will also review how to deploy, scale, load balance and optimize the trained network inference (or prediction) throughput on GPUs, using tools such as TensorRT Inference Server.

Ujval has spent the last 10 years working on software and libraries for deep learning and HPC at NVIDIA. Previously, he co-founded Stream Processors, a fabless-semi startup building programmable processors for signal and image processing. Ujval earned his PhD in EE at Stanford and a BS at Brown University.

Speaker Ujval Kapasi,NVIDIA
12:15 PM - 01:15 PM
01:15 PM - 01:55 PM
The Intersection of Data, Performance and Usability

Performance is more than a numbers game. This talk will share how Bing leverages behavioral analytics to identify usability bottlenecks and optimize perceived performance. We will cover a wide range of performance experiments, including good ideas that failed, and summarize the lessons we learned along the way.

Sarvesh leads the performance team at Bing, Microsoft and is passionate about solving complex data problems with rich visualizations. Sarvesh holds a M.S. in Computer Science from Columbia University, NY.

Speaker Sarvesh Nagpal,Bing
01:55 PM - 02:45 PM
Open-Source Browser Contributions at Facebook

The Web as an application platform is still very much behind native platforms like Android and Windows for performance and richness of integration APIs. This makes it challenging for developers to create sophisticated yet performant webapps which require a non-trivial amount of client-side JS code. The Browser Engineering team at Facebook finds bottlenecks in browser implementations, contributes code to open-source browsers, prototypes new Web technologies, and advances new API proposals through Web standards committees. This talk will cover our current and future projects for making Web apps as fast and as powerful as native apps, including the new isInputPending() API, the upcoming JS Self-Profiling API, and new ideas for eliminating JavaScript overheads.

Vladan is the tech lead for the Browser Engineering team at Facebook. His technical focus is browser technology, performance, and low-level systems. Previously, he lead the Firefox performance team at Mozilla, working on browser startup, responsiveness and performance measurement.

Speaker Vladan Djeric,Facebook
02:25 PM - 03:00 PM
FlameScope: A Different Way of Looking at Profilers

Even under constant load, the behavior of a system is affected by variance, perturbations, single-threaded execution and other time-based issues, and never completely uniform. Using profilers to analyze the performance of a system generally involves aggregating events or samples over a period of time, and identifying these small variations in the full profile becomes a needle-in-a-haystack problem. FlameScope solves this by combining a subsecond-offset heatmap, for navigating a profile and visualizing these perturbation, with a flame graphs for code-path analysis.

For the past 13 years Martin's career evolved around technology and performance engineering, leading major initiatives at Netflix, Expedia and other companies. Currently, as a Performance Architect at Netflix, Martin is responsible for improving the performance of the Netflix service, for its 148+ million users, watching hundreds of millions of hours of movies and TV shows every day. Martin is also a Venture Advisor at monashees+, one of the largest venture capital firms in Brazil, angel investor and advisor to multiple startups, and an avid open source contributor.

Speaker Martin Spier,Netflix
03:00 PM - 03:30 PM
03:30 PM - 04:05 PM
Monitoring Real User Perceived Performance on Native Apps

At LinkedIn, we monitor our client side performance as experienced by our members (RUM/Real User Monitoring). In this talk, we will share our journey migrating to a new generation of RUM for native apps, challenges faced in building a generic instrumentation framework, tradeoffs made to fit in our mobile architecture, lessons learnt and best practices when designing new Tier 0 performance metrics for the company.

Ramya Pasumarti is a Staff Software Engineer at LinkedIn with the Performance Engineering team. She works on mobile and server side performance focusing on a variety of monitoring, tooling and optimization projects across the stack. She currently leads initiatives to enhance mobile performance measurement, monitoring and debugging experience for developers.

Speaker Ramya Pasumarti,LinkedIn
04:05 PM - 04:40 PM
Improving iOS Startup Performance with Binary Layout Optimizations

Startup of the iOS app is an important performance metric user experience. However, poor ordering of functions in the iOS binary can greatly increase page faults during startup and significantly hurt startup performance. An “order file” can be used to direct the linker how to order functions in an iOS binary better. To generate an order file for iOS apps, we usually use dtrace, but some apps have multiple startup scenarios that we want to optimize for with the order file. The dtrace approach does not scale well and it is not easy to automate. In this talk, we describe some more scalable approaches to generating order files.

Manman Ren is a Software Engineer at Facebook. She currently works on iOS app performance. Previously, Manman worked at Apple's compiler team and on bringing Android to support IA at Intel. Manman holds a PhD from Stanford University.

Speaker Manman Ren,Facebook
Speaker Shane Nay,Facebook
04:40 PM - 05:30 PM
Networking Happy Hour


Kim Hazelwood leads the AI Infrastructure Foundation efforts at Facebook, which focus on the... read more

Kim Hazelwood


Naveen Kumar


Ujval Kapasi


Sarvesh Nagpal


Vladan Djeric


Martin Spier


Ramya Pasumarti


Manman Ren


Shane Nay

UPCOMING EVENT   07/31/2024 AI @Scale

AI Infra @Scale 2024

Meta's Engineering and Infrastructure teams are excited to host AI Infra @Scale, a one-day virtual event featuring a range of speakers from Meta who will unveil the latest AI infrastructure investments and innovations powering Meta's...
UPCOMING EVENT   August 7, 2024 Product @Scale

Product @Scale 2024

Product @Scale conferences are designed for technologists who work on solving complex product problems at scale. The @Scale community focuses on bringing forward people's experiences in creating innovative solutions to large-scale products serving millions or...
UPCOMING EVENT   September 4-5, 2024 (2 day event) Networking @Scale

Networking @Scale 2024

Networking @Scale is a technical conference for engineers that build and manage large-scale networks. Meta’s Networking Infrastructure team is excited to host Networking @Scale, a two-day virtual event featuring a range of speakers from Meta...
UPCOMING EVENT   October 9, 2024 Reliability @Scale

Reliability @Scale 2024

Reliability @Scale is a technical conference for engineers who are passionate about building and understanding highly resilient and reliable systems and products at massive scale. Whether it’s novel design decisions, or outages that impact billions...
UPCOMING EVENT   October 23, 2024 Mobile @Scale

Mobile @Scale 2024

Mobile @Scale is a technical conference designed for the engineers, product managers, and engineering leaders building mobile experiences at significant scale (millions to billions of daily users). Mobile @Scale provides a rare opportunity to gather...
UPCOMING EVENT   November 20, 2024 Video @Scale

Video @Scale 2024

Video @Scale 2024 is a technical conference designed for engineers that develop or manage large-scale video systems serving millions of people. The development of large-scale video systems includes complex, unprecedented engineering challenges. The @Scale community...
PAST EVENT   March 20, 2024 @ 9am PT - 3pm PT RTC @Scale

RTC @Scale 2024

RTC @Scale is for engineers who develop and manage large-scale real-time communication (RTC) systems serving millions of people. The operations of large-scale RTC systems have always involved complex engineering challenges which continue to attract attention...
Past EVENT   May 22, 2024 Data @Scale

Data @Scale 2024

Data @Scale is a technical conference for engineers who are interested in building, operating, and using data systems at scale. Companies across the industry use data and underlying infrastructure to build products with user empathy,...
Past EVENT   June 12, 2024 Systems @Scale

Systems @Scale 2024

Systems @Scale 2024 is a technical conference intended for engineers that build and manage large-scale distributed systems serving millions or billions of users. The development and operation of such systems often introduces complex, unprecedented engineering...


To help personalize content, tailor and measure ads, and provide a safer experience, we use cookies. By clicking or navigating the site, you agree to allow our collection of information on and off Facebook through cookies. Learn more, including about available controls: Cookies Policy