The @Scale Conference 

San Jose Convention Center 10:00am - 6:30pm

Event Completed

The 2017 @Scale Conference has finished! Thank you for coming. Videos from the event will be posted soon.



@Scale brings thousands of engineers together throughout the year to discuss complex engineering challenges and to work on the development of new solutions. We're committed to providing a safe and welcoming environment — one that encourages collaboration and sparks innovation.

Every @Scale event participant has the right to enjoy his or her experience without fear of harassment, discrimination, or condescension. The @Scale code of conduct outlines the behavior that we support and don't support at @Scale events and conferences. We expect participants to follow these rules at all @Scale event venues, online communities, and event-related social activities. These guidelines will keep the @Scale community a safe and enjoyable one for everyone.

Be welcoming. Everyone is welcome at @Scale events, inclusive of (but not limited to) gender, gender identity or expression, sexual orientation, body size, differing abilities, ethnicity, national origin, language, religion, political beliefs, socioeconomic status, age, color and neurodiversity. We have a zero-tolerance policy for discrimination.

Choose your words carefully. Treat one another with respect and in a professional manner. We're here to collaborate. Conflict is not part of the equation.

Know where the line is, and don't cross it. Harassment, threats, or intimidation of any kind will not be tolerated. This includes verbal, physical, sexual (such as sexualized imagery on clothing, presentations, in print, or onscreen), written, or any other form of aggression (whether outright, subtle, or micro). Behavior that is offensive, as determined by @Scale organizers, security staff, or conference management, will not be tolerated. Participants who are asked to stop a behavior or an action are expected to comply immediately or will be asked to leave.

Don't be afraid to call out bad behavior. If you're the target of harmful or offensive behavior, or if you witness someone else being harassed, threatened, or intimidated, don't look away. Tell an @Scale staff member, a security staff member, or a conference organizer immediately. Please notify our event staff, security staff, or conference organizers of any harmful or offensive behavior that you've experienced or witnessed in any form, whether in person or online.

We at @Scale want our events to be safe for everyone, and we have a zero-tolerance policy for violations of our code of conduct. @Scale conference organizers will investigate any allegation of problematic behavior, and we will respond accordingly. We reserve the right to take any follow-up actions we determine are needed. These include issuing a warning, refusing admittance, ejecting a participant from the conference with no refund, and banning a participant from future @Scale events.

8:00am - 10:00am


8:00am - 6:30pm


10:00am - 10:40am


10:50am - 11:30am

Dev Tools & Ops
The journey of turning a CI system into a universal platform

Sergey Doroshenko and Adriana Libório share lessons learned on a journey of transforming Facebook's continuous integration system, Sandcastle, into a universal platform.
10:50am - 11:30am

Hot Topics
Google Translate: Breaking language barriers in emerging markets

The session will focus on how the Google Translate team members make their Android app work better for users in emerging markets.
10:50am - 11:30am

Data
Azure Data Lake Store

Azure Data Lake Store (ADLS) is a fully managed, elastic, scalable, and secure file system that supports semantics of the Hadoop distributed file system (HDFS) and the Microsoft Cosmos file system. It is specifically designed and optimized for a broad spectrum of big data analytics that depend on an extremely high degree of parallel reads and writes, as well as colocation of compute and data for high-bandwidth and low-latency access. It brings together key components and features of Cosmos — long used internally at Microsoft as the warehouse for data and analytics — and HDFS. It also is a unified file storage solution for analytics on Azure. Internal and external workloads run on this unified platform. Distinguishing aspects of ADLS include its support for multiple storage tiers, exabyte scale, and comprehensive security and data sharing. Raghu Ramakrishnan will cover ADLS architecture, design points, the Cosmos experience, and performance.
10:50am - 11:30am

Machine Learning
Building data-efficient AI algorithms with a dose of inspiration from the brain

Currently, the predominant approach in AI is to use unlimited data to solve narrowly defined problems. To progress toward humanlike intelligence, AI benchmarks will need to be extended to focus more on data efficiency, flexibility of reasoning, and transfer of knowledge between tasks. This talk will detail the challenges and successes in making these ideas operational. At Vicarious, the language of probabilistic graphical models is used as the representational framework. Compared with neural networks, graphical models have several advantages, such as the ability to incorporate prior knowledge, answer arbitrary probabilistic queries, and deal with uncertainty. However, a downside is that inference can be intractable. By incorporating several insights that originally were discovered in neuroscience, engineers at Vicarious were able to create probabilistic models, on which accurate inference can be performed using message-passing algorithms that are similar to the computations in a neural network.
11:40am - 12:20pm

Machine Learning
When all the world's data scientists are just not enough

What if you had to build more machine-learned models than there are data scientists in the world? Well, at enterprise companies serving hundreds of thousands of businesses, this is precisely the case. In this talk, Shubha Nabar will walk through the scale challenges of building AI in the enterprise. She also will describe the general-purpose machine learning platform at Salesforce that automatically builds personalized models optimized for every business and every use case.
11:40am - 12:20pm

Data
LogDevice: A file-structured log system

Facebook's Mark Marchukov will talk about Facebook's work to build a durable and highly available sequential distributed storage system that can handle hardware failures and sustain consistent delivery at exceptionally high ingest rates.
11:40am - 12:20pm

Hot Topics
Building successful teams at scale

This panel features leaders who have spent decades building successful teams that tackle large-scale technical challenges. Come hear how they’ve worked across disciplines, across oceans, and across their companies to move technology forward — while solving for the inherent difficulties of scaling technical management and managing technical teams at scale. Even as technology plays a greater role in society, this panel will highlight how people remain at the center of the code, infrastructure, and products that reach billions of people.
11:40am - 12:20pm

Dev Tools & Ops
Inside the VS Code team: How a small team manages growth

Product teams usually grow to deal with the increased workload caused by their own success. While growing, they risk losing the qualities that initially made them successful. Microsoft's VS Code is a small team with a wildly successful open source, cross-platform code editor. In this talk, Kai Maetzel will explain what the team has learned from developing open source projects and working on desktop, SaaS, and mobile applications, and how these findings help the team stay small and nimble, manage explosive growth, and make millions of developers happy.
12:20pm - 1:20pm

Lunch & Office Hours

1:20pm - 2:00pm

Machine Learning
Bringing 360 to the world

This talk will explore the future directions for 360 media across photos, video, and AR/VR. Matt Uyttendaele will dive deep on his latest work, applying machine learning models to 360 photos for an enhanced user experience.
1:20pm - 2:00pm

Dev Tools & Ops
Resiliency testing with Toxiproxy

Fibers get cut, databases crash, and you've adopted chaos engineering to challenge your production environment as much as possible. But what are you doing to craft the resiliency test suites that minimize the impact of failure on your application as much as possible? How do you debug resiliency problems locally and make sure single points of failure don't creep into the application in the first place? Shopify developed the open source Toxiproxy in 2015 to emulate timeouts, latency, and outages in its development environments. This talk will equip you with tools to start writing resiliency test suites that harden your own applications and supplement other chaos engineering practices.
1:20pm - 2:00pm

Data
Using Apache Beam for batch, streaming, and everything in between

Apache Beam is a unified programming model capable of expressing a wide variety of traditional batch and complex streaming use cases. By neatly separating properties of the data from runtime characteristics, Beam enables users to easily tune requirements around completeness and latency and run the same pipeline across multiple runtime environments. In addition, Beam's model enables cutting-edge optimizations such as dynamic work rebalancing and autoscaling, giving those runtimes the ability to be highly efficient. This talk will cover the basics of Apache Beam, touch on its evolution, and describe the main concepts in its powerful programming model. It will include detailed, concrete examples of how Beam unifies batch and streaming use cases, and show efficient execution in real-world scenarios.
1:20pm - 2:00pm

Hot Topics
Archer, a distributed computing platform for media processing

Netflix engineers were spending too much time working with infrastructure and not enough time on their media algorithms, so they created Archer, a high-scale distributed computing platform for media processing. It uses Docker, which allows developers to write their code in any language with any OS packages, test it on a laptop, and run it with millions of compute hours. This talk will discuss the Archer platform architecture as well as its implementation and applications, including feature extraction, encode experimentation, and machine learning.
2:10pm - 2:50pm

Hot Topics
Building Live With

Last year Facebook started rolling out the ability for public figures to go live with a guest. Now Live With is available for all profiles and Pages on iOS, letting you invite a friend into your live video so you can hang out together, or broadcast the conversation to an audience. To make this possible, Facebook's engineers worked to bring real-time interactive communication to broadcast-quality streams. In this talk, Nick Ruff will discuss how they bridged the trade-offs between video streaming technologies to enable real-time multiparty broadcasting.
2:10pm - 2:50pm

Machine Learning
GPUs and deep learning

In the last year, GPUs plus deep learning have gone from a hot topic to large-scale production deployment in major data centers. That's because deep learning works, and the evolution of GPUs has made them a great fit for deep learning and inference. Neural nets, frameworks, and GPU architectures have changed significantly in the last year as well, allowing better solutions to be created more quickly and in more places, moving from niche applications to the mainstream. These changes also allow GPUs to be used in real time for more industrial automation and human interaction roles. This talk covers GPU architecture and framework evolution, scaling out and scaling up training and performance, real-time inference improvements, security plus VM isolation and management, and overall deep learning flow improvements to make development and deployment more devops-friendly.
2:10pm - 2:50pm

Data
PerfEnforce: A dynamic scaling engine for analytics with performance guarantees

Magda Balazinska talks about PerfEnforce, a system that enables performance-oriented service-level agreements (SLAs) for data analytics. Using a set of tenants and query-level performance SLAs, she addresses how to dynamically assign compute resources to queries from each tenant (query scheduling). She also discusses how to dynamically resize the multi-tenant service to minimize costs due to compute resources and SLA violation penalties (resource provisioning).
2:10pm - 2:50pm

Dev Tools & Ops
Keeping 2 billion lines of code moving forward

Google's codebase includes over 2 billion lines of code, spanning thousands of projects. This talk looks at how Google keeps such a large codebase nimble and evolving despite its size and scale.
2:50pm - 3:15pm

Office Hours

3:15pm - 3:55pm

Hot Topics
Democratizing the real-time filming and editing of 3D content

Unity builds the tools that enable game developers, artists, designers, and videographers to tell better visual stories. Games made with Unity have reached over 3 billion devices and were installed over 16 billion times. Unity also has powerful tools for design, video animation, special effects, and video rendering. Adam Myhill, who heads cinematics at Unity, will discuss how Unity is democratizing and scaling tools for video that enable the real-time filming and editing of 3D content, watching VR without hardware, and more.
3:15pm - 3:55pm

Machine Learning
Achieving AI at scale on mobile devices

Qualcomm is an at-scale company. It powered the smartphone revolution and connected billions of people. It pioneered 3G and 4G, and now it is leading the way to 5G and a new era of intelligent, connected devices. Mobile is going to be the largest machine learning platform on the planet. Come learn how Qualcomm is making efficient on-device machine learning possible, how Qualcomm and Facebook worked closely to support machine learning in Facebook applications, and what's next for Qualcomm and AI.
3:15pm - 3:55pm

Data
Serverless at scale with Amazon DynamoDB and Lambda

DynamoDB is a fully managed NoSQL database service that provides high throughput at low latency with seamless scalability. The service is the backbone for many Internet applications, handling trillions of requests daily. The scale of data that applications have to manage continues to grow rapidly, making it a challenge to manage systems and respond to events in real time. This talk will be a deep dive into the challenges of building the DynamoDB Streams feature, which provides a time-ordered sequence of item changes on DynamoDB tables, and leveraging it with AWS Lambda to reimagine large-scale applications for the cloud.
4:05pm - 4:45pm

Machine Learning
Deep learning trends and developments

Deep learning is making a great impact across products at Google and in the world at large. As Google pushes the limits of AI and deep learning, research is underway in many areas. With integration into many Google products, this research is improving the lives of billions of people. Open source tools like TensorFlow and open publications put the latest deep learning research at the fingertips of engineers around the world. This talk begins by exploring what has enabled this field to evolve rapidly over the last few years. It also will cover some of the leading research advances and current trends that point to a promising future, and the algorithms that make it possible.
4:05pm - 4:45pm

Hot Topics
Efficient and healthy background data prefetching sessions

The ability to prefetch data while your app is in the background can decouple the usability of your app from network availability. Moreover, it can minimize cellular data usage and significantly increase perceived speed. This talk walks through the main technical and performance challenges of the implementation. How do you schedule data prefetching in the background? What framework is the most appropriate to execute this type of work? What should you be prefetching and when? Find out here.
4:05pm - 4:45pm

Dev Tools & Ops
Okta's secret sauce for dealing with flaky tests at scale

Monolith, microservices, or both? Find out how Okta has developed the best of both worlds to solve for the challenge of scaling to handle dynamic traffic volumes. In this session, Kelvin Zhu will walk through how the Okta team manages a monolithic codebase by way of virtual splitting, allowing dialable CI loop speed. He'll also speak to how Okta has rethought testing at scale — ensuring that tests run at speed while also monitoring the quality of tests to help gain visibility into problem tests and get past them.
4:05pm - 4:45pm

Data
Make data-driven decisions faster with real-time stream processing

Facebook can move fast and iterate because of its ability to make data-driven decisions. Data from its stream processing systems provides real-time analytics and insights; these systems are also integrated into various Facebook products, which have to aggregate data from many sources. In this talk, Rajesh Nishtala covers the difficulties of stream processing at scale, the solutions Facebook has created to date, and three case studies on reducing the time it takes to deliver insights from data via stream processing. The case studies include examples from search product development, accelerating daily pipelines in the data warehouse, and seamless integration with machine learning platforms. Each case study shows how Facebook can deliver value to more teams while continuing to abstract the details of stream processing away from those teams. Rajesh concludes by speaking to the future of stream processing.
4:55pm - 5:35pm

Dev Tools & Ops
Rapid release at massive scale

This updated talk will give the latest details on how Facebook's Release Engineering team ships multiple times per day.
4:55pm - 5:35pm

Data
BigQuery: Managed storage for analytics

BigQuery is best known for being a large-scale query engine, but one of its most important components is a structured storage system. Over the last several years, Google has found that active data management is crucial for providing no-ops scalable storage. This talk goes into the details of how BigQuery managed storage works, why it's hard to get right, and how it helps ensure that queries are always fast.
4:55pm - 5:35pm

Machine Learning
Unlocking meaning across languages at scale

At Facebook, the mission is to give people the power to build community and bring the world closer together. In this talk, Necip Fazil Ayan will present the most recent work on using deep learning for machine translation and language understanding to unlock meaning across languages to help that mission. He will talk about the challenges of doing machine translation and language understanding at large scale, and will discuss technologies and platforms that have been built to tackle these challenges.
4:55pm - 5:35pm

Hot Topics
Scaling Android with ReDex

This talk will explore applications of ReDex to improve performance on emerging-market phones.
5:35pm - 6:30pm

Happy Hour & Office Hours
