Journey to 1000 Models: Scaling Instagram's algorithm without the Reliability Nightmare

173 listens
Speakers: Sing Sing Ma and Luke Levis from Meta

At the beginning of 2023, Instagram had O(10) GPU models, a manual release process, and a manual monitoring setup. This talk centers on our journey to 1000 models: the bumps along the road and the foundational work we built to make monitoring model health faster and more accurate. We’ll cover the model registry, the model launch process, and model stability.

Upcoming 2025 Events:
AI & Data - June 25, 2025
Networking - August 13, 2025
Product - October 22, 2025

Learn more about the @Scale conference here: https://atscaleconference.com/

@Scale is a technical conference series for engineers who build or maintain systems designed for scale. New for 2025, in-person and virtual attendance options will be available at all four of our programs, which bring together complementary themes to create event communities and spark cross-discipline collaboration.