Starting with a basic setup for click-through rate (CTR) prediction, we will step by step improve on it by incorporating the lessons we've learned from operating and scaling such a mission-critical system. The presented lessons will be related to infrastructure, model comprehension, and specifics like how to deal with thresholds. They should be applicable to most ML models used in production.
After briefly introducing Yelp and more specifically click-through rate (CTR) prediction at Yelp, we will start out with a basic setup for model-based predictions in a production system. From there we will point out deficiencies of said setup in various areas, some of which arise especially in large scale environments or when predicting CTRs.
This will give us an opportunity to dive deeper into a selection of insightful practical lessons we've learned from operating and scaling the mission-critical CTR prediction system at Yelp. Those lessons can be categorized into:
Along the way, various Python tools which we're actively using will be highlighted.
By touching on a multitude of diverse challenges, this presentation strives to provide valuable insights for any engineer interested in or working with ML models in a production environment.