Accurately predicting when clients churn is valuable to many businesses. This talk will cover validation techniques that can be used to assess the adequacy of a client churn model. After reviewing how to fit the Cox Proportional Hazard model with the lifelines library, we will focus on the model validation techniques, such as proportional assumptions, Schoenfeld, martingale and score residuals.
Survival analysis models have been successfully applied to predict the ‘time to churn’ in subscription-based businesses. Nowadays there are a number of Python libraries that allow data scientists to quickly build a ‘churn’ model. However, unlike in a typical linear regression, a proportional hazard model does not easily lend itself to the ‘observed minus predicted’ residuals metric. However, certain regression modelling assumptions like additivity, linearity and distributional assumptions extend to the proportional hazard model and must be examined.
The main objective of this talk is to provide and intuitive explanation of the metrics that can be used to assess the adequacy of a client churn model. The talk will cover mathematical definitions and Python implementation of such metrics. It will cover: