Monday 11:30 AM–12:10 PM in Music Box 5411/Winter Garden 5412 (5th fl)

sklearn-Compatible Model Stacking

Keith Myers-Crum

Audience level:
Intermediate

Description

At Civis Analytics we've written and open sourced meta-estimators compatible with scikit-learn that enable the creation of stacked models for regression and classification problems. This talk will describe how those meta-estimators work and show examples of their use.

Abstract

In addition to their success in many Kaggle competitions, at Civis Analytics we have found that stacked models produce reliable push-button estimators for many of the problems we encounter. To enable the easy creation of stacked classifiers and regressors in python, we've written a pair of scikit-learn-compatible meta-estimators that handle both the the I/O between the layers of models and the cross-validation needed for fitting the full model stack.

This talk will describe the methodology of model stacking, along with best practices. We'll show how our stacked model meta-estimators work under the hood. We will also demonstrate how to construct stacked models yourself using the open-source code described above. The source code for our model stacking meta-estimators can be found on github. This code can be installed with pip by calling pip install civisml-extensions.

Subscribe to Receive PyData Updates

Subscribe