In this talk I will share some of my favourite tools and tricks that I use every day as a Data Scientist. They help me solve all kinds of problems, from statistical modeling all the way to scalability issues. Expect machine learning, math, algorithms and, of course, Python. All of them are necessary to be a Pragmatic Data Scientist.
The Pragmatic Data Scientist Catalog:
1. kNN: from slow to fast using math.
2. Clustering: from k-Means to Spherical Clustering in a breeze.
3. Missing values in classification: from bad to good using old truth.
4. Power law: bucketing the beast.
5. The King of proportions Ranking.
6. Hyperparameter search: one tool to rule them all.
Python code for all the tricks and tools will be available on GitHub for everyone to use, change and challenge.