Conference Schedule

Saturday March 12

	Room 1	Room 2
8:00	Registration and Breakfast
9:00	Opening Notes
9:10	Petascale genomics Sean Owen
9:45	Coffee
10:00	Understanding the tech community through notebooks Friso van Vollenhoven	Finding relations in documents with IEPY Daniel Moisset
10:45	How big are your banks? Because the ones in the U.S. are pretty big! David R. Pugh	Using random search for efficient hyper-parameters optimization with H2O Jo-fai Chow
11:30	Julia for Data Analysis: features, interfaces and future directions Simon Byrne	Do Angry People Have Poor Grammar? An Exploration of Language Processing and Statistics in Python Ben Fields
12:15	Lunch
13:00	Realtime Bayesian A-B testing with Spark Streaming Dennis Bohle, Ben Teeuwen	Machine Learning with Scikit-Learn Andreas Mueller
13:45	CART: Not only Classification and Regression Trees Marc Garcia	Machine Learning with Scikit-Learn Andreas Mueller
14:30	more psychologists need to learn python – programming as a perspective on the world Titus von Köller	Data driven literary analysis: an unsupervised approach to text analysis and classification Serena Peruzzo
15:15	Running (snippets of) Python in the browser Almar Klein	`from __past__ import print_statement`: a Dadaist Rejection of Python 2 vs 3 James Powell
15:45	Coffee
16:00	From Data Science to Production - deploy, scale, enjoy! Sergii Khomenko	Building a live face recognition system in the blink of a very slow eye Rodrigo Agundez
16:45	Winning Ways for Your Visualization Plays Mark Grundland
17:30	Eneco Drinks n’ Data Intro
17:45	Drinks

Sunday March 13

	Room 1	Room 2
8:30	Breakfast
9:00	The PyData stack state of the union Peadar Coyle
9:45	Contributing to open source Panel discussion	The Duct Tape of Heroes: Bayes Rule. Vincent Warmerdam
10:30	Coffee
10:45	Networks meet Finance in Python Miguel Vaz	Pandas: from bdate_range to wide_to_long Giovanni Lanzani
11:30	Tools and Tricks from a Pragmatic Data Scientist Lucas Bernardi	Pandas: from bdate_range to wide_to_long Giovanni Lanzani
12:15	Lunch
13:00	Jupyter: Notebooks in Multiple Languages for Data Science Thomas Kluyver, Min Ragan-Kelley	Hybrid Recommender Systems in Python Maciej Kula
13:45	Improving PySpark Performance: Spark performance beyond the JVM Holden Karau	Explaining the idea behind Automatic Relevance Determination and Bayesian Interpolation Florian Wilhelm
14:30	Measuring Search Engine Quality using Spark and Python Sujit Pal	NoSQL Python: making data frames work for you in a non-rectangular world Aileen Nielsen
15:15	The Role of Python in the Oil & Gas Industry Giuseppe Pagliuca	Store and manage data effortlessly with HDF5 Margaret Mahan
16:00	Coffee
16:15	Gotta catch'em all: recognizing sloppy work in crowdsourcing tasks Maciej Gryka	Python based predictive analytics with GraphLab Create Danny Bickson
17:00	Lightning Talks
17:50	Closing Notes