| Aditi Khullar |
Introduction to Language Modeling
|
| Aditya Lahiri |
Dealing With Imbalanced Classes in Machine Learning
|
| Agata Sumowska |
Role playing Annotation workshop
|
| Akos Furton |
Production Code in Data Science Consulting
|
| Alex Egg |
Discover your latent food graph with this 1 weird trick
|
| Allen Downey |
The Inspection Paradox is Everywhere,
Unconference - 10:55: Contributing to Open Source, 11:40: Data Science Education
|
| Amanda Moran |
Pandas vs Koalas: The Ultimate Showdown!
|
| Anne Bauer |
Data science at The New York Times: a mission-driven approach to personalizing the customer journey,
Fireside Chat
|
| Anthony Scopatz |
Conda-press, or Reinventing the Wheel,
What's now in NumFOCUS projects? (Part 2),
Stump the Chump
|
| Bhargav Srinivasa Desikan |
Role playing Annotation workshop
|
| Bhargav Srinivasa Desikan |
Role playing Annotation workshop
|
| Bill Lynch |
Bringing mental health data to doctors
|
| Breck Baldwin |
Introduction to Bayesian Modeling with Stan: No Statistics Background Required (Pt 1),
Introduction to Bayesian Modeling with Stan: No Statistics Background Required (Pt 2),
Introduction to Bayesian Modeling with Stan: No Statistics Background Required (Pt 3)
|
| Brian Austin |
How and why to put your Jupyter notebooks in Docker containers
|
| Bryan Cross |
Panel Discussion: My First Open Source Contribution
|
| Bryan Van de Ven |
What's now in NumFOCUS projects? (Part 2)
|
| Cameron Davidson-Pilon |
New Trends in Estimation and Inference
|
| Carlos Afonso |
Visualizing the 2019 Measles Outbreak in NYC (with Python)
|
| Chaya D Stern |
Using Graph Nets (GNs) to predict molecular properties
|
| Chris Fonnesbeck |
A Primer on Gaussian Processes for Regression Analysis,
Panel Discussion: My First Open Source Contribution
|
| Christian Steiner |
Improve the efficiency of your Big Data application
|
| Christopher J "CJ" Wright |
conda-forge sprint,
conda-forge sprint
|
| Chris Wiggins |
Data science at The New York Times: a mission-driven approach to personalizing the customer journey,
Fireside Chat
|
| Colin Carroll |
Building a maintainable plotting library,
What's now in NumFOCUS projects? (Part 1)
|
| Daniel Rodriguez |
Effective Python and R collaboration
|
| Dante Gama Dessavre |
High-Performance Data Science at Scale with RAPIDS, Dask, and GPUs
|
| David Palaitis |
Panel Discussion: Pitching Open Source Up Your Management Chain
|
| Debra Williams Cauley |
Discussion: What’s Missing in Python and Data Online Resources?
|
| Deepyaman Datta |
New and Upcoming
|
| Diego Torres Quintanilla |
Cleaning, optimizing and windowing pandas with numba
|
| Emily A Ray |
Discover your latent food graph with this 1 weird trick
|
| Eric Dill |
Is Spark still relevant? Multi-node CPU and single-node GPU workloads with Spark, Dask and RAPIDS.
|
| Ethan Rosenthal |
Time series for scikit-learn people
|
| Eugene Tang |
Introduction to Language Modeling
|
| Evan Patterson |
Semantic modeling of data science code
|
| Ferras Hamad |
Reproducibility in ML Systems: A Netflix Original
|
| Francesc Alted |
Improve the efficiency of your Big Data application,
What's now in NumFOCUS projects? (Part 1)
|
| Gil Forsyth |
Panel Discussion: Pitching Open Source Up Your Management Chain
|
| Hannah Aizenman |
What's now in NumFOCUS projects? (Part 2),
Matplotlib sprint,
Matplotlib sprint,
Panel Discussion: My First Open Source Contribution,
Building a maintainable plotting library,
HoloViz and Matplotlib sprint,
HoloViz and Matplotlib sprint,
Jupyter & matplotlib sprint,
Jupyter & matplotlib sprint
|
| Hillary Green-Lerman |
How to Prove You’re Right: A/B Testing with SciPy,
Hacking the Data Science Challenge
|
| Hyonjee Joo |
Spark Backend for Ibis: Seamless Transition Between Pandas and Spark
|
| Ian Whalen |
Implementing Lightweight Random Indexing for Polylingual Text Classification
|
| Itamar Turner-Trauring |
Small Big Data: using NumPy and Pandas when your data doesn't fit in memory
|
| Jacqueline Gutman |
Simplified Data Quality Monitoring of Dynamic Longitudinal Data: A Functional Programming Approach
|
| James Powell |
PyData Pop Quiz,
Sloth & ENVy
|
| Jason Grout |
Jupyter sprint,
Jupyter sprint,
Jupyter & matplotlib sprint,
Jupyter & matplotlib sprint
|
| Jeff Reback |
Pandas sprint,
Pandas sprint,
Introduction to pandas
|
| Jenny Turner-Trauring |
Working with Maps: Extracting Features for Traffic Crash Insights
|
| Jens Fredrik Skogstrom |
What we learned by running a large custom Bayesian forecasting model in production
|
| Jessica Tyler |
Geo Experiments and CausalImpact in Incrementality Testing
|
| Jim Weiss |
Example
|
| Joe Jevnik |
Zarr vs. HDF5
|
| Joseph Kearney |
New and Upcoming
|
| Joshua Falk |
Generating realistic, differentially private data sets using GANs
|
| Julia Signell |
Data-centric exploration using intake, dask, hvplot, datashader, panel, and binder,
HoloViz and Matplotlib sprint,
HoloViz and Matplotlib sprint,
Panel Discussion: My First Open Source Contribution,
HoloViz and Matplotlib sprint
|
| Kamal Abdelrahman |
Painting A Picture of Public Data
|
| Katherine Kampf |
Build an AI-powered Pet Detector in Visual Studio Code
|
| Keith Kraus |
High-Performance Data Science at Scale with RAPIDS, Dask, and GPUs
|
| Kelle Cruz |
AstroPy sprint,
AstroPy sprint
|
| Kelly Jin |
A Few Good Public Servants: How Great Analysis Inspires Action,
Fireside Chat
|
| Kelly Shen |
Julia for Pythonistas
|
| Kevin Fleming |
Panel Discussion: Pitching Open Source Up Your Management Chain
|
| Lara Kattan |
Machine learning from scratch using the scientific Python stack
|
| Laurence Warner |
Role playing Annotation workshop
|
| Lauren Oldja |
Managing Stakeholders: The Key to Successful Data Science for Business
|
| Lev Konstantinovskiy |
Role playing Annotation workshop
|
| Li Jin |
Spark Backend for Ibis: Seamless Transition Between Pandas and Spark
|
| Maciej Wojton |
Free Your Esoteric Data Using Apache Arrow and Python
|
| Malaika Handa |
Colorism in High Fashion (featuring: K-Means Clustering)
|
| Marc Garcia |
Introduction to pandas
|
| Marianne Hoogeveen |
The physics of deep learning using tensor networks
|
| Mariel Frank |
Introduction to NLP
|
| Marius van Niekerk |
conda-forge sprint,
conda-forge sprint,
A How-to guide for migrating legacy data applications
|
| Martin Hirzel |
Type-Driven Automated Learning with Lale
|
| Matti Lyra |
[SCHEDULE CHANGE 12:45PM - 2:15PM] Neural Networks for Natural Language Processing
|
| Meghan Heintz |
Launching a new warehouse with SimPy at Rent the Runway
|
| Michael Johns |
Propensity Score Matching: A Non-experimental Approach to Causal Inference
|
| Michael Skarlinski |
New and Upcoming
|
| Michal Mucha |
Swiftly turn Jupyter notebooks into pretty web apps
|
| Michoel Snow |
Hacking the Data Science Challenge,
How to Prove You’re Right: A/B Testing with SciPy
|
| Mike McCarty |
Panel Discussion: Pitching Open Source Up Your Management Chain
|
| Mitzi Morris |
Bayesian Inference for Fun and Profit
|
| Moussa Taifi Ph.D. |
Clean Machine Learning Code: Practical Software Engineering Principles for ML Craftsmanship
|
| Natu Lauchande |
Machine Learning Engineering principles with Python and MLFlow
|
| Nick Becker |
High-Performance Data Science at Scale with RAPIDS, Dask, and GPUs
|
| Noam Ross |
Building Software and Communities With Peer Review
|
| Parin Choganwala |
Discover your latent food graph with this 1 weird trick
|
| Patrick Landreman |
A Crash Course in Applied Linear Algebra
|
| Paul Ganssle |
Stump the Chump
|
| Petr Wolf |
From Raw Recruit Scripts to Perfect Python (in 90 minutes)
|
| Piero Ferrante |
Should I develop my own DS library? Maybe.
|
| Ralf Gommers |
What's now in NumFOCUS projects? (Part 1)
|
| Raman Tehlan |
Genetic algorithms: Making errors do all the work
|
| Raoul-Gabriel Urma |
Advanced Software Testing for Data Scientists
|
| Raphaël Meudec |
tf-explain: Interpretability for Tensorflow 2.0
|
| Rohit Kapur |
A How-to guide for migrating legacy data applications
|
| Romain Cledat |
Every ML Model Deserves To Be A Full Micro-service
|
| Ryan Abernathey |
What's now in NumFOCUS projects? (Part 1)
|
| Samuel Rochette |
Quantifying uncertainty in machine learning models
|
| Sara Seager |
Stars, Planets, and Python,
Fireside Chat
|
| Saul Shanabrook |
Same API, Different Execution,
Jupyter sprint,
Jupyter sprint,
Jupyter & matplotlib sprint,
Jupyter & matplotlib sprint
|
| Sean Law |
New and Upcoming,
New and Upcoming
|
| Stanley van der Merwe |
From Raw Recruit Scripts to Perfect Python (in 90 minutes)
|
| Steve Dower |
Unconference - 10:55: Contributing to Open Source, 11:40: Data Science Education,
The Secret Life of Python,
Panel Discussion: My First Open Source Contribution
|
| Tamar Yastrab |
The Echo-Chamber of Your Social Media Feed
|
| Thomas Caswell |
Stump the Chump,
Building a maintainable plotting library,
HoloViz and Matplotlib sprint,
HoloViz and Matplotlib sprint,
Jupyter & matplotlib sprint,
Jupyter & matplotlib sprint,
Matplotlib sprint,
Matplotlib sprint
|
| Thomas J Fan |
Deep Dive into scikit-learn's HistGradientBoosting Classifier and Regressor,
What's now in NumFOCUS projects? (Part 2)
|
| Tom Augspurger |
Scalable Machine Learning with Dask,
What's now in NumFOCUS projects? (Part 1),
Introduction to pandas
|
| Veronica Hanus |
To comment or not
|
| Will Kurt |
An Introduction to Probability and Statistics
|
| YOU! |
Unconference,
Unconference,
1:20pm-2pm: A Github for Data, 2pm-3:25pm: OPEN,
3:40pm-5pm: Role Play annotation facilitator training, 5:10pm-5:50pm: STUMPY & Time-series analysis,
Unconference - Using GPUs in Python/Julia/R Applications,
Unconference - 2:10: Software Engineering for Data Scientists, 2:55: Equity and Algorithmic Fairness,
Unconference - Native Extension Modules,
Unconference - 3:45: Design Thinking for Data Science, 4:30: Open
|
| Yuanqing Wang |
Using Graph Nets (GNs) to predict molecular properties
|