Follow @continuumio on Twitter
  • About
    • Mission
    • Code of Conduct
    • Press
    • NumFocus
    • Talk Videos
  • Venue
  • Schedule
  • Sponsor
    • Become a Sponsor
    • Sponsor List
  • Speakers
    • Keynote Bios
    • Speaker Bios
    • Abstracts
  • News
  • Community

Speaker Bios

(click on the speaker name or photo to view speaker details)


Alex Rubinsteyn

NYU
Python on the GPU with Parakeet
Alex Rubinsteyn is a Computer Science Ph.D. student at NYU. His interests are a high variance mixture distribution around programming language implementation and machine learning.

Allen Grimm

Churn Prediction With Graphical Models

Currently using python for modeling, clustering, and predicting user behavior in games on mobile devices. Experience with graphical models, k-means, neural networks, SVMs, and regression.

Professional Background:

  • B.S. from Gannon University in electrical engineering, focus on computer engineering with a minor in mathematics.
  • M.S. from Portland State University in electrical engineering, focus on computational intelligence.
  • Data mining at the Nike Sport Research Lab.
  • Currently a Data Scientist at PlayHaven.

Andrew Montalenti

CTO, Parse.ly
Rapid Data Visualization, from Python to Browser, Real-time streams and logs with Storm and Kafka
Andrew is the co-founder and CTO of Parse.ly, a Python-built tech startup that helps top online publishers understand what content their audience is interested in -- and why. Prior to starting Parse.ly, Andrew was a technologist with nearly a decade of experience in finance, high tech, and online media. He earned a degree in Computer Science from NYU. A dedicated Pythonista, JavaScript hacker, and open source advocate, Andrew is also a technical author and speaker. He has presented at PyData, PyCon, and several other technology conferences.

Avishek Panigrahi

Xilinx
Generators Will Free Your Mind

Avishek Panigrahi has an extensive background in chip design and an avid interest in using data effectively as a method to handle the increasing complexity of current generation chips. He has been working on chip designs for over 12 years - designing circuits, writing EDA/CAD tool flows and doing physical implementation of complex Systems-on-Chip.

He first used the early concepts of data analysis to organize, slice and dice data generated from CAD tools with the intent to find power bottlenecks and to reduce power on the (then) latest generation of MIPS microprocessors. The work turned into a paper - "Clock PowerReduction-Analysis Metrics and Power Reduction Techniques" and won the Technical Committee Award for best paper at SNUG, San Jose - a highly regarded chip design industry conference. Since then he has been interested in looking at analyses of chip design data - wirelengths, timing data, crosstalk, parasitics, log files etc, - generated in the process of designing chips. He is an avid user of the python data analysis stack to simplify and automate many reporting and debug tasks.

Avishek is a Senior Staff Engineer at Xilinx and has worked for Samsung and MIPS Technologies in the past. He has an MS in Electrical Engineering from the University of Virginia and a B.Tech in Electrical Engineering from Indian Institute of Technology, Bombay. He currently serves as a Member of the Technical Committee for SNUG Silicon Valley, Boston, Ottawa and Austin

Ben Lerner

DataNitro
Data Engineering 101: Building your First Data Product , Introduction to DataNitro: Python in Excel
Ben is the co-founder and CEO of DataNitro, a YCombinator backed startup which integrates Excel with Python. He has a degree in Mathematics from MIT.

Brian Granger

Cal Poly State University, IPython
Functional Performance with Core Data Structures, IPython Interactive Widgets
Brian Granger is an Assistant Professor of Physics at Cal Poly State University in San Luis Obispo, CA. He has a background in theoretical atomic, molecular and optical physics, with a Ph.D from the University of Colorado. His current research interests include quantum computing, parallel and distributed computing and interactive computing environments for scientific and technical computing. He is a core developer of the IPython project, the creator of PyZMQ and a contributor to SymPy. Contact him at ellisonbg@gmail.com or @ellisonbg (Twitter, GitHub).

Bryan Lewis

Paradigm4
Python as Part of a Production Machine Learning Stack, Pythran: Static Compiler for High Performance
Bryan Lewis is the chief data scientist at Paradigm4. Bryan is an applied mathematician and is the author of a number of R packages including irlba, rredis, doRedis, websockets, and bigalgebra.

Chris Colbert

JP Morgan
K-means Clustering with Scikit-Learn, Sentiment Classification Using scikit-learn
Chris is a professional software developer in New York City where he uses Python and C++ to develop the UI framework for the front office trading platform of a large investment bank. He is the creator of the Nucleic project which is a collection of software for developing enterprise quality Python applications. He received his MS in Mechanical Engineering from the University of South Florida.

Dan Blanchard

Educational Testing Service
DataViz Showdown: a comparison of different data visualisation libraries
Daniel Blanchard is a Research Engineer in the NLP & Speech group at Educational Testing Service (ETS). He has been coding for over 20 years and has been using Python exclusively for the past 4 years. He is the primary maintainer of the DRMAA Python, GridMap, SKLL libraries and is co-maintainer of chardet.

Daniel Krasner

Johnson Research Labs
House Prices and Rents: Evidence from a Matched Dataset in Central London, Rapid Development and High Performance Text Processing
Daniel Krasner is a senior researcher at John Research Labs where he focuses on high performance statistical solutions in text and natural language processing. Prior to entering the world of data science, Daniel Krasner was a researcher at the Mathematical Sciences Research Institute in Berkeley and an assistant professor of mathematics at UCLA. He holds a PhD in mathematics from Columbia University. Previously, Daniel was the chief data scientist at Sailthru, an email and analytics platform, where he ran Sailthru’s behavioral analytics division. Daniel is also the co-founder of KFit Solutions, a data science consulting firm, and has recently taught Applied Data Science as a Columbia University adjunct professor.

Emily Chen

Working with Hadoop from Python
Coming soon.

Erik Bernhardsson

Spotify
Building Data Pipelines with Python and Luigi
Erik is the Technical Lead of the Discovery team at Spotify. He focuses on music recommendations and machine learning, in particular large scale methods using Hadoop. Before that, Erik lead the Analytics team in the Stockholm office where he was responsible for collecting, aggregating and making sense out of all the data.

Ian Huston

Pivotal
Massively Parallel Processing with Procedural Python, Python Powered Data Science at Pivotal
Ian is a data scientist for Pivotal and uses the Python data stack on a wide range of customer projects from fraud detection to transport and logistics. Ian has a background in numerical analysis and simulation and his expertise includes high performance computing for scientific applications, perturbative analysis of large systems of differential equations and the differential geometry underlying relativistic physics. He completed a PhD in theoretical cosmology at Queen Mary, University of London and received a MSc from Imperial College London in theoretical physics. Ian’s work has been published in leading international physics journals and he released Pyflation, the Python numerical package used in his research to the community.

Imran Haque

Counsyl
Beyond the dict: Python Tools to Wrangle Data From CSV Up
Imran S. Haque is the Director of Research at Counsyl, a medical genomics startup, where he helped build the pipeline to perform thousands of genetic tests per week. He earned his PhD at Stanford University working on large-scale machine learning for drug design with millions of molecules and trillions of data points.

Jake Vanderplas

University of Washington
Creating Interactive Applications in Matplotlib, Machine Learning with scikit-learn , Python as Part of a Production Machine Learning Stack
Jake Vanderplas is an NSF Postdoctoral fellow working jointly in the Computer Science and Astronomy departments at the University of Washington. His research involves large-scale machine learning applications within astronomy and astrophysics. He is a maintainer of the Python packages Scikit-learn and Scipy, and regularly contributes to several of the other packages within the numpy/scipy ecosystem. He occasionally blogs about Python-related topics at Pythonic Perambulations - http://jakevdp.github.com.

Jake Vanderplas

Efficient Computing with NumPy

Jake Vanderplas is an NSF post-doctoral fellow at University of Washington, working jointly between the Computer Science and Astronomy departments. His research involves applying recent advances in machine learning to large astronomical datasets, in order to learn about the Universe at the largest scales. He is co-author of "Statistics, Data Mining, and Machine Learning in Astronomy", a Python-centric textbook to be published by Princeton Press in 2013, and has presented many technical talks and papers in this subject area.

In the Python world, Jake is active in maintaining and contributing to several core Python scientific computing packages, including Scikit-learn, Scipy, Matplotlib, and others. He occasionally blogs on python-related topics at http://jakevdp.github.com.

James Powell

NYC Python
Dataflow Programming Using Generators and Coroutines, Embeddings of Python, Embeddings of Python, Embeddings of Python , Generator Showcase Showdown, Generator Showcase Showdown, Generators the Third, My First Numba
James Powell is a professional Python programmer based in New York City. He is the chair of the NYC Python meetup nycpython.com and has spoken on Python/CPython topics at PyData SV, PyData NYC, PyTexas, PyArkansas, PyGotham, and at the NYC Python meetup. He also authors a blog on programming topics at seriously.dontusethiscode.com

Joshua Horowitz

TechLawNY
An Intro to FOSS Licenses and Copyrights in Data and Software
Josh has been an avid free and open source software (FOSS) enthusiast since his early college days and a lifelong computer hobbyist. Last summer he worked as a legal intern at the Software Freedom Law Center, a non-profit organization that provides legal counsel to free and open source software developers on a broad range of intellectual property and corporate issues. Most notably he worked on defending the right to run alternative operating systems and software in the new Windows 8 world, assisting the X.Org project in updating their licenses, and ensuring license compatibility between GPLv3 and the new Creative Commons license. In his spare time, he works on furthering his python skills and spreading the use of Linux to willing and able users. Josh recently started TechLawNY, where he specializes in assisting attorneys who are handling technically complex legal matters. Josh plans on focusing his legal practice on advising new technology companies and software developers.

Julia Evans

A practical introduction to IPython Notebook & pandas
Julia Evans is a programmer & data scientist based in Montréal, Quebec. She loves coding, math, playing with datasets, teaching programming, open source communities, and late night discussions on how to dismantle oppression. She co-organizes PyLadies Montréal and Montréal All-Girl Hack Night.

Karissa McKelvey

Continuum Analytics
Intro to Python Data Analysis in Wakari
Karissa McKelvey has a BA in Computer Science and Political Science from Indiana University. After graduating, she worked as a research assistant at the Center for Complex Networks and Systems Research at Indiana University on an NSF grant to study information diffusion to analyze and visualize the relationship between social media expressions and political events. She is an active contributor to open source projects and continues to publish in computer supported cooperative work and computational social science venues.

Kelsey Jordahl

Enthought, Inc.
GeoPandas: Geospatial Data in Python Made Easy
Kelsey is a Scientific Software Developer at Enthought. He works on applications for clients in science, engineering and finance, and teaches quantitative python training courses. He holds a Ph.D. in marine geophysics from the MIT/Woods Hole Oceanographic Institution

Martin Laprise

Parse.ly
Probabilistic Data Structures for Realtime Analytics
Until 2010, Martin was doing research in nonlinear optics as a PhD Candidate in Physics at Université Laval. After his graduate study, he joined BrightScope where he was doing GPU Computing and Data Analysis. Prior to joining Parse.ly, he was CTO at Crowdbase where he led the Research and Development of the natural language processing and recommendation components of the platform. Martin enjoys crunching data, Python and running.

Matthew Rocklin

Continuum Analytics
Functional Performance with Core Data Structures, Old School - Functional Data Analysis , Python in Business Intelligence: What's Missing?
Matthew Rocklin likes numerics, mathematics, and programming paradigms. He contributes to a variety of open source projects and endeavors to demonstrate the value of abstract solutions to concrete problems. A graduate of UC Berkeley (Physics, Math) and of the University of Chicago (PhD in CS). http://matthewrocklin.com/

Matthew Trentacoste

Adobe
Image Features in Python

Matthew Trentacoste is a Computer Scientist at Adobe Systems, where he develops mobile photography platforms. He's interested in computational photography, networked cameras and how large collections of images can enable new forms of creative imaging.

His background is a mix of image processing, human visual perception, data analysis and mobile platforms. Previously, he's worked or consulted at companies large and small, including Dolby, Pocket Pixels, BrightSide Technologies and Mobify. He holds a degree in Computer Science from Carnegie Mellon and with a Ph.D. and M.Sc. from the University of British Columbia, both in Computer Science.

Olivier Pomel

Datadog
Python @ Datadog: Building High-Volume Data Systems in the Python Ecosystem
Olivier is currently CEO and Co-Fouder at Datadog, a monitoring-as-a-service platform built to delight dev and ops teams alike. Prior to founding Datadog, Olivier built data systems for K-12 teachers as a VP, Technology for Wireless Generation, growing the development team from a handful to close to 100 of the best engineers in NYC. Before Wireless Generation, Olivier held software engineering positions at IBM Research and several internet startups. Olivier is an original author of the VLC media player and holds a MS, CS from the Ecole Centrale Paris.

Paddy Mullen

Continuum Analytics
A practical introduction to Pandas with Citibike Data, Bokeh Workshop
Paddy earned a B.S. in Economics from George Mason University. After graduating he started programming, working at a variety of startups. Paddy has also worked on side projects such as terminal cast and chart widget, two advanced javascript programs. At perpetually.com he wrote a flash decompiler and javascript rewriting system.

Panel Discussion

Scaling Python Across Hardware & Organizations
-------

Peter Wang

Continuum Analytics
, Python's Role in the Future of Data Analysis, State of the Py, 2015
Peter holds a B.A. in Physics from Cornell University and has been developing applications professionally using Python since 2001. Before co-founding Continuum Analytics in 2011, Peter spent seven years at Enthought designing and developing applications for a variety of companies, including investment bankers, high-frequency trading firms, oil companies, and others. In 2007, Peter was named Director of Technical Architecture and served as client liaison on high-profile projects. Peter also developed Chaco, an open-source, Python-based toolkit for interactive data visualization.

Sarah Guido

University of Michigan School of Information
A Beginner’s Guide to Machine Learning with Scikit-Learn
Sarah is a second-year master's student at the University of Michigan's School of Information. She intends to pursue a career in data science after graduation in April. She currently works as a data analyst for the HathiTrust Digital Library. Three of her favorite things are Python, data, and machine learning.

Saul Diez-Guerra

Ampush
My First Numba, My First Numba , Speed Without Drag, Speed without drag
Saul Diez-Guerra is a senior software engineer at Ampush in New York City, where he uses Python to build ad management and bidding systems, after a stint in social network R&D at Telefónica. He hails from Spain, where he received both a Bachelors in Computer Science as well as one in Telecommunications.

Srivatsan Ramanujam

Pivotal
Python Powered Data Science at Pivotal
Srivatsan Ramanujam is a Senior Data Scientist at Pivotal where he executes Data Sciences labs for their customers, with a special focus on Text Analytics. Previously, as a Data Scientist at Sony Mobile Communications in Redwood City, he lead Sony Mobile's Data Science initiatives that spanned across Statistical Machine Learning and Natural Language Processing. Before joining Sony, he was an engineer in the Analytics team at Salesforce.com. He received a Masters in Computer Sciences from UT Austin, completing his thesis and research in NLP where he focused on graphical models for weakly supervised sequence prediction problems. He loves mountaineering and is a native speaker of Python.

Thomas Levine

CSV Soundsystem
ddpy: Data-Driven Music for Big Data Analysis
Playing with computers since he was young, Tom eventually developed back and wrist pain, so he started studying ergonomics and conducting quantitative ergonomics research. Then he realized that he’d accidentally become a data scientist. And his back and wrists now hurt less. Tom likes using data to help people work less and think more.

Thomas Wiecki

Quantopian & Brown University
Algorithmic Trading with Zipline, Bayesian Data Analysis with PyMC3
I am currently enrolled in the Ph.D. program at Brown University where I investigate the neuronal underpinnings of mental illness using quantitative methods like Bayesian Modeling. I also work as a quantitative researcher for Quantopian Inc where we are building the worlds' first browser based financial backtesting platform.

Travis Oliphant

Co-Founder & CEO, Continuum Analytics
Blaze, Building the PyData Community, Conda, Packaging and Deployment, Packaging and Deployment, Pythran: Static Compiler for High Performance, Scalable Analytics and Visualization: Connecting Expertise to Data With Python, Welcome

CEO and Co-Founder, Continuum Analytics Introduction to NumPy; Introduction to SciPy

Dr. Oliphant has a Ph.D. in Biomedical Engineering from the Mayo Clinic, and M.S. and B.S. degrees in Electrical Engineering (and Math) from Brigham Young University. Travis has worked extensively with Python for numerical and scientific programming since 1997, and was the primary developer of the NumPy package and the author of the definitive Guide to NumPy. He is also the primary founding author of the SciPy package. During his academic career, he has worked in the fields of satellite remote sensing, Magnetic Resonance Imaging (MRI), Ultrasound, elastography, and general inverse problems. He was an Assistant Professor of Electrical and Computer Engineering at Brigham Young University from 2001 to 2007 where he taught courses in probability theory, electromagnetics, inverse problems, and signal processing. In addition, he directed the BYU Biomedical Imaging Lab, and performed research on scanning impedance imaging. He has done consulting work since 1997 in laser scattering off of semiconductors, sparse matrix calculations for search engines, and mesh transformations for fluid dynamics. Dr. Oliphant co-founded Continuum Analytics, Inc. in 2012 and currently serves as its CEO.

Trent Nelson

Continuum Analytics
PyParallel: How we Removed the GIL and Exploited all Cores (Without Needing to Remove the GIL at all)
Coming soon.

Victor Jakubiuk

DataNitro
Excel and IPython
Victor is the founder and CTO of DataNitro (a Python plugin to Microsoft Excel). His daily work involves integration of Python with the “outside world” and Python peculiars in the Windows environment. Victor is a recent Computer Science graduate from MIT, where he did research in concurrency, high-performance computing and distributed systems. You can read more about his work at www.jakubiuk.net

Wes McKinney

DataPad
DataPad: Python-powered Business Intelligence, Practical Medium Data Analytics with Python
Coming Soon

Yves Hilpisch

Performance Python
Coming soon.

Sponsors

DIAMOND

PLATINUM

GOLD

SPONSOR WORKSHOP

LANYARD

SILVER

SUPPORTING

DIVERSITY SCHOLARSHIP

MEDIA

COMMUNITY

Questions? Comments? Contact admin@pydata.org