Follow @continuumio on Twitter
  • About
    • Mission
    • Code of Conduct
    • Press
    • NumFocus
    • Talk Videos
  • Venue
  • Schedule
  • Sponsor
    • Become a Sponsor
    • Sponsor List
  • Speakers
    • Keynote Bios
    • Speaker Bios
    • Abstracts
  • News
  • Community
  • CFP

Speaker Bios

(click on the speaker name or photo to view speaker details)


Bart Baddeley

A Full Stack Approach to Data Visualization: Terabytes (and Beyond) at Facebook
Bart Baddeley is a freelance data consultant with a background in AI and Machine Learning. In a previous incarnation as an academic he studied Artificial Intelligence with Neuroscience as an undergraduate and went onto gain a PhD in neural network modelling. A recent convert to the world of Python, he has worked on a range of disparate problems from an early version of high frequency trading, through to modelling ant navigation using ideas from computer vision and machine learning.

Ben Moran

Data Science at Berkeley
Quant developer with interests in machine learning, optimization and theoretical neuroscience.

Bertrand NOUVEL

WIDE IO
An introduction to video action recognition
PhD in Theoretical Computer Sciences from Ecole Normale Superieure de Lyon. 4 years of postdoctoral research related to unsupervised learning related to image analysis and video mining. Author of the no Python Computer Vision Framework in 2009. Bertrand NOUVEL is now leading on a project related to cloud-computing and democratisation of the best algorithms.

Bryan Van De Ven

Continuum Analytics
Beautiful Interactive Visualizations in the Browser with Bokeh, Bokeh, Bokeh, Bokeh - Interactive Visualization for Large Datasets, Bokeh Tutorial, Interactive Plots Using Bokeh, The IPython protocol, frontends and kernels
Mr. Van de Ven received undergraduate degrees in Computer Science and Mathematics from UT Austin, and a Master's degree in physics from UCLA. He has worked at the Applied Research Labs, developing software for sonar feature detection and classification systems on US Naval submarine platforms. He also spent time at Enthought, where he worked on problems in financial risk modeling and fluid mixing simulation, and also contributed to the Chaco visualization library. He has also worked on an assortment of iOS projects as an independent consultant.

Christian Prokopp

Rangespan
The IPython protocol, frontends and kernels
Christian Prokopp is a Data Scientist at Rangespan and responsible for its cloud computing architecture and data mining operations. In his spare time, Christian enjoys panel discussion, giving talks, as well as writing as a data journalist and blogger focusing on big data, cloud computing, and data science. You can follow him on @prokopp and semantikoz.com/blog.

Clyde Fare

Imperial College
How to build a SQL-based data warehouse for 100+ billion rows in Python
Clyde is a second year PHD student at Imperial College working in theoretical photochemistry.

David Cournapeau

Databases for Scientists: Narrowing the Gap Between Array Computing and Databases

David Cournapeau has been using python since 2005, first for automating builds and then data analysis. He has been a regular numpy contributor since 2006, and was the release manager for both numpy and scipy for a few years. He also started what would become the scikits learn project.

He holds a MsC from Telecom Paristech in EE, a MsC from Paris VI in acoustics/signal engineering applied to Music, and a PhD in the domain of speech recognition from Kyoto University.

Dirk Gorissen

BAE Systems Research
Measuring and Predicting Departures from Routine in Human Mobility
Dirk Gorissen holds Masters degrees in Computer Science and Artificial Intelligence and a PhD in Computational Engineering. He has worked in research labs in the US, Canada, Europe & Africa and with a large range of industrial partners, including: Rolls-Royce, BMW, Arcelor Mittal, NXP, and Airbus. His interests span machine learning, data analysis, surrogate modeling and computational engineering. Particularly in the aerospace, automotive, and marine domains. After eight years he left academia to join BAE Systems Research where he works on big data analysis, deep learning, Integrated Vehicle Health Management and autonomous systems related topics. He has a strong interest in UAVs, is an active STEM Ambassador, organizer of the London Big-O Algorithms meetup, and is active in the Tech4Good / ICT4D space.

Emlyn Clay

OpenVivo Ltd.
Python, Pharmaceuticals and Drug Discovery
Emlyn Clay is the director of OpenVivo ltd a distributor for biomedical engineering hardware, software and technical consulting that predominately targets the MATLAB and SIMULINK environment. He is also a final year PhD student at King’s College London reading pharmacology and previously held the position of Data Systems Architect at Verona Pharma plc. He has previously spoken on a number of occasions at the Clinical Data Management and Drug Discovery and Development conferences and is a proponent for Python and it’s scientific stack to be used in the pharmaceutical industry.

Eric Drass

You give me data, I give you art.
An artist operating at the interface between art and science, between the proscribed and the ineffable. My primary media are painting, installation art, and digital alchemy, but sometimes it’s video, sometimes it’s music. shardcore is an ongoing process of aesthetic exploration, you never know what’s around the next bend, and frequently neither do I.

Felix Fernandez

Deutsche Börse
Python in the Financial Industry: The Universal Tool for End-to-End Development
Felix Fernandez works since 01.02.2013 as Business CIO for the Cash & Derivatives IT department within Deutsche Börse IT. His team of approx. 500 people is responsible for the development and maintenance of all major systems for trading and clearing of cash and derivatives instruments of Deutsche Börse Group. Felix has a background in information technology with a diploma from the University of Applied Sciences in Frankfurt, Germany. He works for more than 20 years in the financial industry and has an extensive experience in the exchange business. His activities in the last years have been focused around quantitative modeling and simulation of exchange pricing and market analysis, leading a team of mathematicians, software engineers and financial engineers before he moved back to the IT.

Francesc Alted

Blosc
Blosc: Sending data from memory to CPU (and back) faster than memcpy(), Data Oriented Programming
Francesc Alted is a freelance teacher, developer and consultant. Physicist by formation, he spent most of his life designing hardware and software systems while trying to squeeze the last drop of performance out of them (see a nice article he wrote here: http://www.pytables.org/docs/CISE-12-2-ScientificPro.pdf). Creator of PyTables, BLZ and Blosc and developer of Blaze and Numexpr. He likes good movies too. Email: faltet@gmail.com Twitter: @FrancescAlted GitHub: https://github.com/FrancescAlted

Gael Varoquaux

INRIA
Building a Cutting-Edge Data Processing Environment on a Budget
Gaël Varoquaux is an INRIA faculty researcher working on computational science for brain imaging in the Neurospin brain research institute (Paris, France). His research focuses on modeling and mining brain activity in relation to cognition. Years before the NSA, he was hoping to make bleeding-edge data processing available across new fields, and he has been working on a mastermind plan building easy-to-use open-source software in Python. He is a core developer of scikit-learn, joblib, and Mayavi, a nominated member of the PSF, and often teaches scientific computing with Python using http://scipy-lectures.github.com. His infrequent thoughts can be found at http://gael-varoquaux.info

Gilles Louppe

University of Liege, Belgium; Scikit-Learn
Gradient Boosted Regression Trees in scikit-learn
PhD student from the University of Liege (Belgium) where I do research on machine learning, with an expertise on tree-based methods. I am also core developer on the Scikit-Learn machine learning library, in which I co-authored the ensemble and decision tree modules.

Greg Detre

Writing a simple backend framework for 1-line AB tests in Django

Greg trained as a computational neuroscientist, putting people in brain scanners to try and understand why they forget things.

He co-founded Memrise.com in 2008, which has helped millions of people learn 100s of languages 10 times faster than staring at a textbook.

In 2013, he rebooted as a data scientist at The Guardian, and is now working on a deep learning toolbox.

Ian Huston

Pivotal
Massively Parallel Processing with Procedural Python, Python Powered Data Science at Pivotal
Ian is a data scientist for Pivotal and uses the Python data stack on a wide range of customer projects from fraud detection to transport and logistics. Ian has a background in numerical analysis and simulation and his expertise includes high performance computing for scientific applications, perturbative analysis of large systems of differential equations and the differential geometry underlying relativistic physics. He completed a PhD in theoretical cosmology at Queen Mary, University of London and received a MSc from Imperial College London in theoretical physics. Ian’s work has been published in leading international physics journals and he released Pyflation, the Python numerical package used in his research to the community.

Ian Ozsvald

Mor Consulting
The High Performance Python Landscape
Author of O’Reilly’s forthcoming High Performance Python, teacher on the same subject at PyCon 2013 and 2012, EuroSciPy 2012, EuroPython 2011, speaker at PyConUK. Ian is the owner of Mor Consulting, a specialist agency for the application of AI using Python to data science since 2005.

Ian Ozsvald

Mor Consulting (AI/Data Science in London)
Panel Discussion: "Shouldn't more companies be using data science?"
Data Scientist running Mor Consulting in London, user of NLP/ML and teacher of High Performance Computing, author for O'Reilly on High Performance Computing, 12 yr Pythonista, PyData London co-organiser, co-founder of ShowMeDo.com (Python educational videos)

James Blackburn

Man AHL
Python and MongoDB as a platform for financial market data
Trading Systems Developer at Man AHL

James Powell

NYC Python
Dataflow Programming Using Generators and Coroutines, Embeddings of Python, Embeddings of Python, Embeddings of Python , Generator Showcase Showdown, Generator Showcase Showdown, Generators the Third, My First Numba
James Powell is a professional Python programmer based in New York City. He is the chair of the NYC Python meetup nycpython.com and has spoken on Python/CPython topics at PyData SV, PyData NYC, PyTexas, PyArkansas, PyGotham, and at the NYC Python meetup. He also authors a blog on programming topics at seriously.dontusethiscode.com

James Powell

NYC Python
Generators Will Free Your Mind, Generators Will Free Your Mind, Integration With the Vernacular (the NumPy Approach), Panel Discussion: "Shouldn't more companies be using data science?", Title Coming Soon
James Powell is a NYC-based Python programmer with experience in quantitative finance and data science. He's also very active in the Python community, where he organizes NYC Python which is the world's largest and most active Python meetup group. He also works with the numeric & scientific computing non-profit NumFOCUS to help organize the PyData conference series. In addition, he's a frequent speaker at Python conferences, and has been invited to speak at events such as PyData New York, PyData London, PyGotham, the conference ‘For Python Quants,’ and PyCon Spain.

Jonathan Sedar

Applied AI
Correcting 10 years of messy CRM data: A practical data science project & Introducing Cartopy
None

Jonny Edwards

Thoughtful Technology
Recommenders in Python

I'm a former academic, who set up a small consultancy to do all things data ... in Python!. The last year has been a very active with a visualisation project in Kivy for Verdande Technology, a bottle NLTK and scikits-learn "Rapid Review" sentiment analysis system for www.tripdatabase.com, a recommender system for a biotech startup and a bunch of visualisations for the mastodonc big data consultancy. My job takes me to all areas of Data Science, but the one constant that underpins it all is Python.

All the necessary notebooks/data/html files are available from http://www.thoughtfultech.co.uk/blog/matrix-completion-for-users-and-items.html

Jurgen Van Gael

Rangespan, Ltd.
Hierarchical Text Classification using Python (and friends)
I am the Data Science Director at a London based eCommerce startup called Rangespan. Before joining Rangespan, I was an applied researcher at Microsoft Research. My background is in computer science and more specifically in machine learning. I completed my Ph.D. at the University of Cambridge where my advisor was Zoubin Ghahramani. Before joining the department in Cambridge I spent some time at the University of Wisconsin in Madison for my M.Sc. In a time long gone, I did my undergraduate studies at the University of Leuven.

Karim Chine

CLOUD ERA LTD
Python, R and Cloud Computing for Higher Education and Research
Karim Chine is a Cloud Computing expert, a software architect and a social entrepreneur. After graduating from the French Ecole Polytechnique and Telecom ParisTech, he occupied various positions within Industry and Academia. Karim was a stuff member respectively at Schlumberger, IBM, EBI and Imperial College London. Karim's interests include large scale distributed software design, HPC, pervasive computing and cloud computing applications in research and education. Since 2009, he has been a member of the European Commission panel of experts for the EU Research Infrastructure programme. Karim is the author and designer of Elastic-R, a pioneering Virtual Research Environment for scientific computing, reproducible research and collaboration in the cloud.

Kayla Iacovino

University of Cambridge
Life after matplotlib: Harder, better, faster, stronger
Kayla is a volcanologist at the University of Cambridge who has learned the power of Python in solving scientific analysis problems. Being able to easily collate and visualize large sets of data is crucial to getting the most out of your science. Kayla has worked along side Nial Peters, who developed the AvoPlot suite, to bring Python-powered data visualization onto the tops of active volcanoes around the world.

Kostas

University of Athens, Greece
Authorship Attribution using Python
Search, Analytics, Data Science.

Kyran

Showmedo Ltd.
Getting it out there: Python, Javascript and Web-visualizations
A jobbing programmer, ex research-scientist, recreational hacker, independent researcher, occasional entrepreneur, cross-country runner and improving jazz pianist. I live most of the year in sunny (as Paris) Brighton, UK. During 15 odd years as a research scientist I hacked a lot of code, learned a lot of libraries and settled on some favourite tools. These days I find Python, Javascript and a little C++ goes a long way to solving most problems out there. I specialize in fast-prototyping and feasibility studies, with an algorithmic bent but am happy to just build cool things.

Linda Uruchurtu

DBi
A Beginner's guide to Random Forests - R vs Python
Linda works as a consultant at DBi, helping companies and brands with their data strategy and online analytics.

Mark Basham

Diamond Light Source
Python for High Throughput Science
Mark Basham is a Senior Software Scientist at Diamond Light Source, the UK national Synchrotron Facility and largest scientific investment in the UK for over 40 years. His work revolves around supporting the facilities users with processing the Terabytes of data which is collected every day from the equipment at Diamond. This data is often complex in nature and requires specialist processing on clusters using custom or third party software, the management of which, and new developments are mainly done using Python. Part of Mark’s job also involves making data visualisation and processing available to facility staff and users, and is hence a founding developer of the open source Data Analysis Workbench (dawnsci.org). This workbench includes an extended PyDev (pydev.org) to allow users to easily process their data using the excellent tools available in Python. Mark has also recently become a Fellow of the Software Sustainability Institute (software.ac.uk), and is using this opportunity to further the use of Python within the scientific community which use Diamond Light Source.

Mark Grundland

Functional Elegance
Winning Ways for Your Visualization Plays
Mark Grundland loves turning good ideas into great businesses. As a matchmaker between technology and opportunity, he has 20 years experience in research and development. As a data science and product management consultant, he helps his clients make sense out of the data that drives their business. With a passion for designing new products and services, he regularly advises tech startups involved in software, data, internet, and mobile applications, ranging from online marketing and advertising to digital photography and 3D printing. His professional expertise combines statistical data analysis and user experience design with entrepreneurial experience of marketing strategy and business development. Currently, he has been working as a data scientist for Skimlinks, investigating contextual and behavioral factors that signal purchase intent in order to distinguish people who browse online from those who buy online. His recent projects include developing optimal data driven strategies for generating the right affiliate campaign link in the right context to maximize commission revenue for Skimlinks, visualizing the shape of an online news story as it evolves over time for Grapeshot in partnership with IBM, and designing multiplatform television formats that use social media to empower online democracy to have a real world impact for QuickVox. As the managing director of Roleplay Technologies, he was responsible for the design of realistic computer simulations of how people talk with each other, for use in online training and interactive entertainment. As a software design engineer, he previously worked at Microsoft on the user interface development team for Microsoft Office. Graduating with a PhD in image processing from the University of Cambridge, he investigated novel techniques for transforming the contrast, color, style, and composition of images. His research interests encompass computer graphics, computer vision, pattern recognition, information retrieval, information visualization, user experience design, and statistical modeling. His research results have been published in the book "Style and Content in Digital Imaging" and commercialized as far afield as South Africa. Currently, he serves as an associate editor for the Visual Computer journal.

Martin Goodson

Skimlinks
Most Winning A/B Test Results are Illusory
Martin is currently VP of Data Science at Skimlinks, working on large scale machine learning for Advertising. He previously spent several years at Oxford University, focusing on the genetic determinants of personality and the application of machine learning to large genomics data sets. He went on to head up the Research team at the startup, Qubit, working on the statistical modelling of consumer behaviour, Natural Language Processing and online personalisation.

Mike Mueller

Python Academy GmbH & Co. KG
Cython, Shared Memory Parallelism with Python

Mike Müller has been using Python as his primary programming language since 1999. He is a Python trainer and the CEO at Python Academy (www.python-academy.com).

He teaches a wide variety of Python topics including "Introduction to Python", "Python for Scientists and Engineers", "Advanced Python" as well as "Optimization and Extensions of Python Programs".

He is the chairman of the Python Software Verband e.V., a PSF member, a PSF community service award holder, User Group co-founder. He chaired EuroSciPy 2008 and 2009 as well as PyCon DE 2011 and 2012, and is chair of EuroPython 2014 in Berlin, Germany.

Mike Müller

Python Academy GmbH & Co. KG
Faster Python Programs through Optimization
Mike Müller has been using Python as his primary programming language since 1999. He is a Python trainer and the CEO at Python Academy (www.python-academy.com). He teaches a wide variety of Python topics including "Introduction to Python", "Python for Scientists and Engineers", "Advanced Python" as well as "Optimization and Extensions of Python Programs". He is the chairman of the Python Software Verband e.V., a PSF member, a PSF community service award holder, User Group co-founder. He chaired EuroSciPy 2008 and 2009 as well as PyCon DE 2011 and 2012, and is chair of EuroPython 2014 in Berlin, Germany.

Neri Van Otten

Conversocial
Adaptive Filtering of Tweets with Machine Learning
At Conversocial I research, experiment, design and implement custom machine learning solutions.

Oleksandr Pryymak

skimlinks.com
Probabilistic Data Structures and Approximate Solutions
Oleksandr is a data scientist currently focused on user behaviour analysis. He gained his Python skills from running own software development team, ventured into semantic technologies and end up in academia. Oleksandr completed a Ph.D. in computer science at University of Southampton, after which he sees how everything around is neatly self-organised and joined by a complex network of relations betweens independent agents.

Peter Passaro

NousPlan
DataViz Showdown: a comparison of different data visualisation libraries
Peter Passaro has an academic background in computational neuroscience and artificial intelligence. He has applied these skill sets to both scientific (neuroscience and molecular biology) data, as well as commercially to social media and search data. He was an artificial intelligence and rapid prototyping specialist at social media monitor Brandwatch (brandwatch.com) prior to starting his own data science consultancy last year (NousPlan). As a consultant, Dr. Passaro is focusing on building models and visualisations of how brands and other topics of interest are discussed, shared, and felt about across social media and the wider web. He is also a visiting research fellow at the University of Sussex Department of Informatics working on applying AI tools to large volumes of brain recordings, and organises the Big Data Brighton and BrightonHack digital community events.

Peter Prettenhofer

DataRobot, scikit-learn
Gradient Boosted Regression Trees in scikit-learn, Gradient Boosted Regression Trees in scikit-learn
Peter is a data scientist / software engineer at DataRobot. He studied computer science at Graz University of Technology, Austria and Bauhaus University Weimar, Germany focusing on machine learning and natural language processing. He is a contributor to scikit-learn where he co-authored a number of modules such as Gradient Boosted Regression Trees, Stochastic Gradient Descent, and Decision Trees.

Philip Elson

Met Office
Correcting 10 years of messy CRM data: A practical data science project & Introducing Cartopy
None

Philippe Bracke

London School of Economics
House Prices and Rents: Evidence from a Matched Dataset in Central London
Philippe teaches Real Estate Finance at the London School of Economics. He has worked for the European Central Bank and the International Monetary Fund in the past. His current research focuses on housing markets with special attention to big datasets.

Prash Majmudar

Growth Intelligence
Measuring the digital economy using big data
CTO at Growth Intelligence - leading the technical team and product development. GI works with Python, web data and machine learning techniques to improve private company data for business development / marketing purposes. Previously worked in Defence (embedded software, C++, Radar systems, signal processing)

Simon Jagoe

Enthought
Databases for Scientists: Narrowing the Gap Between Array Computing and Databases

Simon received his B.S. in Electrical and Information Engineering from the University of the Witwatersrand, South Africa. Simon is an experienced scientific software developer in a range of fields, from clinical research to mining.

Skimlinks

PYDATA AFTER PARTY

Skimlinks is sponsoring a PyData after party!

Join us for a glass of vino, amazing tapas, and expertly selected craft beers at Levante Bar to discuss all things PyData! Invitation only so please RSVP via the link here.

Sponsors

DIAMOND

PLATINUM

GOLD

SILVER

LANYARD

SPONSOR WORKSHOP

SUPPORTING

DIVERSITY SCHOLARSHIP

COMMUNITY

MEDIA

Questions? Comments? Contact admin@pydata.org