Your source data has multiple formats? You have multiple API’s to pull data from? This talk will go through some common problems with solutions that you will face when trying to combine multiple different research data sources in programmatic way. We go through a real world web project on that visualizes poverty data with JSON API's, Shapefiles and Excel spreadsheets as data sources.
Give the audience idea of real world project in preparation for the part 3
This is the meat of the talk, each point introduces problem and suggests at least one solution. Solutions are based on Python technologies - Handling different data formats - How to manage the data sources (validation, automation, etc) - Normalizing units - Mapping problems (different projects may follow different standards for the id’s) - Normalizing data and metadata