Friday 1:15 PM–2:45 PM in Speakeasy

Python Visualization for Exploration of Data

Stephen F. Elston, Ronald Lopez

Audience level:
Novice

Description

Visualization is an essential method in any data scientist’s toolbox. Visualization is a key first step in the exploration of most data sets. Visualization is also a powerful tool for presentation of results and for determining sources of problems with analytics. This tutorial introduces attendees to the most commonly used Python visualization packages, matplotlib, pandas plotting and seaborn.

Abstract

Visualization of complex real-world datasets presents a number of challenges to data scientists. By developing skills in data visualization, data scientist can confidently explore and understand the relationships in complex data sets while undertaking analyses. Using the Python matplotlib, pandas plotting and seaborn packages attendees will learn how to do the following.

  • Explore complex data sets with visualization, to develop understanding of the inherent relationships.
  • Create multiple views of data to highlight different aspects of the inherent relationships, with different graph types.
  • Use plot aesthetics to project multiple dimensions.
  • Apply conditioning or faceting methods to project multiple dimensions