The first step of every data science project is to understand the data itself. The Python ecosystem serves that purpose well, with libraries including pandas, matplotlib, and seaborn. This hands-on tutorial introduces comprehensive Exploratory Data Analysis (EDA) techniques based on these libraries. It covers data loading, pre-processing, and, most importantly, visualization.
Tutorial materials are available here.
sidetable
matplotlib: basic building block, essential for fine-tuning
pandas: data manipulation + plotting
seaborn: handy matplotlib wrapper for statistical visualizations
ipywidgets
plot.ly and plot.ly express
bokeh
altair
dtale
pandas-profiling
sweetviz
autoviz
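
As a rough illustration of the load, pre-process, and visualize flow described above, here is a minimal sketch using only pandas and matplotlib. The toy DataFrame and its column names (`species`, `length`, `width`) are made up for this example and are not taken from the tutorial materials; a real project would load data with something like `pd.read_csv` instead.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical toy data standing in for a real dataset (not from the tutorial).
df = pd.DataFrame({
    "species": ["a", "a", "b", "b", "b", None],
    "length": [1.0, 1.2, 3.1, None, 2.9, 3.0],
    "width": [0.5, 0.6, 1.4, 1.5, 1.3, 1.6],
})

# Pre-processing: count missing values per column, then drop incomplete rows.
missing_per_column = df.isna().sum()
clean = df.dropna()

# Summaries: descriptive statistics and per-group means.
summary = clean.describe()
group_means = clean.groupby("species")["length"].mean()

# Visualization: pandas plotting delegates to matplotlib, so the
# returned axes can be fine-tuned with the usual matplotlib calls.
fig, ax = plt.subplots()
clean.plot.scatter(x="length", y="width", ax=ax)
ax.set_title("length vs. width")
```

In an interactive session or notebook, drop the `matplotlib.use("Agg")` line and call `plt.show()` to display the figure. Libraries such as seaborn build on the same matplotlib axes, so the fine-tuning step looks much the same there.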