Friday 12:00–14:00 in Novice

An introduction to the tidyverse

Jorge Cimentada

Audience level:
Novice

Description

The tidyverse is a series of packages that work extremely well with each other and have been created with the aim of easing the data analysis process. In this workshop we will discuss why these packages work well together and how they complement each other. We will spend most of the workshop getting our hands dirty by using the dplyr and ggplot2 packages in a real-world data analysis.

Abstract

If you’ve used R, you’ve probably come across the names ggplot2, dplyr or tidyverse. These are R packages created by RStudio's Chief Scientist Hadley Wickham. The tidyverse is a series of packages that work extremely well with each other and have been created with the aim of easing the data analysis process. As stated by its main author, the philosophy of the tidyverse is to allow the analyst to concentrate on the substantive questions rather than on the technicalities of data analysis.

In this workshop we will discuss the philosophy behind the tidyverse: why these packages work well together and how they complement each other. We will learn how to import data from software such as Excel, Stata and SPSS, and we will spend most of the workshop using the dplyr and ggplot2 packages in a real-world example. Expect to brush up on your stats skills as we analyze trends in police killings in the United States. We will explore this data as we learn about Exploratory Data Analysis (EDA) and the tidyverse workflow.

Hope to see you there!

Contents:
- Introduction to the tidyverse
- Data visualization with ggplot2
- A brief introduction to the pipe
- Data transformation with dplyr
- Data import with haven and readr
- Exploratory Data Analysis (EDA)
