Friday 3:30 PM–5:00 PM in Track 1 - Auditorium

Web Scraping with Beautiful Soup

Monica Puerto

Audience level:
Novice

Description

We will be using Beautiful Soup to Webscrape the IMDB website and create a function that will allow you to create a dictionary object on specific metadata of the IMDB profile for any IMDB ID you pass through as an argument.

Abstract

https://github.com/monipip3/wwcdc/tree/master/webscraping

We will cover some basics of webscraping and do an exercise that will take about 90 minutes. We will install 2 Python Packages (Beautiful Soup and requests) on your laptop via the command line.

We will then perform an exercise where we learn how to scrape specific metadata from an IMDB url of a specific movie.

We will then take what we learned and create a function to be used on any IMDB page of the user's choice.

Subscribe to Receive PyData Updates

Subscribe