Friday 11:30 AM–12:10 PM in Room 1

You Belong with Me: Scraping Taylor Swift Lyrics with Python and Celery

Rebecca Conley, Mark Lavin

Audience level:
Novice

Description

This talk will demonstrate an example application of using Celery to extract all of the lyrics of the inspiring and influential Taylor Swift from the Internet. Using a light-hearted approach and practical Python examples, we aim to teach people the basics of using Celery for data extraction.

Abstract

Celery is an open source, Python-based, asynchronous task framework which is well-suited for extracting data from webpages, APIs, and text files. With its power comes a bit of a learning curve. This talk covers some of the first questions a new Celery user might have. It will also point out some common pitfalls for beginning users.

Outline:

- What is Celery?
    * Project Goals
    * Project History
    * Major Features
- Initial Setup
    * Message broker
    * Result storage
    * First tasks
- Running Celery
- Webscraping Example
- Common Pitfalls