PyData Amsterdam | Presentation: Scaling Up Genomics with Spark

Saturday 9:10–10:00

Scaling Up Genomics with Spark

Sean Owen

Audience level:: Novice

Description

This talk will briefly introduce the problem of genomics and existing home-grown efforts to bring "big data" technology to solve genomics

Abstract

It's amazing that our genome so completely and uniquely encodes each of us with a simple 4-protein code, like a file. More amazingly, we're so similar that we can build a reference map of human genomes and reason about commonalities. Genomics has taken off in the last two decades driven largely by advances in computing; the work of mapping the genome is incredibly data and compute intensive. This talk will briefly introduce the problem of genomics and existing home-grown efforts to bring "big data" technology to solve it. It will compare these with the separate rise of technologies like Apache Hadoop and Spark, and how these ideas are helping genomics scale up even further.

Saturday 9:10–10:00

Scaling Up Genomics with Spark

Sean Owen

Description

Abstract

Sponsors

Become a sponsor.