Wednesday 10:00 AM–10:40 AM in Belasco (#6203)

Teaching Software Best Practices for Reproducible Science

Chaya D. Stern

Audience level:
Intermediate

Description

This will be an open session to discuss areas where we can improve training for scientists in software development and methodology to improve their research. Attendees will hear about a NumFocus education initiative being spearheaded at Memorial Sloan Kettering and will be given a chance to provide feedback.

Abstract

Many STEM undergraduate degrees do not require their students to take computer science courses. While this might have been a reasonable decision decades ago, current technology and the amount of data most scientists deal with require better training in software development and methodology. This initiative was born out of my own frustration trying to fill the gaps while pursuing a PhD in computational chemistry. While a short course cannot fill all the gaps, it can provide scientists with the right tools to implement software best-practices in their work. The audiences we are targeting are scientists who have some coding familiarity; they will know about for and while loops and might have written some scripts to analyze their data. This course will teach concepts such as version control, testing, documentation, packaging and sharing code. The goal is not only to introduce these concepts, but also provide enough practical training for attendees to start using it in their work.

Subscribe to Receive PyData Updates

Subscribe