PyData global is hosting multiple sprints for open source projects. Each sprint will have a project maintainer leading the sprint to help guide the contributors throughout the session.

What are Sprints?

Development sprints offer an opportunity to enhance and contribute to open source projects in a focused session with the project maintainers. It is a fun exercise that helps open source projects to improve with the help of the open source community.

Who can participate?

ALL experience levels are welcome to participate. Contribution guides and environment setup instructions are provided with each sprint.

Sprints Schedule

You can access the schedule of the sprints using this link.

The duration of each sprint is 4 hours.

Communication Channels

Every sprint will have its own channel on slack. Contributors should enroll in those slack channels to receive the list of issues to work on and have a space to discuss their progress with the maintainers.

During the sprint, there will be a dedicated zoom call for that sprint which will be used for sprint introduction. Participants will be divided into breakout rooms where a maintainer will come over to support attendees when needed.

Before the Sprint
It is crucial to read the documentation and environment set up instructions to have a productive sprint. If you’re facing any issues in setting up your environment, please use slack to ask for support in setting up the environment before the sprint begins.

Which Projects are Sprinting?

In PyData Global 2021, 7 projects will have separate sprint sessions with some of its awesome project core developers and maintainers!

October 28 | 10pm UTC

Matplotlib is a plotting library for the Python programming language and its numerical mathematics extension NumPy. It provides an object-oriented API for embedding plots into applications using general-purpose GUI toolkits like Tkinter, wxPython, Qt, or GTK+.

Maintainers leading the sprint

Thomas Caswell
Thomas is a soft-matter physicist who now developer software for scientists. He develops data acquisition, management, and analysis tools at NSLS-II at BNL, as a core maintainer of h5py, and is the current Project Lead of Matplotlib

Important Links

October 29 | 10am UTC

Ignite is a high-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

Maintainers leading the sprint

A CS Undergrad. Currently working on the docs of PyTorch-Ignite and helping manage the community

Jeff Yang
Contributor to PyTorch-Ignite and its related projects

Victor Fomin
Software Engineer at Quansight working on AI related open-source project, maintainer of PyTorch-Ignite

A CS Undergrad and a Machine Learning Intern at Factmata working on NLP, and contributor at PyTorch-Ignite.

Important Links

October 29 | 4pm UTC

Modin: speed up your pandas workflows by changing a single line of code.

Maintainers leading the sprint

Devin Petersohn
Devin Petersohn recently graduated the University of California Berkeley with a PhD in Computer Science. His focus was on distributed systems and dataframe formalism. As a part of his PhD work, Devin created Modin, which is a drop-in replacement for pandas that is faster and more scalable. Away from screens, Devin enjoys a number of other typical human activities.

Important Links

October 30 | 10am UTC

PyMC (formerly PyMC3) is a Python package for Bayesian statistical modeling focusing on advanced Markov chain Monte Carlo (MCMC) and variational inference (VI) algorithms. Its flexibility and extensibility make it applicable to a large suite of problems.

Maintainers leading the sprint

Martina Cantaro

Important Links

October 30 | 2pm UTC

NumPy is an open source project aiming to enable numerical computing with Python. It was created in 2005, building on the early work of the Numeric and Numarray libraries. NumPy will always be 100% open source software, free for all to use and released under the liberal terms of the modified BSD license.

SciPy (pronounced “Sigh Pie”) is an open-source software for mathematics, science, and engineering. It includes modules for statistics, optimization, integration, linear algebra, Fourier transforms, signal and image processing, ODE solvers, and more.

Maintainers leading the sprint

Melissa Mendonça
Melissa is an applied mathematician and former university professor turned software engineer. She works at Quansight, developing open-source software and working on consulting projects. She has been involved with the Brazilian Python community for some time, with a focus on outreach and education. Currently, she leads the Documentation Team and works as a maintainer for NumPy.

Matti Picus
Matti is a maintainer of NumPy and a core dev of PyPy. He works at Quansight Labs. He has a wide range of experience in industry around image and processing and near-realtime systems. He has worked with large corporations on training and open-source management, and loves mentoring and teaching.

Rohit Goswami
Rohit is a software engineer at Quansight Labs where he supports the Scientific python ecosystem with a focus on its interactions with Fortran. He is also an avid FOSS programmer and proponent for over ten years and is a Rannis funded doctoral researcher at the University of Iceland working with the Faculty of Physical Sciences and the Faculty of Applied Mathematics on computational chemistry projects.

Ralf Gommers
Ralf has been deeply involved in the SciPy and PyData communities for over a decade. He is a maintainer of NumPy, SciPy and data-apis.org, and has contributed widely throughout the SciPy ecosystem. He served on the NumFOCUS Board of Directors from 2012-2018. Ralf co-directs Quansight Labs, which consists of developers, community managers, designers, and documentation writers who build open-source technology and grow open-source communities around data science and scientific computing projects.

Mukulika Pahari
Mukulika Pahari is a computer engineering undergraduate student at the University of Mumbai. She is a contributor to NumPy’s documentation and an avid fiction reader in her free time.

Important Links

October 30 | 4pm UTC

Bokeh is an interactive visualization library for modern web browsers. It provides elegant, concise construction of versatile graphics, and affords high-performance interactivity over large or streaming datasets. Bokeh can help anyone who would like to quickly and easily make interactive plots, dashboards, and data applications.

Maintainers leading the sprint

Ian Thomas
Ian is a developer at makepath and a contributor to a number of open-source projects including Bokeh and Matplotlib. He has particular interest in using OpenGL/WebGL for fast rendering, and spatial algorithms such as calculating contours. Ian is British and drinks a lot of tea.

Iury Piva
Iury is a Bokeh contributor, mostly interested in the Typescript side. He works at makepath as a software developer and never runs away from a good coding challenge.

Timo Metzger
Timo is a Core Team member of Bokeh and a technical writer at makepath. He loves finding the right words for complex technical ideas and helping others get started with open-source projects.

Bryan Van de Ven
Bryan is a Senior Systems Software Engineer at NVIDIA, where he works on front-end and visualization tools for RAPIDS. Previously he worked at Microsoft, and also at Anaconda, where he created the conda package manager and co-created the Bokeh visualization library.

Important Links