I’m an active developer, maintainer, or contributor to several projects in the Python data science community. Projects I maintain or have made significant contributions to include:
Dask
Dask is a library for flexible parallel computing in Python. It provides parallelized NumPy array and Pandas DataFrame objects as well as a task scheduling interface for scientific computing in larger-than-memory or distributed environments.
PyCondor
PyCondor is a Python package for constructing and submitting jobs to an HTCondor distributed cluster. It was designed to provide built-in functionality to help automate common workflow tasks with a user-friendly API.
PyUnfold
PyUnfold is a Python package for implementing Bayesian iterative unfolding. It is built on top of the Python scientific computing stack, bringing iterative unfolding methods fully into the Python ecosystem.
- J. Bourbeau et al., (2018). PyUnfold: A Python package for iterative unfolding. Journal of Open Source Software, 3(26), 741, https://doi.org/10.21105/joss.00741.
I also contribute to other projects such as scikit-learn, Dask-ML, MLxtend, and more.
Details about my open source contributions are available on GitHub.