The REPETO team recently launched the first Summer of Reproducibility (SoR) — a mentorship-centered program aimed at making computational research efforts reproducible. In this inaugural year, we enabled the matching of 19 SoR Fellows (primarily undergraduate and graduate students) from around the world with 21 reproducibility focused mentors. The REPETO team created the SoR in order to promote the creation of reproducibility artifacts that are practical and accessible to a wider population of researchers and students. The SoR provides students (and other newcomers to reproducibility) the opportunity to work on this effort and acquire relevant, valuable skills.
The inaugural Summer of Reproducibility successfully concluded in September 2023. The program supported the work of 19 summer students from eight different countries. Blog posts and final products from all students are available below:
2023 SoR Fellows
Fellows Name | Project Title | Blogs |
---|---|---|
Shekhar Pandey | Interactive open educational resources for machine learning courses | Initial, Midterm, Final (Repo) |
Charis Christopher Hulu | Predict Genomics Workflow Execution Time using Two-Stage Approach | Initial, Midterm, Final (Repo) |
Eunsoo (Justin) Shin | Leveraging ML-augmented I/O in Linux | Initial, Midterm, Final |
Faishal Zharfan | Reproduce and benchmark self-adaptive edge applications under dynamic resource management | Initial, Midterm, Final |
Goodness Ay | ScaleBugs: Reproducible Scalability Bugs | Initial, Midterm, Final |
Haoran Wu | GPU Emulator for Easy Reproducibility of DNN Training | Initial, Midterm, Final (Repo) |
Jesse Lima | Verify the reproducibility of an experiment in noWorkflow | Initial, Midterm, Final |
Jiayuan Zhu | DataVizFlow: A Record and Visualize Experimental Results Platform | Initial, Midterm, Final |
Jonathan Edwin | Using Reproducibility in Machine Learning Education: A Study on Cutout, U-Net, and Siamese Networks | Initial, Midterm, Final (Repo) |
Kangrui Wang | Automatic Cluster Performance Shifts Detection Toolkit | Initial, Midterm, Final (Lightning Talk) (Slides) |
Krishna Madhwani | Public Artifact Data and Visualization (Experiment Log & Record and visualize experimental results) | Initial, Midterm, Final |
Luiza Zucchi Hesketh | Advancing Reproducible Science through Open Source Laboratory Protocols as Software | Initial, Midterm, Final (Lightning Talk) (Slides) |
Maharani Ayu Putri Irawan | FlashNet: Towards Reproducible Data Science for Storage System | Initial, Midterm, Final |
Mohamed Saeed | Using Reproducibility in Machine Learning Education: Levels of reproduction | Initial, Midterm, Final (Poster) |
Shayantan Banerjee | Reproducible Analysis & Models for Predicting Genomics Workflow Execution Time | Initial, Midterm, Final (Report) |
Srishti Jaiswal | Teaching Computer Networks with Reproducible Research: Classroom competition for adaptive video | Initial, Midterm, Final (Adaptive Video) (Astream) |
Xueyuan Ren | Measuring Open-source Database Systems under TPC-C with Unreported Settings | Initial, Midterm, Final (Spreadsheet) |
Zahra Nabila | ScaleBugs: Reproducible Scalability Bugs | Initial, Midterm, Final |
Zhiyan (Alex) Wang | Reproducible Evaluation of Multi-level Erasure Coding | Initial, Midterm, Final (Repo) |
The SoR organizers are looking forward to SoR 2024, and will be posting more here soon on how to become a mentor or fellow in next year’s program.