The REPETO team recently launched the first Summer of Reproducibility (SoR) — a mentorship-centered program aimed at making computational research efforts reproducible. In this inaugural year, we enabled the matching of 19 SoR Fellows (primarily undergraduate and graduate students) from around the world with 21 reproducibility focused mentors. The REPETO team created the SoR in order to promote the creation of reproducibility artifacts that are practical and accessible to a wider population of researchers and students. The SoR provides students (and other newcomers to reproducibility) the opportunity to work on this effort and acquire relevant, valuable skills.
The inaugural Summer of Reproducibility successfully concluded in September 2023. The program supported the work of 19 summer students from eight different countries. Blog posts and final products from all students are available below:
2023 SoR Fellows
Student Fellow | Project Title | Blogs | Mentor |
---|---|---|---|
Shekhar Pandey | Interactive open educational resources for machine learning courses | Initial, Midterm, Final (Repo) | Fraida Fund, NYU |
Charis Christopher Hulu | Predict Genomics Workflow Execution Time using Two-Stage Approach | Initial, Midterm, Final (Repo) | In Kee Kim, UGA |
Eunsoo (Justin) Shin | Leveraging ML-augmented I/O in Linux | Initial, Midterm, Final | Haryadi Gunawi, UChicago |
Faishal Zharfan | Reproduce and benchmark self-adaptive edge applications under dynamic resource management | Initial, Midterm, Final | Junchen Jiang, UChicago |
Goodness Ay | ScaleBugs: Reproducible Scalability Bugs | Initial, Midterm, Final | Cindy Rubio Gonzalez, Haryadi Gunawi, Hao-Nan Zhu |
Haoran Wu | GPU Emulator for Easy Reproducibility of DNN Training | Initial, Midterm, Final (Repo) | Vijay Chidambaram |
Jesse Lima | Verify the reproducibility of an experiment in noWorkflow | Initial, Midterm, Final | Joao Felipe Pimentel, Juliana Freire |
Jiayuan Zhu | DataVizFlow: A Record and Visualize Experimental Results Platform | Initial, Midterm, Final | Anjo Vahldiek-Oberwagner |
Jonathan Edwin | Using Reproducibility in Machine Learning Education: A Study on Cutout, U-Net, and Siamese Networks | Initial, Midterm, Final (Repo) | Fraida Fund, NYU |
Kangrui Wang | Automatic Cluster Performance Shifts Detection Toolkit | Initial, Midterm, Final (Lightning Talk) (Slides) | Sandeep Madireddy, Ray Andrew Sinurat |
Krishna Madhwani | Public Artifact Data and Visualization (Experiment Log & Record and visualize experimental results) | Initial, Midterm, Final | Anjo Vahldiek-Oberwagner |
Luiza Zucchi Hesketh | Advancing Reproducible Science through Open Source Laboratory Protocols as Software | Initial, Midterm, Final (Lightning Talk) (Slides) | Tim Fallon, Dan Bryce |
Maharani Ayu Putri Irawan | FlashNet: Towards Reproducible Data Science for Storage System | Initial, Midterm, Final | Haryadi Gunawi, UChicago |
Mohamed Saeed | Using Reproducibility in Machine Learning Education: Levels of reproduction | Initial, Midterm, Final (Poster) | Fraida Fund, NYU |
Shayantan Banerjee | Reproducible Analysis & Models for Predicting Genomics Workflow Execution Time | Initial, Midterm, Final (Report) | In Kee Kim, UGA |
Srishti Jaiswal | Teaching Computer Networks with Reproducible Research: Classroom competition for adaptive video | Initial, Midterm, Final (Adaptive Video) (Astream) | Fraida Fund |
Xueyuan Ren | Measuring Open-source Database Systems under TPC-C with Unreported Settings | Initial, Midterm, Final (Spreadsheet) | Yang Wang, Miao Yu |
Zahra Nabila | ScaleBugs: Reproducible Scalability Bugs | Initial, Midterm, Final | Cindy Rubio Gonzalez, Haryadi Gunawi, Hao-Nan Zhu |
Zhiyan (Alex) Wang | Reproducible Evaluation of Multi-level Erasure Coding | Initial, Midterm, Final (Repo) | John Bent, Anjus George |
The SoR organizers are looking forward to SoR 2024, and will be posting more here soon on how to become a mentor or fellow in next year’s program.