Presenting SoR’23 Student Blogs and Final Products

The REPETO team recently launched the first Summer of Reproducibility (SoR) — a mentorship-centered program aimed at making computational research efforts reproducible. In this inaugural year, we enabled the matching of 19 SoR Fellows (primarily undergraduate and graduate students) from around the world with 21 reproducibility focused mentors. The REPETO team created the SoR in order to promote the creation of reproducibility artifacts that are practical and accessible to a wider population of researchers and students.  The SoR provides students (and other newcomers to reproducibility) the opportunity to work on this effort and acquire relevant, valuable skills.

The inaugural Summer of Reproducibility successfully concluded in September 2023. The program supported the work of 19 summer students from eight different countries. Blog posts and final products from all students are available below:

2023 SoR Fellows

Fellows Name Project Title Blogs
Shekhar Pandey Interactive open educational resources for machine learning courses Initial, Midterm, Final (Repo)
Charis Christopher Hulu Predict Genomics Workflow Execution Time using Two-Stage Approach Initial, Midterm, Final (Repo)
Eunsoo (Justin) Shin Leveraging ML-augmented I/O in Linux Initial, Midterm, Final
Faishal Zharfan Reproduce and benchmark self-adaptive edge applications under dynamic resource management Initial, Midterm, Final
Goodness Ay ScaleBugs: Reproducible Scalability Bugs Initial, Midterm, Final
Haoran Wu GPU Emulator for Easy Reproducibility of DNN Training Initial, Midterm, Final (Repo)
Jesse Lima Verify the reproducibility of an experiment in noWorkflow Initial, Midterm, Final
Jiayuan Zhu DataVizFlow: A Record and Visualize Experimental Results Platform Initial, Midterm, Final
Jonathan Edwin Using Reproducibility in Machine Learning Education: A Study on Cutout, U-Net, and Siamese Networks Initial, Midterm, Final (Repo)
Kangrui Wang Automatic Cluster Performance Shifts Detection Toolkit Initial, Midterm, Final (Lightning Talk) (Slides)
Krishna Madhwani Public Artifact Data and Visualization (Experiment Log & Record and visualize experimental results) Initial, Midterm, Final
Luiza Zucchi Hesketh Advancing Reproducible Science through Open Source Laboratory Protocols as Software Initial, Midterm, Final (Lightning Talk) (Slides)
Maharani Ayu Putri Irawan FlashNet: Towards Reproducible Data Science for Storage System Initial, Midterm, Final
Mohamed Saeed Using Reproducibility in Machine Learning Education: Levels of reproduction Initial, Midterm, Final (Poster)
Shayantan Banerjee Reproducible Analysis & Models for Predicting Genomics Workflow Execution Time Initial, Midterm, Final (Report)
Srishti Jaiswal Teaching Computer Networks with Reproducible Research: Classroom competition for adaptive video Initial, Midterm, Final (Adaptive Video) (Astream)
Xueyuan Ren Measuring Open-source Database Systems under TPC-C with Unreported Settings Initial, Midterm, Final (Spreadsheet)
Zahra Nabila ScaleBugs: Reproducible Scalability Bugs Initial, Midterm, Final
Zhiyan (Alex) Wang Reproducible Evaluation of Multi-level Erasure Coding Initial, Midterm, Final (Repo)

The SoR organizers are looking forward to SoR 2024, and will be posting more here soon on how to become a mentor or fellow in next year’s program.

Leave a Reply

Your email address will not be published. Required fields are marked *