Project page: https://delftdata.github.io/repro-di/
The repository contains the dataset created by applying the reproducibility checklist on 27 papers belonging to data integration. The list of papers participating in the study can be found in the dataset, second parameter called name. We used the dataset to analyse the degree of reproducibility in data integration. The analysis can be found in the jupyter notebook.
You can find the reproducibility checklist on the project page.
All the plots used in the paper are generated in the jupyter notebook and can be found in the plots directory.
- Python 3.8.2
- Use a python virtual environment of your choice
- Install the requirements
pip install -r requirements.txt