Reproducibility, replicability and reusability are all interlinked.
Important
Reproducibility…
is when you or others are able to obtain the same results (within a certain tolerance) as the original study, when using the same input data, code, and coding environment, on the same computing platform as the original study.
Input | Aim | Ideal output |
---|---|---|
The same input data and code, on the same computer | Verify the results are real; catch mistakes, error, fraud | The same results! |
What do you need for reproducibility?
What are some examples of this going wrong?
Replicability…
is when you or others are able to produce results that align with the results of the original study, while using different input data and different code, but using the original studies methods or theories.
Input | Aim | Ideal output |
---|---|---|
The methods used previously, but not the same code/data | To test the validity of the results and conclusions | Similar results |
What is needed for replicability?
Reusability…
is when you or others are able to easily use the code or data produced as part of the original study, and potentially rework it and extend it for new applications, contexts, or studies.
Input | Aim | Ideal output |
---|---|---|
The original code, expanded on, used with different data, modified | To continue doing research without reinventing the wheel, and instead recycle code | New results, in different contexts |
What is needed for reusability?
Reusability oftentimes gets sidelined to let reproducibility take the main stage, but it is essential for research!
Nature Computer Science editorial: “But is the code (re)usable?”:
While it is crucial to guarantee the reproducibility of the results reported in a paper, let us also not forget about the importance of making research artifacts reusable for the scientific community.
Reusability […] entails obtaining consistent results with new data, and in some cases, in the context of a new scientific application. Making research artifacts, such as code, reusable allows other researchers to more easily investigate the same or similar scientific questions as new data become available and new ideas are developed, thus helping science progress at a faster pace.
Nature (2021)
Reproducibility, replicability and reusability are all interlinked.
When we discuss reproducibility more broadly (for example, with the “reproducibility crisis”), it’s often implied that reusability and replicability are also included under this umbrella.
More than 70% of researchers have tried and failed to reproduce another scientist’s experiments, and more than half have failed to reproduce their own experiments.
1,500 scientists lift the lid on reproducibility: Nature News Feature Baker (2016)
We’re going to work through each of these different topics using an example project
Note: this is not the only way to organise your work!