“It’s not only peer-reviewed, it’s reproducible!”

Peer review is one of the oldest and most respected instruments of quality control in science and research. Peer review means that a paper is evaluated by a number of experts on the topic of the article (the peers). The criteria may vary, but most of the time they include methodological and technical soundness, scientific relevance, and presentation.

“Peer-reviewed” is a widely accepted mark of quality for a scientific paper. Peer review has its problems, but you won’t find many researchers who favour a non-peer-reviewed paper over a peer-reviewed one. As a result, if you want your paper to be scientifically acknowledged, you most likely have to submit it to a peer-reviewed journal, even though it will take more time and effort to get it published there than in a non-peer-reviewed outlet.

Peer review helps to weed out bad science and pseudo-science, but it also has serious limitations. One of these limitations is that the primary data and other supplementary material, such as documentation and source code, are usually not available. The results of the paper are thus not reproducible. When I review such a paper, I usually have to trust the authors on a number of issues: that they have described the process of achieving the results as accurately as possible, that they have not left out any crucial pre-processing steps, and so on. When I suspect a certain bias in a survey, for example, I can only note that in the review; I cannot test for that bias in the data myself. When the results of an experiment seem too good to be true, I cannot inspect the data pre-processing to see if the authors left out any important steps.

As a result, later efforts to reproduce research results can lead to devastating outcomes. Wang et al. (2010), for example, found that the results of almost all of the literature on a certain topic in computer science could not be reproduced.

“Reproducible”: a new quality criterion

Needless to say, this is not a very desirable state of affairs. Therefore, I argue that we should start promoting a new quality criterion: “reproducible”. Reproducible means that the results reported in the paper can be reproduced by anyone, because all of the necessary supplementary resources have been openly provided along with the paper.

It is easy to see why a peer-reviewed and reproducible paper is of higher quality than one that is merely peer-reviewed. You do not have to take the researchers’ word for how they calculated their results – you can reconstruct them yourself. As a welcome side effect, this would make more datasets and source code openly available. Thus, we could start building on each other’s work and aggregating data from different sources to gain new insights.

In my opinion, reproducible papers could be published alongside non-reproducible papers, just like peer-reviewed articles are usually published alongside editorials, letters, and other non peer-reviewed content. I would think, however, that over time, reproducible would become the overall quality standard of choice – just like peer-reviewed is the preferred standard right now. To help this process, journals and conferences could designate a certain share of their space to reproducible papers. I would imagine that they would not have to do that for too long though. Researchers will aim for a higher quality standard, even if it takes more time and effort.

I do not claim that reproducibility solves all of the problems that we see in science and research right now. For example, it will still be possible to manipulate the data to a certain degree. I do, however, believe that reproducibility as an additional quality criterion would be an important step for open and reproducible science and research.

So that you can say to your colleague one day: “Let’s go with the method described in this paper. It’s not only peer-reviewed, it’s reproducible!”

34 responses to ““It’s not only peer-reviewed, it’s reproducible!””

  1. […] This is a reblog from the OKFN Science Blog. As part of my duties as a Panton Fellow, I will be regularly blogging there about my activities […]

  2. […] “It’s not only peer-reviewed, it’s reproducible!” | OKF Open Science Working… – Peter Kraker nicely sums up why reproducibility should/could take the place of peer review. […]

  3. I replied to this post via several tweets

    @PeterKraker asked me to add the comments here, which I’m more than happy to do

    · I like the sentiment of “Reproducible”. However, the advances of most physical science research cannot be added to a paper.

    · Frequently, it is not the data analysis, but rather the act of data collection itself that is the advancement.

    · That can’t be conveyed in a PDF, it can only be reproduced in a lab, and the equipment can be expensive (e.g. TEM).

    • Peter Kraker says:

      Scott, thanks for your comment. I agree, reproducibility can mean different things depending on the discipline and the kind of scientific innovation. In the case of data collection in the physical sciences, I could imagine that a link to the experimental setup in a remote laboratory would do the trick. What do you think?

  4. Robert Muetzelfeldt says:

    I’m really surprised to see reproducibility described as a “new” quality criterion for science. It’s long been recognised (if not practised) as a defining characteristic of science.

    • Peter Kraker says:

      Robert, by describing it as a new criterion, I was referring to the apparent discrepancy between theory and practice. In my perception, reproducibility has not been considered by journals and conferences so far, even though it is one of the defining characteristics of science. But I wouldn’t mind talking about a “renaissance” of reproducibility as a quality criterion either.

  5. Jan Velterop says:

    I’d like to draw attention to the keynote given in Berlin last July by professor Carole Goble (U of Manchester) on the topic of reproducibility, the slides of which can be found on Slideshare: http://www.slideshare.net/carolegoble/ismb2013-keynotecleangoble

    • Peter Kraker says:

      Thanks Jan for pointing out this very comprehensive presentation. C. Goble looks at the matter of reproducibility from many different points of view, and does so in a very entertaining way. Above all, the presentation does a very good job of explaining why reproducibility is simply NOT a matter of a clear description of sample and method in the paper alone.

      I’d also like to point out Victoria Stodden’s outstanding work on reproducibility in the computational sciences: http://www.stanford.edu/~vcs/talks/UMN-Oct102013-STODDEN.pdf

  6. DS says:

    It appears that you are talking about reproducibility of analysis rather than the reproducibility of an experiment.

    If an honest researcher gives you the data and tells you how the analysis was performed, then how could you fail to reproduce the analysis (up to the machine precision of your computer)?

    • Peter Kraker says:

      That depends on what you mean by “tells you how the analysis was performed”. I’d like to cite a paper that I wrote together with D. Leony, W. Reinhardt and G. Beham (you can find it here):
      “Knorr-Cetina (1981) already showed that papers do not contain all the methodological information needed to reproduce a certain research result. An elimination process is taking place during the production of the paper in which information is decontextualized and typified. Scientific work is usually done in a different way than it is reported on later. This is also backed by Latour (1979) who found that science is not a structured process but rather an array of incoherent observations, which need to be ordered subsequently. Furthermore, there are certain procedural remarks that are too detailed to be included in a publication. The way that these procedural remarks look like differs greatly depending on the method used. Currently, these procedural details are mostly exchanged through personal communication or joint work.”

  7. […] kill 300 Zimbabwe elephants with cyanide Air Pollution Definitively Linked To Cancer “It’s not only peer-reviewed, it’s reproducible!” The ocean is broken Despite End of U.S. Shutdown, Antarctic Research Projects Still Getting […]

  8. Robert L Bell says:

    Of course I am in broad agreement with your point, but reproducibility can be tricky.

    My favorite illustration comes from my chemistry post doc experience. One of my associates stumbled across a novel synthetic method, by the classic means of sorting through the products of a reaction that failed to go as anticipated. What an exciting time! The whole lab pulled together as we churned out example after example of the new reaction class.

    Yet, in the course of nailing down some details for the big publication my colleague tried and failed to replicate the original experiment. To this day, no one has ever managed to make the reaction go again on the original target molecule – yet the original vial of product still exists in a freezer.

    So where was the reproducibility? Not in the literal replication of the original synthesis, but in the supporting evidence of extending and generalising the original work.

  9. […] “It’s not only peer-reviewed, it’s reproducible” […]

  10. […] Peter Kraker, one of this year’s Panton Fellows, has chosen peer review and reproducibility of scientific work as one of his focus areas. A debate on this was started on the OKFN blog (in English). […]

  11. This post reminds me very much of Bruce Charlton’s, “Peer usage versus peer review.” http://www.bmj.com/content/335/7617/451

  12. I like these ideas. How would you embed reproducibility in the peer review process to be able to label a study as ‘reproduced & peer-reviewed’? Could journals team with statistical training centers and integrate students (guided by a lecturer) into this? Another blog recommends that groups of scholars meet on google hangout or skype to reproduce papers together (http://ivory.idyll.org/blog/a-conversation-on-reproducibility.html). I’d like to discuss some concrete ideas either here or on my own blog (http://politicalsciencereplication.wordpress.com) if anyone is interested in taking this further. Best, Nicole

    • Peter Kraker says:

      Nicole, thanks for your comment. I like your suggestion because it decouples the reproducibility effort from the peer review process. This would take away some burden from the reviewers, and the results from the reproducibility effort could potentially inform their decision.

      The idea posted by C. Titus Brown might work very well in the computational sciences, as reviewers should usually have all of the necessary equipment to reproduce a result.

  13. […] “It’s not only peer-reviewed, it’s reproducible!” discusses how journal articles could be labelled as not only ‘peer-reviewed’, but additionally as ‘reproduced’ when someone went ahead and checked the data during the peer-review process. Currently, a serious limitation in usual peer-review processes is “that the primary data and other supplementary material such as documentation source code are usually not available. The results of the paper are thus not reproducible. When I review such a paper, I usually have to trust the authors on a number of issues: that they have described the process of achieving the results as accurate as possible, that they have not left out any crucial pre-processing steps and so on.” […]

  14. […] of much work and discussion in the scientific community over many years – see discussion here), whether that’s to do with working culture, tools, infrastructure or incentives – all […]

  15. Limor Peer says:

    I totally agree that a “reproducible paper” meets a higher standard. The question of who is ultimately responsible for this kind of review is one I tried to unpack in a blog post here: http://isps.yale.edu/news/blog/2013/07/the-role-of-data-repositories-in-reproducible-research

    • Peter Kraker says:

      Limor, thanks for pointing out your comprehensive review. It was interesting to learn that the data curation process in your repository is driven by replication requirements. I am especially impressed that you check whether the submitted materials can actually be used to reproduce the results from the papers.

  16. Fleur Jeanquartier says:

    I also like the idea of promoting “reproducibility” as a new, important criterion for dissemination quality.
    I’d also underline the importance of this kind of quality metric with regard to information visualization.
    For instance, Plaisant, Fekete et al. (2008) promote the development of benchmarks to facilitate the comparison of certain visualization techniques.
    North (2006) and others have already tried to introduce and describe methods for producing comparable visualization evaluation results.
    I also underlined the importance of repositories for comparison in one of my latest publications.
    Thank you for introducing this term as a quality metric and reminding me of its importance!

  17. Fleur Jeanquartier says:

    First and foremost, thank you for linking to the open access version of the paper! 😉
    To your question:
    Science mapping is surely another interesting example usage, and I like the idea of visually approaching the question of mapping science!
    By doing this visually, we can reduce complexity. Much research is currently being published on the topic of time visualization, and this could benefit science mapping research too. I also recently came across certain research questions on modelling evolution. One of the latest case studies I participated in dealt with the question of how (biological) model visualizations evolve over time. This is also a really exciting area for new project ideas! 😉

  18. Matt Fenwick says:

    Nice article! As a reproducibility advocate, one of the criticisms I’ve often run into is that making sure things are reproducible takes time away from “getting real work done”. I hope these people realize how critical reproducibility truly is; without it, what is science? We throw away so much information about how our data analysis was performed that it’s sickening!

    • Peter Kraker says:

      Thanks, Matt! I agree, investing time into reproducibility will pay off in the long run. In the short term, however, it may seem that you are losing time. That’s why I think that if reproducibility was seen as a quality standard, it would be easier to get people to commit to it.

  19. […] the impossibility to repeat or even statistically verify a study being presented. This has a name: reproducible research. We have all heard about the shocking outcome of Glenn Begley's survey of 53 landmark cancer […]


  21. […] contain and how to use them. In my role as an advocate for reproducibility I wrote a blog post on why reproducibility should become a quality criterion in science. The post sparked a lot of discussion, and was widely linked and […]
