Provides utilities for the design and analysis of replication studies. Features both traditional methods based on statistical significance and more recent methods such as the sceptical p-value. Also provides related methods including the harmonic mean chi-squared test and intrinsic credibility. Contains datasets from four large-scale replication projects.


CRAN version:


Development version from github with latest developments:

devtools::install_github(repo = "crsuzh/ReplicationSuccess")

Introduction in vignette

To access the vignette from R use:



  • Held, L. (2020). A new standard for the analysis and design of replication studies (with discussion). Journal of the Royal Statistical Society: Series A (Statistics in Society). 183(2):431-448.

  • Held, L., Micheloud, C., Pawel, S. (2022). The assessment of replication success based on relative effect size. The Annals of Applied Statistics. 16:706–720.

  • Held, L. (2020). The harmonic mean chi-squared test to substantiate scientific findings. Journal of the Royal Statistical Society: Series C (Applied Statistics), 69(3):697-708.

  • Micheloud, C., Held, L. (2021). Power Calculations for Replication Studies. Statistical Science. 37(3):369–379.

  • Pawel, S., Held, L. (2020). Probabilistic forecasting of replication studies. PLoS ONE 15(4):e0231416.

  • Held, L. (2019). The assessment of intrinsic credibility and a new argument for p < 0.005. Royal Society Open Science, 6(3):181534.

  • Micheloud, C., Balabdaoui, F., Held, L. (2023). Beyond the two-trials rule: Type-I error control and sample size planning with the sceptical p-value.