ICERM Reproducibility in Computational and Experimental Mathematics: Readings and References

From StoddenWiki
Jump to: navigation, search

This page collects useful references for the ICERM workshop Reproducibility in Computational and Experimental Mathematics.

Short link: http://icerm.brown.edu/tw12-5-rcem-wiki.php, http://is.gd/RRlinks, or http://goo.gl/QbDOx.

Workshop Report: http://stodden.net/icerm_report.pdf

Contents

Materials from the ICERM Workshop

See also the abstracts posted on the workshop page... click on "Schedule and Supporting Material".

Thought Pieces Submitted by Participants

  • Randy LeVeque, Top Ten Reasons to Not Share Your Code (and why you should anyway). link
  • Nicolas Limare, Running a Reproducible Research Journal, with Source Code Inside.link
  • Sébastien Li-Thiao-Té, Literate Research versus Reproducible Research. link
  • Ursula Martin, The social machine of mathematics. link
  • Fernando Perez, Reproducible software vs. reproducible research. link
  • Todd Rosenquist and Shane Story, Using the Intel Math Kernal Library and Intel Compilers to obtain Numerical Run-to-run Reproducible Results. link original source
  • Anthony Scopatz, Passive Reproducibility: It’s Not You, It’s Me. link
  • Benjamin Seibold, Making reproducible computational research a reasonable choice for young faculty on tenure track. link

Slides from 5-Minute Lightning Talks

Wednesday

  • Noah Clemons, "How to Enforce Reproducibility with your Existing MKL Code" .pptx
  • Neil Chue Hong, "The Foundations of Digital Research" .pdf
  • David Ketcheson, online demo link
  • Nicolas Limare, "My Christmas List for Reproducibility" .pdf
  • Sebastien Li-Thiao-Te, "Lepton : Literate Executable Papers" .pdf
  • Benjamin Seibold, .pdf
  • Matthias Troyer, "Publishing executable papers" .pdf
  • Yihue Xie, "knitr: Starting From Reproducible Homework" .pdf

Thursday

  • Lorena Barba, "Reproducibility PI Manifesto" .pdf figshare
  • Adam Asare, "ITN TrialShare: Promoting reproducible research and transparency in clinical trials" .pptx
  • Sara Billey, ""Canonical Representations of Theorems" .pptx
  • Sarah Michalek, "Silent Data Corruption and Other Anomalies" .pdf
  • Ian Mitchell, "Reproducibility(?) Review Proposal" .pdf
  • Geoffrey Oxberry, "Towards Turnkey Reproducibility" .pdf
  • Bob Robey, "Enhanced Precision Sums for Parallel Computing Reproducibility" .pdf
  • Michael Rubenstein, "The role of computation and data in my number theoretic work" .pdf
  • Fernando Seabra Chirigati, .pptx

Breakout Group Summary Slides

Wednesday

  • Funding Policy Group .pdf
  • Journals/Publication Policy Group .pptx
  • Numerical Reproducibility Group .pptx

Thursday

  • Ontology and V&V Group .pptx
  • Rewards/Culture Group .pptx
  • Teaching Reproducibility Group link

Final Report

To appear.

References and Links Collected

Previous Workshops and Roundtables on Reproducible Research

Why Reproducibility is an Issue

Examples where Lack of Reproducibility Causes Problems

Notions of Reproducibility

A variety of terminology is used in connection with reproducible research. The Final Report contains as section on Terminology and below are some links related to some of these terms.

Reproducible/Replicabale/Auditable Research

Verification and Validation (V&V)

  • Example paper following V&V: William J. Rider and Douglas B. Kothe, Reconstructing Volume Tracking, Journal of Computational Physics, Volume 141, Issue 2, 10 April 1998, Pages 112-152.

Uncertainty Quantification

Quantify the uncertainty in a computation Wikipedia

Identical Code Output

Code Archival

Policies on Data and Code Sharing

Funding Agency Policies

Journal Policies

Legal Issues and Frameworks

Licenses and copyright, citation

  • [Stodden09] "The Legal Framework for Reproducible Research in the Sciences: Licensing and Copyright", IEEE Computing in Science and Engineering, 11(1), January 2009, p.35-40.

  • [Stodden12] "Software Patents as a Barrier to Scientific Transparency: An Unexpected Consequence of Bayh-Dole," With I. Reich, The Seventh Annual Conference on Empirical Legal Studies (CELS 2012), Stanford, CA. Nov, 2012.
  • [Hodges2011] Wilfrid Hodges, “CEIC Copyright Recommendations: What do You Want from Your Publisher?”, link

Open Science

  • Example of social math link

Archiving

Influencing Policy and Changing the Culture

  • Science Code Manifesto link
  • [Patterson1999] David Patterson, Lawrence Snyder and Jeffrey Ullman, “Evaluating Computer Scientists and Engineers For Promotion and Tenure,” August, 1999, link.

Tools and Technologies

Version Control

Some version control systems (VCS) include:

Some public hosting cites for VCS repositories include

Workflow Management Systems

  • D. Koop, E. Santos, P. Mates, H. Vo, P. Bonnet, B. Bauer, B. Surer, M. Troyer, D. Williams, J. Tohline, J. Freire and C. Silva, A Provenance-Based Infrastructure to Support the Life Cycle of Executable Papers], In Proceedings of the International Conference on Computational Science, 2011. link

Literate Programming Tools

Some literate programming tools include:

Notebooks/Publishing Tools

Some notebook/publishing tools include:

Tools that capture and preserve a software environment

Package code along with complete environment (OS, compilers, graphics tools, etc)

Cloud Computing

Web platforms for running code

Integrated tools for version control and collaboration

Interactive theorem proving

Tools that can aid in reproducible research

These tools may be useful in conducting reproducible research.

  • Matlab function that provides information about the CPU and operating system link

Numerical Reproducibility

  • [Bailey2012] David H. Bailey, Roberto Barrio, and Jonathan M. Borwein, “High precision computation: Mathematical physics and dynamics,” Applied Mathematics and Computation, vol. 218 (2012), pg. 10106-10121.
  • [Bailey1992] David H. Bailey, “Misleading performance reporting in the supercomputing field,” Scientific Programming, vol. 1 (Winter 1992), pg. 141-151.

Parallel Computing Issues

  • [Borkar2012] Borkar, S. (2012) “Exascale Challenges, Why Resiliency?” talk presented at the Inter-Agency Workshop on HPC Resilience at Extreme Scale, Feburary 21, 2012.
  • [Constantinescu2000] Constantinescu, C. (2000) “Teraflops supercomputer: Architecture and validation of the fault tolerance mechanisms” IEEE Transactions on Computers 49:886-894.
  • [Kola2005] Kola, G., Kosar, T. and M. Livey (2005) “Faults in large distributed systems and what we can do about them” Proceedings of the 11th European Conference on Parallel Processing (Euro-Par 2005).
  • [Robey2011] Robey, R., Robey, J., and Aulwes, R., “In Search of Numerical Consistency in Parallel Computing”, Vol. 37, Issue 1, Jan 2011
  • [TowardsExascaleResilience2009] Franck Cappello, et. al, “Towards Exascale Resilience”, International Journal of High Performance Computing Applications, Vol 23, Issue 4, Nov 2009, pp 374-388.

Silent Data Corruption

  • [Autran2010] Autran, JL, Munteanu, D., Roche, P. , Gasiot, G., Martinie, S., Uznanski, S., Sauze, S., Semikh, S., Yakushev, E., Rozov, S. et al. (2010) “Soft-errors induced by terrestrial neutrons and natural alpha-particle emitters in advanced memory circuits at ground level” Microelectronics Reliability 50: 1822-1831.
  • [Li2010] Li, X., Huang, M.C.. Shen, K. and L. Chu (2010) “A realistic evaluation of memory hardware errors and software system susceptibility” Proceedings of the 2010 USENIX conference on USENIX annual technical conference.
  • [Michalak2012] Sarah Michalak, Andrew DuBois, Curtis Storlie, Heather Quinn, William Rust, David DuBois, David Modl, Andrea Manuzzato and Sean Blanchard (2012) ``Assessment of the Impact of Cosmic-Ray-Induced Neutrons on Hardware in the Roadrunner Supercomputer, IEEE Transactions on Device and Materials Reliability 12:2, 445-454.
  • [Constantinescu2005] Constantinescu, C. (2005) “Dependability Benchmarking Using Environmental Test Tools,” Proceedings of the 2005 Reliability and Maintainability Symposium 567-571.

Experimental Mathematics

  • The Computer As Crucible: An Introduction to Experimental Mathematics, Jonathan Borwein and Keith Devlin. link
  • [Borwein2008] Jonathan M. Borwein and David H. Bailey, Mathematics by Experiment: Plausible Reasoning in the 21st Century, A K Peters, Natick, MA, 2008.

Education, Courses, and Training

Regular courses teaching some aspects of reproducibility

Short courses and summer schools

On-line tutorials and other sources


Other Readings and Publications

  • N. Barnes, Publish your computer code: it is good enough, Nature 467 (2010) p. 753. link
  • Z. Merali, Computational science: ...Error Why scientific programming does not compute. Nature 467(2010), pp. 775-777. link
  • K. A. Baggerly and D. A. Berry, Reproducible Research, AMSTAT NEWS, Jan. 1, 2011 link
  • A. Jogalekar, Computational research in the era of open access: Standards and best practices, Scientific American (2013) link
  • A. Morin, J. Urban, P.D. Adams, I. Foster, A. Sali, D. Baker, and P. Sliz, Shining Light into Black Boxes, Science 336 (2012) link.
  • S. Fomel and J. Claerbout, "Reproducible Research", Guest Editors' Introduction to a Special Issue of CiSE. link
  • Philippe Bonnet, et al., Repeatability and workability evaluation of SIGMOD 2011, SIGMOD Record, 40, Issue 2 (June 2011), pp. 45-48.
  • J. J. Quirk, Computational Science "Same old silence, same old mistakes", something more is needed. link
  • WaveLab, reproducible research in wavelets;
  • SparseLab, reproducible research in sparse modeling and compressed sensing;
  • [King2006] King, G. “Publication, Publication”. PS: Political Science and Politics, Vol. XXXIX, No. 1 (January, 2006), 119-125
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox