The aim of Reproducible Bioinformatics project is the creation of easy to use Bioinformatics workflows that fullfill the following roles (Sandve et al. PLoS Comp Biol. 2013):
- For Every Result, Keep Track of How It Was Produced
- Avoid Manual Data Manipulation Steps
- Archive the Exact Versions of All External Programs Used
- Version Control All Custom Scripts
- Record All Intermediate Results, When Possible in Standardized Formats
- For Analyses That Include Randomness, Note Underlying Random Seeds
- Always Store Raw Data behind Plots
- Generate Hierarchical Analysis Output, Allowing Layers of Increasing Detail to Be Inspected
- Connect Textual Statements to Underlying Results
- Provide Public Access to Scripts, Runs, and Results
The paper on the SeqBox project is on Bioinformatics (Beccuti et al. 2018).
The paper on the Reproducible Bioinformatics project is on BMC Bioinformatics (Kulkarni et al. 2018).
The paper on rCASC: reproducible classification analysis of single-cell sequencing data is on GigaScience (Alessandri et al. 2019)
A new tool for reproducibility is now available at https://github.com/alessandriLuca/dockerFileGeneratorGUI
The project is supported by the non-profit association for Bioinformatics and Molecular Medicine (Bx2M). Bx2M focus on providing training to life science researchers in the area of omics data analysis.
In collaboration with the Italian Elixir node, Italian Society of Bioinformatics, InfoLife CINI lab, Molecular Biotechnology Center UNITO, HPC4AI UNITO, Euroclone and 10XGenomics, from 28 to 30th of September 2022 at Molecular Biotechnology Center (UNITO, Torino, Italy) B2x2M has organised a scRNA-seq data analysis hands-on workshop: https://elixir-iib-training.github.io/website/2022/09/28/scRNA-seq-Workshop.html. Registration ends on September the 10th 2022.