
Combating the Reproducibility Crisis in Computational Proteomics – Translating Proteomics Episode 15

Nautilus Biotechnology
January 23, 2025
On this episode of Translating Proteomics, co-hosts Parag Mallick and Andreas Huhmer of Nautilus Biotechnology discuss the reproducibility crisis in biology and specifically focus on how we can enhance reproducibility in computational proteomics. Key topics they cover include:
- What the reproducibility crisis is
- Factors that make it difficult to replicate multiomics research
- Steps we can take to make biology research more reproducible
Find this episode on Apple Podcasts, Spotify, or YouTube.
Chapters
00:00 – 01:20 – Introduction
01:20– 03:10 – What is reproducibility in research and why is it important?
03:10 – 05:42 – Recent work from the Mallick Lab focused on computational proteomics reproducibility
05:42 – 09:32 – Ways to help improve reproducibility in computational proteomics – More detailed documentation, moving beyond papers as our main form of documentation, and ensuring computational workflows are available
09:32 – 11:30 – Why Parag got interested reproducibility – Attempts to build AI layers on top of current workflows
11:30 – 14:00 – The need to create repositories of analytical workflows codified in a structured way that AI can learn from
14:00 – 15:24 – A role for dedicated data curators
15:24 – 18:31 – Moving beyond the idea of study endpoints and recognizing data as part of a larger whole
18:31 – 21:32 – How does AI fit into the continuous analysis and incorporation of new datasets
21:32 – 23:36 – The role of AI in helping researchers design experiments
23:36 – 27:25 – Three things we can do today to increase the reproducibility of computational proteomics experiments:
- Be clear about the stated hypothesis
- Document analyses through workflow engines and containerized workflows
- Advocate for support for funding for reproducibility and reproducibility tools
27:25 – End – Outro
Resources
- Parag’s Gilbert S. Omenn Computational Proteomics Award Lecture
In this lecture, Parag describes his vision for a more reproducible future in proteomics - Nature Special on “Challenges in irreproducible research”
A list of articles and perspective pieces discussing the “reproducibility crisis” in research - Why Most Published Research Findings Are False (Ioannidis 2005)
Article outlining many of the issues that make it difficult to reproduce research findings - Reproducibility Project: Cancer Biology
eLife initiative investigating reproducibility in preclinical cancer research - Center for Open Science Preregistration Initiative
Resources for preregistering a hypothesis as part of a study - National Institute of Standards and Technology (NIST)
US government agency that aims to be “the world’s leader in creating critical measurement solutions and promoting equitable standards.” - MSstats
Open source software for mass spec data analysis from Bioconductor - National Institute of General Medical Sciences
US government agency focused on “basic research that increases understanding of biological processes and lays the foundation for advances in disease diagnosis, treatment, and prevention.” - Chan Zuckerberg Initiative – Essential Open Source Software for Science
CZI program supporting “software maintenance, growth, development, and community engagement for critical open source tools.”
Find this episode on Apple Podcasts, Spotify, or YouTube.
MORE ARTICLES
