4 research outputs found

    ppx : programmatic access to proteomics data repositories

    The volume of proteomics and mass spectrometry data available in public repositories continues to grow at a rapid pace as more researchers embrace open science practices. Open access to the data behind scientific discoveries has become critical to validate published findings and develop new computational tools. Here, we present ppx, a Python package that provides easy, programmatic access to the data stored in ProteomeXchange repositories, such as PRIDE and MassIVE. The ppx package can be used either as a command-line tool or as a Python package to retrieve the files and metadata associated with a project when provided its identifier. To demonstrate how ppx enhances reproducible research, we used ppx within a Snakemake workflow to reanalyze a published dataset with the open modification search tool ANN-SoLo and compared our reanalysis to the original results. We show that ppx readily integrates into workflows and that our reanalysis produced results consistent with the original analysis. We envision that ppx will be a valuable tool for creating reproducible analyses, providing tool developers easy access to data for development, testing, and benchmarking, and enabling the use of mass spectrometry data in data-intensive analyses. The ppx package is freely available and open source under the MIT license at: https://github.com/wfondrie/pp
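    The abstract notes that ppx retrieves a project's files and metadata when given its ProteomeXchange identifier. Using the package itself requires network access to the repositories, so the offline sketch below instead illustrates the identifier dispatch such a tool needs: a hypothetical helper (not ppx's actual code) that routes an accession to its hosting repository based on its prefix.

    ```python
    # Hypothetical sketch: route a ProteomeXchange accession to the repository
    # that hosts it, mirroring the dispatch a tool like ppx must perform before
    # it can fetch files or metadata. Prefixes: PXD -> PRIDE, MSV -> MassIVE.

    def route_identifier(identifier: str) -> str:
        """Return the repository name for a ProteomeXchange project accession."""
        ident = identifier.strip().upper()
        if ident.startswith("PXD"):
            return "PRIDE"
        if ident.startswith("MSV"):
            return "MassIVE"
        raise ValueError(f"Unrecognized ProteomeXchange identifier: {identifier!r}")

    print(route_identifier("PXD000001"))   # -> PRIDE
    print(route_identifier("MSV000087408"))  # -> MassIVE
    ```

    In the real package, the equivalent entry point resolves the accession and returns a project object whose files can then be listed and downloaded; the accession strings above are illustrative of the format, not references to specific datasets.
    
    
    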

    Evaluating Proteomics Imputation Methods with Improved Criteria

    Quantitative measurements produced by tandem mass spectrometry proteomics experiments typically contain a large proportion of missing values. Missing values hinder reproducibility, reduce statistical power, and make it difficult to compare across samples or experiments. Although many methods exist for imputing missing values, in practice, the most commonly used methods are among the worst performing. Furthermore, previous benchmarking studies have focused on relatively simple measurements of error, such as the mean-squared error between imputed and held-out values. Here we evaluate the performance of commonly used imputation methods using three practical, “downstream-centric” criteria. These criteria measure the ability to identify differentially expressed peptides, generate new quantitative peptides, and improve the peptide lower limit of quantification. Our evaluation comprises several experiment types and acquisition strategies, including data-dependent and data-independent acquisition. We find that imputation does not necessarily improve the ability to identify differentially expressed peptides, but that it can identify new quantitative peptides and improve the peptide lower limit of quantification. We find that MissForest is generally the best-performing method per our downstream-centric criteria. We also argue that existing imputation methods do not properly account for the variance of peptide quantifications, and we highlight the need for methods that do.
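    One of the criteria above is whether imputation can "generate new quantitative peptides": a peptide with some missing intensities can become fully quantified after imputation, while a peptide never observed cannot be rescued. The sketch below illustrates this with per-peptide mean imputation, a deliberately simple baseline of the kind the abstract criticizes (it is not MissForest); the matrix and values are invented for illustration, with `None` marking a missing intensity.

    ```python
    # Hypothetical sketch: per-peptide (row-wise) mean imputation on a tiny
    # quantification matrix. Rows are peptides, columns are runs, and None
    # marks a missing intensity. A partially observed peptide becomes fully
    # quantified; a never-observed peptide stays missing.

    def impute_row_mean(matrix):
        """Fill each row's missing values with the mean of its observed values."""
        imputed = []
        for row in matrix:
            observed = [v for v in row if v is not None]
            if not observed:
                imputed.append(row[:])  # nothing observed: cannot impute
                continue
            mean = sum(observed) / len(observed)
            imputed.append([mean if v is None else v for v in row])
        return imputed

    peptides = [
        [10.2, None, 9.8, 10.0],   # one missing run: becomes fully quantified
        [None, None, None, None],  # never observed: cannot be rescued
    ]
    print(impute_row_mean(peptides))
    ```

    Mean imputation also shrinks every filled-in value toward the row average, which connects to the abstract's closing point: simple methods do not properly account for the variance of peptide quantifications.
    
    
    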