Search CORE

5 research outputs found

A proteomics sample metadata representation for multiomics integration and big data analysis

The amount of public proteomics data is rapidly increasing but there is no standardized format to describe the sample metadata and their relationship with the dataset files in a way that fully supports their understanding or reanalysis. Here we propose to develop the transcriptomics data format MAGE-TAB into a standard representation for proteomics sample metadata. We implement MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets. We also describe tools and libraries to validate and submit sample metadata-related information to the PRIDE repository. We expect that these developments will improve the reproducibility and facilitate the reanalysis and integration of public proteomics datasets.publishedVersio

Bergen Open Research Archive (Univ. of Bergen)

Ghent University Academic Bibliography

Copenhagen University Research Information System

Providence St. Joseph Health Digital Commons

NORA - Norwegian Open Research Archives

A proteomics sample metadata representation for multiomics integration and big data analysis

Author: Dai Chengxin
Deng Jingwen
Fexova Silvie
Füllgrabe Anja
George Nancy
Kamatchinathan Selvakumar
Kundu Deepti Jaiswal
Moreno Pablo
Pfeuffer Julianus
Solovyeva Elizaveta M.
Publication venue
Publication date: 01/01/2021
Field of study

Institutional Repository of the Freie Universität Berlin

The ProteomeXchange consortium at 10 years: 2023 update.

Author: Bandeira Nuno
Bandla Chakradhar
Carver Jeremy J
Deutsch Eric W
Hewapathirana Suresh
Ishihama Yasushi
Kamatchinathan Selvakumar
Kawano Shin
Kundu Deepti J
MacCoss Michael J
MacLean Brendan
Mendoza Luis
Okuda Shujiro
Perez-Riverol Yasset
Pullman Benjamin S
Sharma Vagisha
Sun Zhi
Vizcaíno Juan Antonio
Wang Shengbo
Watanabe Yu
Wertz Julie
Zhu Yunping
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 12/11/2022
Field of study

Mass spectrometry (MS) is by far the most used experimental approach in high-throughput proteomics. The ProteomeXchange (PX) consortium of proteomics resources (http://www.proteomexchange.org) was originally set up to standardize data submission and dissemination of public MS proteomics data. It is now 10 years since the initial data workflow was implemented. In this manuscript, we describe the main developments in PX since the previous update manuscript in Nucleic Acids Research was published in 2020. The six members of the Consortium are PRIDE, PeptideAtlas (including PASSEL), MassIVE, jPOST, iProX and Panorama Public. We report the current data submission statistics, showcasing that the number of datasets submitted to PX resources has continued to increase every year. As of June 2022, more than 34 233 datasets had been submitted to PX resources, and from those, 20 062 (58.6%) just in the last three years. We also report the development of the Universal Spectrum Identifiers and the improvements in capturing the experimental metadata annotations. In parallel, we highlight that data re-use activities of public datasets continue to increase, enabling connections between PX resources and other popular bioinformatics resources, novel research and also new data resources. Finally, we summarise the current state-of-the-art in data management practices for sensitive human (clinical) proteomics data

Providence St. Joseph Health Digital Commons

A proteomics sample metadata representation for multiomics integration and big data analysis

NORA - Norwegian Open Research Archives