Updates in metabolomics tools and resources: 2014-2015

Author: Misra Biswapriya B.
van der Hooft Justin
Publication venue: 'Wiley'
Publication date: 01/01/2016

Data processing and interpretation represent the most challenging and time-consuming steps in high-throughput metabolomic experiments, regardless of the analytical platform (MS or NMR spectroscopy based) used for data acquisition. Improved machinery in metabolomics generates increasingly complex datasets that create the need for more and better processing and analysis software and in silico approaches to understand the resulting data. However, a comprehensive source of information describing the utility of the most recently developed and released metabolomics resources (tools, software, and databases) is currently lacking. Thus, here we provide an overview of freely available and open-source tools, algorithms, and frameworks to make both upcoming and established metabolomics researchers aware of recent developments, in an attempt to advance and facilitate data processing workflows in their metabolomics research. The major topics include tools and resources for data processing, data annotation, and data visualization in MS- and NMR-based metabolomics. Most tools described in this review are dedicated to untargeted metabolomics workflows; however, some more specialist tools are described as well. All tools and resources described, including their analytical and computational platform dependencies, are summarized in an overview table.

Enlighten

The online Tabloid Proteome : an annotated database of protein associations

Author: Gupta Surya
Martens Lennart
Tavernier Jan
Turan Demet
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018

A complete knowledge of the proteome can only be attained by determining the associations between proteins, along with the nature of these associations (e.g. physical contact in protein–protein interactions, participation in complex formation or different roles in the same pathway). Despite extensive efforts in elucidating direct protein interactions, our knowledge of the complete spectrum of protein associations remains limited. We therefore developed a new approach that detects protein associations from identifications obtained after re-processing of large-scale, public mass spectrometry-based proteomics data. Our approach infers protein associations based on the co-occurrence of proteins across many different proteomics experiments, and provides information that is almost completely complementary to traditional direct protein interaction studies. Here we present a web interface to query and explore the associations derived from this method, called the online Tabloid Proteome. The online Tabloid Proteome also integrates biological knowledge from several existing resources to annotate our derived protein associations. The online Tabloid Proteome is freely available through a user-friendly web interface, which provides intuitive navigation and data exploration options for the user at http://iomics.ugent.be/tabloidproteome
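
As a rough illustration of the co-occurrence idea in the abstract above, the sketch below scores protein pairs by how often they are identified in the same experiments. The Jaccard index, the `min_shared` threshold, the function name, and the accessions are all assumptions made for illustration; the abstract does not state which statistic the authors actually use.

```python
from itertools import combinations


def cooccurrence_scores(experiments, min_shared=2):
    """Score protein pairs by the Jaccard index of the experiments they share.

    Assumption: Jaccard similarity stands in for whatever association
    measure the original method uses, which the abstract does not name.
    """
    # Map each protein to the set of experiment indices where it was identified.
    seen_in = {}
    for idx, proteins in enumerate(experiments):
        for protein in set(proteins):
            seen_in.setdefault(protein, set()).add(idx)

    scores = {}
    for a, b in combinations(sorted(seen_in), 2):
        shared = seen_in[a] & seen_in[b]
        # Skip pairs that co-occur too rarely to be informative.
        if len(shared) < min_shared:
            continue
        union = seen_in[a] | seen_in[b]
        scores[(a, b)] = len(shared) / len(union)
    return scores


# Hypothetical identifications from four experiments (accessions are made up).
experiments = [
    ["P1", "P2", "P3"],
    ["P1", "P2"],
    ["P2", "P3"],
    ["P1", "P2", "P4"],
]
scores = cooccurrence_scores(experiments)
```

Requiring a minimum number of shared experiments is one simple way to keep pairs seen together only once from receiving a high score; a real analysis over thousands of experiments would likely need a statistically grounded cutoff.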

Ghent University Academic Bibliography

A golden age for working with public proteomics data

Author: Martens Lennart
Vizcaíno Juan Antonio
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017

Data sharing in mass spectrometry (MS)-based proteomics is becoming a common scientific practice, as is already the case in other, more mature 'omics' disciplines like genomics and transcriptomics. We want to highlight that this situation, unprecedented in the field, opens a plethora of opportunities for data scientists. First, we describe in some detail the work already achieved, such as systematic reanalysis efforts. We also explain existing applications of public proteomics data, such as proteogenomics and the creation of spectral libraries and spectral archives. Finally, we discuss the main existing challenges and mention the first attempts to combine public proteomics data with other types of omics data sets.

Ghent University Academic Bibliography

Resilience in the proteomics data ecosystem: how the field cares for its data

Author: Martens Lennart
Publication venue: 'Wiley'
Publication date: 01/01/2013

The public dissemination of data is an integral part of the life sciences. In the field of proteomics too, data sharing has taken off over the last few years, with the first downstream uses of these data quickly gaining prominence. At the same time, the recent unfortunate demise of two repositories, NCBI Peptidome and ProteomeCommons Tranche, has shown the frailty of such data gathering efforts. Heroic efforts by the PRIDE and Peptidome teams to rescue the Peptidome data have now ensured their continued availability to the field, and alternatives have already been put in place for Tranche. But with public data increasingly at the hub of the life sciences, it is a good time to look at the proteomics data ecosystem in some more detail.

Ghent University Academic Bibliography

The PRIDE database and related tools and resources in 2019: improving support for quantification data

Author: Audain E.
Bai J.
Bernal-Llinares M.
Brazma A.
Cox J.
Csordas A.
Eisenacher M.
Griss J.
Hewapathirana S.
Inuganti A.
Jarnuczak A.
Kundu D.
Mayer G.
Perez E.
Perez-Riverol Y.
Pfeuffer J.
Sachsenberg T.
Ternent T.
Tiwary S.
Uszkoreit J.
Vizcaino J.
Walzer M.
Yilmaz S.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019

The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault-tolerant storage backend, Application Programming Interface and web interface have been implemented, as part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. Finally, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

MPG.PuRe

ProteoClade: A taxonomic toolkit for multi-species and metaproteomic analysis

Author: Held Jason M
Mooradian Arshag D
Naegle Kristen M
van der Post Sjoerd
Publication venue: Digital Commons@Becker
Publication date: 01/03/2020

We present ProteoClade, a Python toolkit that performs taxa-specific peptide assignment, protein inference, and quantitation for multi-species proteomics experiments. ProteoClade scales to hundreds of millions of protein sequences, requires minimal computational resources, and is open source, multi-platform, and accessible to non-programmers. We demonstrate its utility for processing quantitative proteomic data derived from patient-derived xenografts, and show that its speed and scalability enable a novel de novo proteomic workflow for complex microbiota samples.

Directory of Open Access Journals

Digital Commons@Becker

Data access and integration in the ISPIDER proteomics grid

Author: C.A. Goble
E.M. Zdobnov
J. Smith
L.M. Haas
M. Antonioletti
M. Maibaum
P. Buneman
P. McBrien
R.G.G. Cattell
S. Bowers
S. Durinck
S.B. Davidson
T.M. Oinn
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006

Grid computing has great potential for supporting the integration of complex, fast-changing biological data repositories to enable distributed data analysis. One scenario where Grid computing has such potential is provided by proteomics resources, which are rapidly being developed with the emergence of affordable, reliable methods to study the proteome. The protein identifications arising from these methods derive from multiple repositories, which need to be integrated to enable uniform access to them. A number of technologies exist which enable these resources to be accessed in a Grid environment, but the independent development of these resources means that significant data integration challenges, such as heterogeneity and schema evolution, have to be met. This paper presents an architecture which supports the combined use of Grid data access (OGSA-DAI), Grid distributed querying (OGSA-DQP) and data integration (AutoMed) software tools to support distributed data analysis. We discuss the application of this architecture for the integration of several autonomous proteomics data resources.

CiteSeerX

Crossref

Birkbeck Institutional Research Online

The University of Manchester - Institutional Repository

Institute of Clinical and Translational Sciences News, Vol. 2, Issue 9

Publication venue: Digital Commons@Becker
Publication date: 01/01/2010

Digital Commons@Becker

The impact of sequence database choice on metaproteomic results in gut microbiota studies

Author: Addis Maria Filippa
Deligios Massimo
Fraumene Cristina
Manghina Valeria
Martens Lennart
Muth Thilo
Pagnozzi Daniela
Palomba Antonio
Rapp Erdmann
Tanca Alessandro
Uzzau Sergio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Background: Elucidating the role of gut microbiota in physiological and pathological processes has recently emerged as a key research aim in life sciences. In this respect, metaproteomics, the study of the whole protein complement of a microbial community, can provide a unique contribution by revealing which functions are actually being expressed by specific microbial taxa. However, its wide application to gut microbiota research has been hindered by challenges in data analysis, especially related to the choice of the proper sequence databases for protein identification. Results: Here, we present a systematic investigation of variables concerning database construction and annotation and evaluate their impact on human and mouse gut metaproteomic results. We found that both publicly available and experimental metagenomic databases lead to the identification of unique peptide assortments, suggesting parallel database searches as a means of gaining more complete information. In particular, the contribution of experimental metagenomic databases proved essential when dealing with mouse samples. Moreover, the use of a "merged" database, containing all metagenomic sequences from the population under study, was found to be generally preferable to the use of sample-matched databases. We also observed that taxonomic and functional results are strongly database-dependent, in particular when analyzing the mouse gut microbiota. As a striking example, the Firmicutes/Bacteroidetes ratio varied up to tenfold depending on the database used. Finally, assembling reads into longer contigs provided significant advantages in terms of functional annotation yields. Conclusions: This study helps identify host- and database-specific biases which need to be taken into account in a metaproteomic experiment, providing meaningful insights on how to design gut microbiota studies and to perform metaproteomic data analysis. In particular, the use of multiple databases and annotation tools should be encouraged, even though this requires appropriate bioinformatic resources.

AIR Universita degli studi di Milano

Ghent University Academic Bibliography

PubMed Central

MPG.PuRe

Waveomics: bringing experimental data to online collaboration

Author: Neil Swainston
Publication date: 20/08/2010

Systems biology offers an interdisciplinary approach to scientific research that typically involves the collaboration of teams of experimentalists and mathematical modellers. While the importance of data standards has been recognised in facilitating exchange of data between the parties, challenges still remain regarding the practicalities of disseminating experimental data.

The introduction of novel web-based tools aimed at promoting collaborative work has provided a platform upon which scientific applications can be built. The recently released Google Wave protocol provides a facility for real-time collaboration between teams of researchers.

This work introduces a customized Robot that automatically scans text in Google Waves for experimental data identifiers, extracts the corresponding experimental data from the remote resources associated with those identifiers, and appends charts showing these data to the Wave.
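
The first step the abstract describes, scanning text for experimental data identifiers, could be sketched roughly as follows. The `EXP:<digits>` identifier format and the function name are hypothetical, since the abstract does not specify what the identifiers look like; fetching the remote data and drawing charts are left out.

```python
import re

# Hypothetical identifier format; the abstract does not define the real one.
IDENTIFIER_PATTERN = re.compile(r"\bEXP:\d+\b")


def find_identifiers(text):
    """Return the unique identifiers in text, in order of first appearance."""
    found = []
    for match in IDENTIFIER_PATTERN.finditer(text):
        identifier = match.group(0)
        if identifier not in found:
            found.append(identifier)
    return found


message = "Growth curves are in EXP:101; compare with EXP:202 and EXP:101."
identifiers = find_identifiers(message)
```

De-duplicating while preserving order means each dataset would be looked up only once per message, in the order a reader encounters it.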

Nature Precedings