Search CORE

A Perl procedure for protein identification by Peptide Mass Fingerprinting

Author: Barbarini Nicola
Magni Paolo
Rusconi Luisa
Tiengo Alessandra
Troiani Sonia
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background One of the topics of major interest in proteomics is protein identification. Protein identification can be achieved by analyzing the mass spectrum of a protein sample through different approaches. One of them, called Peptide Mass Fingerprinting (PMF), combines mass spectrometry (MS) data with searching strategies in a suitable database of known protein to provide a list of candidate proteins ranked by a score. To this aim, several algorithms and software tools have been proposed. However, the scoring methods and mainly the statistical evaluation of the results can be significantly improved. Results In this work, a Perl procedure for protein identification by PMF, called MsPI (Mass spectrometry Protein Identification), is presented. The implemented scoring methods were derived from the literature. MsPI implements a strategy to remove the contaminant masses present in the acquired spectra. Moreover, MsPI includes a statistical method to assign to each candidate protein, in addition to the scoring value, a p-value. Results obtained by MsPI on a dataset of 10 protein samples were compared with those achieved using two other software tools, i.e. Piums and Mascot. Piums implements one of the scoring methods available in MsPI, while Mascot is one of the most frequently used software tools in the protein identification field. MsPI scripts are available for downloading on the web site <url>http://aimed11.unipv.it/MsPI</url>. Conclusion The performances of MsPI seem to be better than those of Piums and Mascot. In fact, on the considered dataset, MsPI includes in its candidate proteins list, the "true" proteins nine times over ten, whereas Piums includes in its list the "true" proteins only four time over ten. Even if Mascot also correctly includes in the candidates list the "true" proteins nine times over ten, it provides longer candidate lists, therefore increasing the number of false positives when the molecular weight of the proteins in the sample is approximatively known (e.g. by the 1-D/2-D electrophoresis gel). Moreover, being MsPI a Perl tool, it can be easily extended and customized by the final users.</p

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

Springer - Publisher Connector

A procedure to decompose high resolution mass spectra

Author: Nicola Barbarini
Paolo Magni
Riccardo Bellazzi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Novel topological descriptors for analyzing biological networks

Author: Armin A Graber
Dehmer Matthias M
Kurt K Varmuza
Matthias M Dehmer
Nicola N Barbarini
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Topological descriptors, other graph measures, and in a broader sense, graph-theoretical methods, have been proven as powerful tools to perform biological network analysis. However, the majority of the developed descriptors and graph-theoretical methods does not have the ability to take vertex- and edge-labels into account, e.g., atom- and bond-types when considering molecular graphs. Indeed, this feature is important to characterize biological networks more meaningfully instead of only considering pure topological information. Results In this paper, we put the emphasis on analyzing a special type of biological networks, namely bio-chemical structures. First, we derive entropic measures to calculate the information content of vertex- and edge-labeled graphs and investigate some useful properties thereof. Second, we apply the mentioned measures combined with other well-known descriptors to supervised machine learning methods for predicting Ames mutagenicity. Moreover, we investigate the influence of our topological descriptors - measures for only unlabeled vs. measures for labeled graphs - on the prediction performance of the underlying graph classification problem. Conclusions Our study demonstrates that the application of entropic measures to molecules representing graphs is useful to characterize such structures meaningfully. For instance, we have found that if one extends the measures for determining the structural information content of unlabeled graphs to labeled graphs, the uniqueness of the resulting indices is higher. Because measures to structurally characterize labeled graphs are clearly underrepresented so far, the further development of such methods might be valuable and fruitful for solving problems within biological network analysis.</p

CiteSeerX

Springer

Automatic Data Transfer from OMOP-CDM to REDCap: A Semantically-Enriched Framework.

Author: Anna Alloni
Emanuele Girani
Lucia Sacchi
Matteo Gabetta
Morena Stuppia
Nicola Barbarini
Publication venue
Publication date: 18/11/2021
Field of study

Development of a FHIR Layer on Top of the OMOP Common Data Model for the CAPABLE Project.

Author: Anna Alloni
Enea Parimbelli
Francesca Polce
Giordano Lanzola
Matteo Gabetta
Nicola Barbarini
Publication venue
Publication date: 18/11/2021
Field of study

i2b2 to Optimize Patients Enrollment.

Author: Alberto Zambelli
Antonio Bellasi
Cristiana Larizza
Lorenzo Chiudinelli
Matteo Gabetta
Mauro Bucalo
Nicola Barbarini
Publication venue
Publication date: 27/05/2021
Field of study

i2b2 data-warehouse could be a useful tool to support the enrollment phase of clinical studies. The aim of this work is to evaluate its performance on two clinical trials. We developed also an i2b2 extension to help in suggesting eligible patients for a study. The work showed good results in terms of ability to implement inclusion/exclusion criteria, but also in terms of identified patients actually enrolled and high number of patients suggested as potentially enrollable

From EHR to EDC - The Experience at the Policlinico Hospital in Milan.

Author: Alberto Zanella
Amedeo Guzzardella
Angelo Caroli
Eleonora Ferretti
Giacomo Grasselli
Mauro Bucalo
Nicola Barbarini
Sara Pizzimenti
Silvano Bosari
Publication venue
Publication date: 18/11/2021
Field of study

A computational method for designing diverse linear epitopes including citrullinated peptides with desired binding affinities to intravenous immunoglobulin

Author: Alessandra Tiengo
B Yao
B Yao
BG Pierce
Bjoern Ziems
C Lundegaard
C Meydan
C Peri
C Yanover
Carl Kingsford
CH Teo
Felix Steinbeck
GL Zhang
GL Zhang
Gustavo Stolovitzky
Hans-Jürgen Thiesen
J Chen
J Gao
J Kittler
JE Larsen
Julio Saez-Rodriguez
JV Kringelum
JW Ponder
K Newton
L Huang
L Nanni
L Nanni
LJ Wee
M Hecker
M Luštrek
M Nielsen
M Ojala
Mitja Luštrek
N Barbarini
Nicola Barbarini
NP Boghossian
P Pudil
P Wang
Peter Lorenz
R Chen
Raquel Norel
Riccardo Bellazzi
Rob Patro
Robert J. Prill
S Gupta
S Henikoff
S Kawashima
S Saha
SY-H Lin
V Brusic
W Zhang
X Hu
Y EL-Manzalawy
Y Wang
Y Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Accurate peak list extraction from proteomic mass spectra for identification and profiling studies

Author: A Mehta
A Savitzky
AG Marshall
AL Rockwood
BY Renard
C Morris
D Valkenborg
DM Horn
DN Perkins
E Frank
EF Petricoin
EFI Petricoin
EJ Breen
ET Fung
F Hillenkamp
G Cagney
HW Ressom
J Pesavento
JA Falkner
JB Fenn
JL Margrave
JR Yates
JS Yu
K Noy
KR Coombes
L Andrade
L Chen
LA Liotta
M Hilario
MA Hall
MR Hoopmann
MW Senko
Nicola Barbarini
P Du
P Du
P Hernandez
P James
Paolo Magni
Q Hu
R Aebersold
RA Zubarev
RC Holte
S Bocker
SK Sze
TP Conrads
W Meuleman
WS Cleveland
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Mass spectrometry is an essential technique in proteomics both to identify the proteins of a biological sample and to compare proteomic profiles of different samples. In both cases, the main phase of the data analysis is the procedure to extract the significant features from a mass spectrum. Its final output is the so-called peak list which contains the mass, the charge and the intensity of every detected biomolecule. The main steps of the peak list extraction procedure are usually preprocessing, peak detection, peak selection, charge determination and monoisotoping operation. Results This paper describes an original algorithm for peak list extraction from low and high resolution mass spectra. It has been developed principally to improve the precision of peak extraction in comparison to other reference algorithms. It contains many innovative features among which a sophisticated method for managing the overlapping isotopic distributions. Conclusions The performances of the basic version of the algorithm and of its optional functionalities have been evaluated in this paper on both SELDI-TOF, MALDI-TOF and ESI-FTICR ECD mass spectra. Executable files of MassSpec, a MATLAB implementation of the peak list extraction procedure for Windows and Linux systems, can be downloaded free of charge for nonprofit institutions from the following web site: <url>http://aimed11.unipv.it/MassSpec</url></p

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

Springer - Publisher Connector