Search CORE

18,799 research outputs found

Biomarker discovery and redundancy reduction towards classification using a multi-factorial MALDI-TOF MS T2DM mouse model dataset

Author: A Chadt
A Colorni
A Gamez-Pozo
A Rasche
A Tiss
A Tiss
AC Sauve
AL Oberg
Alexandra Chadt
Ali Tiss
B Wu
C Bauer
C Mercier
C Yang
Celia J Smith
Chris Bauer
D Kwon
D Mantini
DB West
Dieter Beule
E Lange
EP Xing
Frank Kleinjung
G Ge
GK Smyth
H Ressom
Hadi Al-Hasani
HS Jurgens
HS Jürgens
I Guyon
J Hua
J McGuire
J Norris
J Voortman
JE Shaw
JF Timms
JL Rodgers
Johannes Schuchhardt
Johnson RAaBGK
JR Ortlepp
K Coombes
Knut Reinert
L Breiman
M Dorigo
M Kirchner
M Palmblad
M Sturm
Mark W Towers
ME de Noo
MJ Crawley
MP van der Werff
N Tiffin
O Kohlbacher
P Du
P Pratapa
P Zhang
PV Rao
Q Liu
R Aebersold
R Cramer
Rainer Cramer
RC Gentleman
Robert Gentleman and Vince Carey and Wolfgang Huber and Rafael Irizarry and Sandrine Dudoit (Ed)
SM Carlson
T Alexandrov
T Dreja
T Hastie
Tanja Dreja
W Yu
X Liu
X Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Diabetes like many diseases and biological processes is not mono-causal. On the one hand multifactorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics

Central Archive at the University of Reading

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Informed baseline subtraction of proteomic mass spectrometry data aided by a novel sliding window algorithm

Author: Bagley Christopher J.
Solomon Patty J.
Stanford Tyman E.
Publication venue
Publication date: 01/01/2016
Field of study

Proteomic matrix-assisted laser desorption/ionisation (MALDI) linear time-of-flight (TOF) mass spectrometry (MS) may be used to produce protein profiles from biological samples with the aim of discovering biomarkers for disease. However, the raw protein profiles suffer from several sources of bias or systematic variation which need to be removed via pre-processing before meaningful downstream analysis of the data can be undertaken. Baseline subtraction, an early pre-processing step that removes the non-peptide signal from the spectra, is complicated by the following: (i) each spectrum has, on average, wider peaks for peptides with higher mass-to-charge ratios (m/z), and (ii) the time-consuming and error-prone trial-and-error process for optimising the baseline subtraction input arguments. With reference to the aforementioned complications, we present an automated pipeline that includes (i) a novel `continuous' line segment algorithm that efficiently operates over data with a transformed m/z-axis to remove the relationship between peptide mass and peak width, and (ii) an input-free algorithm to estimate peak widths on the transformed m/z scale. The automated baseline subtraction method was deployed on six publicly available proteomic MS datasets using six different m/z-axis transformations. Optimality of the automated baseline subtraction pipeline was assessed quantitatively using the mean absolute scaled error (MASE) when compared to a gold-standard baseline subtracted signal. Near-optimal baseline subtraction was achieved using the automated pipeline. The advantages of the proposed pipeline include informed and data specific input arguments for baseline subtraction methods, the avoidance of time-intensive and subjective piecewise baseline subtraction, and the ability to automate baseline subtraction completely. Moreover, individual steps can be adopted as stand-alone routines.Comment: 50 pages, 19 figure

arXiv.org e-Print Archive

Adelaide Research & Scholarship

Springer - Publisher Connector

PubMed Central

Evaluation of peak-picking algorithms for protein mass spectrometry

Author: A Savitzky
C Yang
D Kwon
D Mantini
E Lange
KR Coombes
M Sturm
N Jeffries
O Kohlbacher
P Du
Q Liu
WS Cleveland
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Peak picking is an early key step in MS data analysis. We compare three commonly used approaches to peak picking and discuss their merits by means of statistical analysis. Methods investigated encompass signal-to-noise ratio, continuous wavelet transform, and a correlation-based approach using a Gaussian template. Functionality of the three methods is illustrated and discussed in a practical context using a mass spectral data set created with MALDI-TOF technology. Sensitivity and specificity are investigated using a manually defined reference set of peaks. As an additional criterion, the robustness of the three methods is assessed by a perturbation analysis and illustrated using ROC curves

Central Archive at the University of Reading

Crossref

Current challenges in software solutions for mass spectrometry-based quantitative proteomics

Author: A Alexandridou
A Gruhler
A Leitner
A Michalski
A Panchaud
A Thompson
A Wolf-Yadlin
AD Polpitiya
AHP America
AI Nesvizhskii
AI Nesvizhskii
AI Nesvizhskii
AJR Heck
Albert J. R. Heck
AM Mayampurath
B Breukelen van
B Carrillo
B Ma
B Macek
B Schwanhäusser
B Zybailov
BAP Roxas
Bas van Breukelen
BO Keller
C Christin
C Ji
C Kumar
CH Becker
CK Frese
C–C Tsou
D Chelius
D Hoof Van
D MacDougall
D Tsur
D Valkenborg
DH Lundgren
DK Han
DL Swaney
DL Tabb
DL Tabb
DM Good
DN Perkins
E Deutsch
E Qeli
EL Hendrickson
G Audi
GL Finney
H Lam
H Lam
H Liu
H Steen
H Steen
I Beer
IP Shadforth
J Cox
J Cox
J Elias
J Gouw
J Grossmann
J Klimek
J Listgarten
J Meija
J Rappsilber
J Seidler
J Zhang
JC Silva
JF Kellie
JF Timms
JV Olsen
K Flikka
K Kultima
K Podwojski
KA Neilson
KC Hansen
KL Simpson
L Martens
L Ting
LF Waanders
LK Iwai
LN Mueller
M Bantscheff
M Bantscheff
M Bern
M Junqueira
M Kohl
M Mann
M Sandin
M Senko
M Unlü
M Vandenbogaert
MA Baldwin
MA Grobei
MA Kuzyk
MC Codrea
ME Belov
ME Sardiu
MH Elliott
MJ MacCoss
MM Savitski
MW Duncan
N Colaert
N Mischerikow
N Wang
NM Griffin
OA Mirgorodskaya
P Lu
P Mallick
P Mortensen
Pedro R. Cutillas
Peter R. Baker
PJ Boersema
PL Ross
PR Baker
PR Cutillas
PR Cutillas
R Aebersold
R Clarke
R Matthiesen
R Matthiesen
R Purves
R Usaite
R Zhang
RA Bradshaw
RD Smith
RE Moore
RJ Chalkley
RJ Chalkley
RJ Jacob
S Cappadona
S Cappadona
S Carr
S Dasari
S Houel
S Julka
S Ong
S-E Ong
SA Beausoleil
SA Gerber
Salvatore Cappadona
SJ Callister
SK Park
SP Gygi
SY Ow
T Shinkawa
TM Annesley
TS Collier
TT Aye
V Faca
V Lange
VG Tusher
VP Andreev
W Weiss
W Yan
W Zhu
WM Old
WX Schulze
X Yang
Y Ishihama
Y Oda
YJ Kim
Z Khan
Z Khan
Z-Q Ma
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

This work was in part supported by the PRIME-XS project, grant agreement number 262067, funded by the European Union seventh Framework Programme; The Netherlands Proteomics Centre, embedded in The Netherlands Genomics Initiative; The Netherlands Bioinformatics Centre; and the Centre for Biomedical Genetics (to S.C., B.B. and A.J.R.H); by NIH grants NCRR RR001614 and RR019934 (to the UCSF Mass Spectrometry Facility, director: A.L. Burlingame, P.B.); and by grants from the MRC, CR-UK, BBSRC and Barts and the London Charity (to P.C.

Crossref

Springer - Publisher Connector

Queen Mary Research Online

Sparse Proteomics Analysis - A compressed sensing-based approach for feature selection and classification of high-dimensional proteomics mass spectrometry data

Author: Conrad Tim
Cvetkovic Nada
Genzel Martin
Kutyniok Gitta
Leichtle Alexander
Schütte Christof
Vybiral Jan
Wulkow Niklas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/11/2016
Field of study

Background: High-throughput proteomics techniques, such as mass spectrometry (MS)-based approaches, produce very high-dimensional data-sets. In a clinical setting one is often interested in how mass spectra differ between patients of different classes, for example spectra from healthy patients vs. spectra from patients having a particular disease. Machine learning algorithms are needed to (a) identify these discriminating features and (b) classify unknown spectra based on this feature set. Since the acquired data is usually noisy, the algorithms should be robust against noise and outliers, while the identified feature set should be as small as possible. Results: We present a new algorithm, Sparse Proteomics Analysis (SPA), based on the theory of compressed sensing that allows us to identify a minimal discriminating set of features from mass spectrometry data-sets. We show (1) how our method performs on artificial and real-world data-sets, (2) that its performance is competitive with standard (and widely used) algorithms for analyzing proteomics data, and (3) that it is robust against random and systematic noise. We further demonstrate the applicability of our algorithm to two previously published clinical data-sets

arXiv.org e-Print Archive

Institutional Repository of the Freie Universität Berlin

DepositOnce

Repository: Freie Universität Berlin (FU), Math Department (fu_mi_publications)

PubMed Central

Bern Open Repository and Information System (BORIS)