86 research outputs found

From sequence to structure to networks

Author: Käll Lukas
Yosef Nir
Publication venue: BioMed Central
Publication date: 01/01/2008

A report on the 7th European Conference on Computational Biology (ECCB), Cagliari, Italy, 22-26 September 2008

Crossref

PubMed Central

Advantages of combined transmembrane topology and signal peptide prediction—the Phobius web server

Author: Krogh Anders
Käll Lukas
Sonnhammer Erik L.L.
Publication venue: Oxford University Press
Publication date: 01/01/2007

When using conventional transmembrane topology and signal peptide predictors, such as TMHMM and SignalP, there is a substantial overlap between these two types of predictions. Applying these methods to five complete proteomes, we found that 30–65% of all predicted signal peptides and 25–35% of all predicted transmembrane topologies overlap. This impairs predictions of 5–10% of the proteome, hence this is an important issue in protein annotation

Crossref

PubMed Central

Copenhagen University Research Information System

qvality: non-parametric estimation of q-values and posterior error probabilities

Author: John D. Storey
Lukas Käll
William Stafford Noble
Publication venue: Oxford University Press
Publication date

Summary: Qvality is a C++ program for estimating two types of standard statistical confidence measures: the q-value, which is an analog of the p-value that incorporates multiple testing correction, and the posterior error probability (PEP, also known as the local false discovery rate), which corresponds to the probability that a given observation is drawn from the null distribution. In computing q-values, qvality employs a standard bootstrap procedure to estimate the prior probability of a score being from the null distribution; for PEP estimation, qvality relies upon non-parametric logistic regression. Relative to other tools for estimating statistical confidence measures, qvality is unique in its ability to estimate both types of scores directly from a null distribution, without requiring the user to calculate p-values

Crossref

PubMed Central

Retention Time and Fragmentation Predictors Increase Confidence in Identification of Common Variant Peptides

Author: Bouyssié David
Kuznetsova Ksenia
Käll Lukas
Skiadopoulou Dafni
Vasicek Jakub
Vaudel Marc
Publication venue: ACS
Publication date: 01/01/2023

Precision medicine focuses on adapting care to the individual profile of patients, for example, accounting for their unique genetic makeup. Being able to account for the effect of genetic variation on the proteome holds great promise toward this goal. However, identifying the protein products of genetic variation using mass spectrometry has proven very challenging. Here we show that the identification of variant peptides can be improved by the integration of retention time and fragmentation predictors into a unified proteogenomic pipeline. By combining these intrinsic peptide characteristics using the search-engine post-processor Percolator, we demonstrate improved discrimination power between correct and incorrect peptide-spectrum matches. Our results demonstrate that the drop in performance that is induced when expanding a protein sequence database can be compensated, hence enabling efficient identification of genetic variation products in proteomics data. We anticipate that this enhancement of proteogenomic pipelines can provide a more refined picture of the unique proteome of patients and thereby contribute to improving patient care.

University of Bergen

ABRF Proteome Informatics Research Group (iPRG) 2016 Study: Inferring Proteoforms from Bottom-up Proteomics Data.

Author: Choi Hyungwon
Colangelo Christopher M
Davis Darryl
Hoopmann Michael R
Käll Lukas
Lam Henry
Lee Joon-Yong
Palmblad Magnus
Payne Samuel H
Perez-Riverol Yasset
The Matthew
Weintraub Susan T
Wilson Ryan
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 01/07/2018

This report presents the results from the 2016 Association of Biomolecular Resource Facilities Proteome Informatics Research Group (iPRG) study on proteoform inference and false discovery rate (FDR) estimation from bottom-up proteomics data. For this study, 3 replicate Q Exactive Orbitrap liquid chromatography-tandem mass spectrometry datasets were generated from each of

Providence St. Joseph Health Digital Commons

Computational Mass Spectrometry–Based Proteomics

Author: Fran Lewitter
Lukas Käll
Olga Vitek
Publication venue: Public Library of Science
Publication date: 01/12/2011

Crossref

Directory of Open Access Journals

PubMed Central

Predicting transmembrane topology and signal peptides with hidden Markov models

Author: Käll Lukas
Publication venue: Centrum för Genomik och Bioinformatik (CGB) / Center for Genomics Research
Publication date: 07/04/2006

Transmembrane proteins make up a large and important class of proteins. About 20% of all genes encode transmembrane proteins. They control both substances and information going in and out of a cell. Yet basic knowledge about membrane insertion and folding is sparse, and our ability to identify, over-express, purify, and crystallize transmembrane proteins lags far behind the field of water-soluble proteins. It is difficult to determine the three-dimensional structures of transmembrane proteins. Therefore, researchers normally attempt to determine their topology, i.e. which parts of the protein are buried in the membrane, and on which side of the membrane the other parts are located. Proteins aimed for export have an N-terminal sequence known as a signal peptide that is inserted into the membrane and cleaved off. The same mechanism that inserts transmembrane proteins into their membranes also handles the export of proteins with signal peptides. Transmembrane helices and signal peptides thus have many features in common. In silico methods for predicting transmembrane topology and methods for predicting signal peptides from amino acid sequence are a fast and relatively accurate alternative to biochemical experiments. A methodology called hidden Markov models (HMMs) has proved particularly useful for these and other prediction tasks. In this thesis, properties of transmembrane topology predictors and signal peptide predictors are investigated. It includes three novel HMM-based prediction methods.
i) A combined transmembrane topology and signal peptide predictor, Phobius. The paper shows that cross predictions, i.e. signal peptides predicted as transmembrane helices and vice versa, are a common problem. About 10% of the genes in E. coli have overlapping signal peptide and transmembrane helix predictions by conventional predictors. We were able to dramatically lower these false cross predictions.
ii) A method for detecting remote G protein-coupled receptor (GPCR) families, GPCRHMM. GPCRs are a very large and divergent superfamily of transmembrane proteins. We designed a hidden Markov model based on the topological regions of the superfamily. We searched five genomes and predicted 120 previously not annotated sequences as possible GPCRs. The majority of these predictions (102) were in C. elegans, but 4 were found in human and 7 in mouse. We also conclude that a family of odorant receptors in Drosophila are not GPCRs.
iii) A method to improve predictions with HMMs of generic sequence features (such as transmembrane segments or signal peptides) by including homologs. We show that the performance of Phobius using this decoder was significantly better than with other decoders.
We also assessed the difficulty of benchmark sets used in transmembrane topology prediction. By studying the level of agreement between different predictors applied to typical benchmark sets and whole proteome sets, we concluded that the benchmark sets are far easier to predict than reality. In other words, the accuracies reported in benchmark studies are exaggerated. The thesis also includes a paper presenting a hypothesis of the transmembrane topology of presenilin, a protein involved in the development of Alzheimer's disease. By comparing the output of several transmembrane topology predictors with experimental results from previous studies, a novel nine-transmembrane topology with an extracellular C-terminus was elucidated.

Publications from Karolinska Institutet

Focus on the spectra that matter by clustering of quantification data in shotgun proteomics

Author: Lukas Käll
Matthew The
Publication venue: Springer Science and Business Media LLC
Publication date: 01/06/2020

Matching mass spectra to peptide sequences is the usual first step in proteomics data analysis, often followed by peptide quantification. Here, the authors show that clustering and quantifying mass spectral features prior to peptide identification can increase the sensitivity of label-free quantitative proteomics

Directory of Open Access Journals