Search CORE

25 research outputs found

Enhancing the accuracy of HMM-based conserved pathway prediction using global correspondence scores

Author: A Osman
AL Barabasi
AL Barabasi
B Srinivasan
BJ Yoon
BP Kelley
Byung-Jun Yoon
CS Liao
G Klau
J Flannick
J Flannick
M Ashburner
M Kanehisa
M Koyutürk
M Zaslavskiy
Q Yang
R Aebersold
R Pinter
R Sharan
R Sharan
R Singh
S Sahraeian
Sayed Mohammad Ebrahim Sahraeian
SME Sahraeian
W Tian
X Qian
X Qian
Xiaoning Qian
Z Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

RNA secondary structure prediction from multi-aligned sequences

It has been well accepted that the RNA secondary structures of most functional non-coding RNAs (ncRNAs) are closely related to their functions and are conserved during evolution. Hence, prediction of conserved secondary structures from evolutionarily related sequences is one important task in RNA bioinformatics; the methods are useful not only to further functional analyses of ncRNAs but also to improve the accuracy of secondary structure predictions and to find novel functional RNAs from the genome. In this review, I focus on common secondary structure prediction from a given aligned RNA sequence, in which one secondary structure whose length is equal to that of the input alignment is predicted. I systematically review and classify existing tools and algorithms for the problem, by utilizing the information employed in the tools and by adopting a unified viewpoint based on maximum expected gain (MEG) estimators. I believe that this classification will allow a deeper understanding of each tool and provide users with useful information for selecting tools for common secondary structure predictions.Comment: A preprint of an invited review manuscript that will be published in a chapter of the book `Methods in Molecular Biology'. Note that this version of the manuscript may differ from the published versio

arXiv.org e-Print Archive

CiteSeerX

Crossref

Establishing reference samples for detection of somatic mutations and germline variants with NGS technologies

Author: Abaan O.D.
Cam M.
Chen W.
Chen Z.
de Mars M.
Donaldson E.
Drabek J.
Duerken-Hughes P.
Ebrahim Sahraeian S.M.
Fang L.T.
Gasparotto D.
Guo Y.
Hong H.
Hung T.
Idler K.
Jacob H.
Jaeger E.
Jensen R.V.
Kalamegham R.
Kerrigan L.
Kusko R.
Kõks S.
Lack J.
Lam H.Y.K.
Langenbach K.
Li J.
Li Z.
Liljedahl U.
Lu C.
Maestro R.
Meerzaman D.
Mohiyuddin M.
Moos M.
Nguyen C.
Ning B.
Nordlund J.
Peters E.
Petitjean V.
Pirooznia M.
Reimann E.
Ren L.
Scherer A.
Schroth G.
Shen T-W
Sherry S.
Shetty J.
Shi L.
Song L.
Stanbouly S.
Sultan M.
Talsania K.
Tezak Z.
Tran B.
Wang C.
Xiao C.
Xiao W.
Yang Z.
Yao L.
Yu Y.
Zhao Y.
Zheng Y.
Zhu B.
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/01/2019
Field of study

We characterized two reference samples for NGS technologies: a human triple-negative breast cancer cell line and a matched normal cell line. Leveraging several whole-genome sequencing (WGS) platforms, multiple sequencing replicates, and orthogonal mutation detection bioinformatics pipelines, we minimized the potential biases from sequencing technologies, assays, and informatics. Thus, our “truth sets” were defined using evidence from 21 repeats of WGS runs with coverages ranging from 50X to 100X (a total of 140 billion reads). These “truth sets” present many relevant variants/mutations including 193 COSMIC mutations and 9,016 germline variants from the ClinVar database, nonsense mutations in BRCA1/2 and missense mutations in TP53 and FGFR1. Independent validation in three orthogonal experiments demonstrated a successful stress test of the truth set. We expect these reference materials and “truth sets” to facilitate assay development, qualification, validation, and proficiency testing. In addition, our methods can be extended to establish new fully characterized reference samples for the community

Research Repository

PROPER: global protein interaction network alignment through percolation matching

Author: A Chatr-aryamontri
A Egozi
A Elmsallati
AE Aladag
AHY Tong
B Seah
BP Kelley
BP Kelley
C Clark
C Pesquita
C Roth
CS Liao
D Barrell
D Conte
D Cullina
D Davis
D Devos
D Park
E Kazemi
Ehsan Kazemi
F Alkan
FE Faisal
H Hermjakob
H Yu
Hamed Hassani
Hassan Pezeshgi Modarres
HTT Phan
J Berg
J Flannick
J Hu
K Sjölander
L Licata
LR Matthews
M Ashburner
M Bayati
M Koyutürk
M Madan Babu
M Remm
M Zaslavskiy
Matthias Grossglauser
N Korula
N Malod-Dognin
O Kuchaiev
O Kuchaiev
P Erdös
R Aebersold
R Apweiler
R Patro
R Sharan
R Sharan
R Singh
R Singh
S Navlakha
S Orchard
S Peri
S Suthram
SA Teichmann
SF Altschul
SME Sahraeian
SR Collins
T Ito
T Joshi
T Milenković
V Memišević
V Saraph
V Vijayan
X Zhang
Z Liang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2016
Field of study

Background The alignment of protein-protein interaction (PPI) networks enables us to uncover the relationships between different species, which leads to a deeper understanding of biological systems. Network alignment can be used to transfer biological knowledge between species. Although different PI-network alignment algorithms were introduced during the last decade, developing an accurate and scalable algorithm that can find alignments with high biological and structural similarities among PPI networks is still challenging. Results In this paper, we introduce a new global network alignment algorithm for PPI networks called PROPER. Compared to other global network alignment methods, our algorithm shows higher accuracy and speed over real PPI datasets and synthetic networks. We show that the PROPER algorithm can detect large portions of conserved biological pathways between species. Also, using a simple parsimonious evolutionary model, we explain why PROPER performs well based on several different comparison criteria. Conclusions We highlight that PROPER has high potential in further applications such as detecting biological pathways, finding protein complexes and PPI prediction. The PROPER algorithm is available at http://proper.epfl.ch

Infoscience - École polytechnique fédérale de Lausanne

Repository for Publications and Research Data

Crossref

Springer - Publisher Connector

PubMed Central

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Author: Almeida-e-Silva DC
Altenhoff A
Babbitt PC
Bankapur AR
Bargsten JW
Ben-Hur A
Benso A
Bhat P
Bonneau R
Brenner SE
Bryson K
Cao RZ
Casadio R
Cejuela JM
Chapman S
Chen CT
Cheng JL
Cibrian-Uhalte E
Clark WT
Cozzetto D
D'Andrea D
Das S
Dawson NL
del Pozo A
Denny P
Dessimoz C
Di Carlo S
Dogan T
Dukka BKC
ElShal S
Falda M
Fang H
Feng S
Fernandez JM
Ferrari C
Fontana P
Foulger RE
Friedberg I
Funk CS
Gabaldon T
Gemovic B
Gillis J
Ginter F
Giollo M
Glisic S
Goldberg T
Gong QT
Gough J
Greene CS
Hakala K
Hamp T
Hieta R
Holm L
Hsu WL
Huntley RP
Jiang YX
Jones DT
Kaewphan S
Kahanda I
Kansakar L
Khan IK
Kihara D
Koo DCE
Koskinen P
Lavezzo E
Lee D
Lees JG
Legge D
Lepore R
Li B
Lin A
Linial M
Lovering RC
Magrane M
Maietta P
Marcet-Houben M
Martelli PL
Martin MJ
Mehryary F
Melidoni AN
Mesiti M
Minneci F
Mooney SD
Moreau Y
Mutowo-Meullenet P
Nepusz T
Ning W
O'Donovan C
Oates M
Ofer D
Orengo CA
Oron TR
Paccanaro A
Pavlidis P
Penfold-Brown D
Perovic V
Pichler K
Piovesan D
Politano G
Profiti G
Radivojac P
Rappoport N
Re M
Rehman HU
Richter L
Robinson PN
Romero AE
Rost B
Sahraeian SME
Salakoski T
Salamov A
Sasidharan R
Savino A
Sedeno-Cortes AE
Sharan M
Shasha D
Shypitsyna A
Sillitoe I
Skunca N
Smithers B
Stern A
Sternberg MJE
Supek F
Tian WD
Toppo S
Toronen P
Tosatto SCE
Tramontano A
Tranchevent LC
Tress ML
Valencia A
Valentini G
van Dijk ADJ
Veljkovic N
Veljkovic V
Vencio RZN
Verspoor KM
Vogel J
Vucetic S
Wang Z
Wass MN
Yang HX
Youngs N
Zakeri P
Zhang S
Zhong Z
Zhou YP
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/10/2022
Field of study

Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2.Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent

UTUPub