Search CORE

13 research outputs found

STAR-Fusion: Fast and Accurate Fusion Transcript Detection from RNA-Seq

Author: Bankapur Asma
Doak Thomas
Dobin Alex
Ganote Carrie
Gingeras Thomas
Haas Brian
Li Bo
Pochet Nathalie
Regev Aviv
Stransky Nicolas
Sun Jing
Tickle Timothy
Wu Catherine
Yang Xiao
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 24/03/2017
Field of study

Motivation Fusion genes created by genomic rearrangements can be potent drivers of tumorigenesis. However, accurate identification of functionally fusion genes from genomic sequencing requires whole genome sequencing, since exonic sequencing alone is often insufficient. Transcriptome sequencing provides a direct, highly effective alternative for capturing molecular evidence of expressed fusions in the precision medicine pipeline, but current methods tend to be inefficient or insufficiently accurate, lacking in sensitivity or predicting large numbers of false positives. Here, we describe STAR-Fusion, a method that is both fast and accurate in identifying fusion transcripts from RNA-Seq data. Results We benchmarked STAR-Fusion’s fusion detection accuracy using both simulated and genuine Illumina paired-end RNA-Seq data, and show that it has superior performance compared to popular alternative fusion detection methods. Availability and implementation STAR-Fusion is implemented in Perl, freely available as open source software at http://star-fusion.github.io, and supported on Linux

Cold Spring Harbor Laboratory Institutional Repository

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Author: Almeida-e-Silva Danillo C.
Altenhoff Adrian
Babbitt Patricia C.
Bankapur Asma R.
Bargsten Joachim W.
Ben-Hur Asa
Benso Alfredo
Bhat Prajwal
Bkc Dukka
Bonneau Richard
Brenner Steven E.
Bryson Kevin
Cao Renzhi
Casadio Rita
Cejuela Juan M.
Chapman Samuel
Chen Ching-Tai
Cheng Jianlin
Cibrian-Uhalte Elena
Clark Wyatt T.
Cozzetto Domenico
D'Andrea Daniel
Das Sayoni
Dawson Natalie L.
del Pozo Angela
Denny Paul
Dessimoz Christophe
Di Carlo Stefano
Dogan Tunca
ElShal Sarah
Falda Marco
Fang Hai
Feng Shou
Fernández José M.
Ferrari Carlo
Fontana Paolo
Foulger Rebecca E.
Friedberg Iddo
Funk Christopher S.
Gabaldon Toni
Gemovic Branislava
Gillis Jesse
Ginter Filip
Giollo Manuel
Glisic Sanja
Goldberg Tatyana
Gong Qingtian
Gough Julian
Greene Casey S.
Hakala Kai
Hamp Tobias
Hieta Reija
Holm Liisa
Hsu Wen-Lian
Huntley Rachael P.
Jiang Yuxiang
Jones David T.
Kaewphan Suwisa
Kahanda Indika
Kansakar Lakesh
Khan Ishita K.
Kihara Daisuke
Koo Da Chen Emily
Koskinen Patrik
Lavezzo Enrico
Lee David
Lees Jonathan G.
Legge Duncan
Lepore Rosalba
Li Biao
Lin Alexandra
Linial Michal
Lovering Ruth C.
Magrane Michele
Maietta Paolo
Marcet-Houben Marina
Martelli Pier Luigi
Martin Maria J.
Mehryary Farrokh
Melidoni Anna N.
Mesiti Marco
Minneci Federico
Mooney Sean D.
Moreau Yves
Mutowo-Meullenet Prudence
Nepusz Tamás
Ning Wei
O'Donovan Claire
Oates Matt
Ofer Dan
Orengo Christine A.
Oron Tal Ronnen
Paccanaro Alberto
Pavlidis Paul
Penfold-Brown Duncan
Perovic Vladmir
Pichler Klemens
Piovesan Damiano
Politano Gianfranco
Profiti Giuseppe
Radivojac Predrag
Rappoport Nadav
Re Matteo
Rehman Hafeez Ur
Richter Lothar
Robinson Peter N.
Romero Alfonso E.
Rost Burkhard
Sahraeian Sayed M.E.
Salakoski Tapio
Salamov Asaf
Sasidharan Rajkumar
Savino Alessandro
Sedeño-Cortés Adriana E.
Sharan Malvika
Shasha Dennis
Shypitsyna Aleksandra
Sillitoe Ian
Skunca Nives
Smithers Ben
Stern Amos
Sternberg Michael J.E.
Supek Fran
Tian Weidong
Toppo Stefano
Tosatto Silvio C.E.
Tramontano Anna
Tranchevent Léon-Charles
Tress Michael L.
Törönen Petri
Valencia Alfonso
Valentini Giorgio
van Dijk Aalt D.J.
Veljkovic Nevena
Veljkovic Veljko
Vencio Ricardo ZN
Verspoor Karin M.
Vogel Jörg
Vucetic Slobodan
Wang Zheng
Wass Mark N.
Yang Haixuan
Youngs Noah
Zakeri Pooya
Zhang Shanshan
Zhong Zhaolong
Zhou Yuanpeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent. Keywords: Protein function prediction, Disease gene prioritizationpublishedVersio

Brage HiM

An Expanded Evaluation of Protein Function Prediction Methods Shows an Improvement In Accuracy

Author: Almeida-e-Silva Danillo C.
Altenhoff Adrian
Babbitt Patricia C.
Bankapur Asma R.
Bargsten Joachim W.
Ben-Hur Asa
Benso Alfredo
Bhat Prajwal
BKC Dukka
Bonneau Richard
Brenner Steven E.
Bryson Kevin
Cao Renzhi
Casadio Rita
Cejuela Juan M.
Chapan Samuel
Chen Ching-Tai
Cheng Jianlin
Cibrian-Uhalte Elenia
Clark Wyatt T.
Cozzetto Domenico
D\u27Andrea Daniel
Das Sayoni
Dawson Natalie L.
del Pozo Angela
Denny Paul
Dessimoz Christophe
Di Carlo Stefano
Dogan Tunca
ElShal Sarah
Falda Marco
Fang Hai
Feng Shou
Fernández José M.
Ferrari Carlo
Fontana Paolo
Foulger Rebecca E.
Friedberg Iddo
Funk Christopher S.
Gabaldon Toni
Gemovic Branislava
Gillis Jesse
Ginter Filip
Giollo Manuel
Glisic Sanja
Goldberg Tatyana
Gong Qingtian
Gough Julian
Greene Casey S.
Hakala Kai
Hamp Tobias
Hieta Reija
Holm Liisa
Hsu Wen-Lian
Huntley Rachael P.
Jiang Yuxiang
Jones David T.
Kaewphan Suwisa
Kahanda Indika
Kansakar Lakesh
Khan Ishita K.
Kihara Daisuke
Koo Da Chen Emily
Koskinen Patrik
Lavezzo Enrico
Lee David
Lees Jonathan G.
Legge Duncan
Lepore Rosalba
Li Biao
Lin Alexandra
Linial Michal
Lovering Ruth C.
Magrane Michele
Maietta Paolo
Marcet-Houben Marina
Martelli Pier Luigi
Martin Maria J.
Mehryar Farrokh
Melidoni Anna N.
Mesiti Marco
Minneci Federico
Mooney Sean D.
Moreau Yves
Mutowo-Meullenet Prudence
Nepusz Tamás
Ning Wei
O\u27Donovan Claire
Oates Matt
Ofer Dan
Orengo Christine A.
Oron Tal Ronnen
Paccanaro Alberto
Pavlidis Paul
Penfold-Brown Duncan
Perovic Vladmir
Pichler Klemens
Piovesan Damiano
Politano Gianfranco
Profiti Giuseppe
Radivojac Predrag
Rappoport Nadav
Re Matteo
Rehman Hafeez Ur
Richter Lothar
Robinson Peter N.
Romero Alfonso E.
Rost Burkhard
Sahraeian Sayed M.E.
Salakoski Tapio
Salamov Asaf
Sasidharan Rajkumar
Savino Alessandro
Sedeño-Cortés Adriana E.
Sharan Malvika
Shasha Dennis
Shypitsyna Aleksandra
Skunca Nives
Smithers Ben
Stern Amos
Sternberg Michael J.E.
Stilltoe Ian
Supek Fran
Tian Weidong
Toppo Stefano
Tosatto Silvio C.E.
Tramontano Anna
Tranchevent Léon-Charles
Tress Michael L.
Törönen Petri
Valencia Alfonso
Valentini Giorgio
van Dijk Aalt D.J.
Veljkovic Nevena
Veljkovic Veljko
Vencio Ricardo Z.N.
Verspoor Karin M.
Vogel Jörg
Vucetic Slobodan
Wang Zheng
Wass Mark N.
Yang Haixuan
Youngs Noah
Zakeri Pooya
Zhang Shanshan
Zhong Zhaolong
Zhou Yuanpeng
Publication venue: The Aquila Digital Community
Publication date: 07/09/2016
Field of study

Aquila Digital Community

Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome

Author: Alawi Alsheikh-Ali
Ammar Albanna
Asma Bankapur
Bakhrom K. Berdiev
Barbara Kellam
Bhooma Thiruvahindrapuram
Deena Alhashmi
Ghausia Begum
Hosneara Akter
Mohammed Uddin
Nasna Nassir
Noushad Karuvantevida
Richa Tambi
Stephen W. Scherer
Wilson W. L. Sung
Publication venue: 'MDPI AG'
Publication date: 19/02/2021
Field of study

The advent of long-read sequencing offers a new assessment method of detecting genomic structural variation (SV) in numerous rare genetic diseases. For autism spectrum disorders (ASD) cases where pathogenic variants fail to be found in the protein-coding genic regions along chromosomes, we proposed a scalable workflow to characterize the risk factor of SVs impacting non-coding elements of the genome. We applied whole-genome sequencing on an Emirati family having three children with ASD using long and short-read sequencing technology. A series of analytical pipelines were established to identify a set of SVs with high sensitivity and specificity. At 15-fold coverage, we observed that long-read sequencing technology (987 variants) detected a significantly higher number of SVs when compared to variants detected using short-read technology (509 variants) (p-value < 1.1020 × 10−57). Further comparison showed 97.9% of long-read sequencing variants were spanning within the 1–100 kb size range (p-value < 9.080 × 10−67) and impacting over 5000 genes. Moreover, long-read variants detected 604 non-coding RNAs (p-value < 9.02 × 10−9), comprising 58% microRNA, 31.9% lncRNA, and 9.1% snoRNA. Even at low coverage, long-read sequencing has shown to be a reliable technology in detecting SVs impacting complex elements of the genome

Multidisciplinary Digital Publishing Institute

Single-cell transcriptome identifies molecular subtype of autism spectrum disorder impacted by de novo loss-of-function variants regulating glial cells

Author: Ahmed Awab
AlBanna Ammar
Ali Abdulrahman
Bankapur Asma
Berdiev Bakhrom K.
Howe Jennifer L.
Inuwa Ibrahim M.
Nassir Nasna
Safizadeh Shabestari Seyed A.
Samara Bisan
Scherer Stephen W.
Uddin Mohammed
Woodbury-Smith Marc
Zarrei Mehdi
Publication venue: University of Toronto
Publication date: 21/11/2021
Field of study

Abstract Background In recent years, several hundred autism spectrum disorder (ASD) implicated genes have been discovered impacting a wide range of molecular pathways. However, the molecular underpinning of ASD, particularly from the point of view of ‘brain to behaviour’ pathogenic mechanisms, remains largely unknown. Methods We undertook a study to investigate patterns of spatiotemporal and cell type expression of ASD-implicated genes by integrating large-scale brain single-cell transcriptomes (> million cells) and de novo loss-of-function (LOF) ASD variants (impacting 852 genes from 40,122 cases). Results We identified multiple single-cell clusters from three distinct developmental human brain regions (anterior cingulate cortex, middle temporal gyrus and primary visual cortex) that evidenced high evolutionary constraint through enrichment for brain critical exons and high pLI genes. These clusters also showed significant enrichment with ASD loss-of-function variant genes (p < 5.23 × 10–11) that are transcriptionally highly active in prenatal brain regions (visual cortex and dorsolateral prefrontal cortex). Mapping ASD de novo LOF variant genes into large-scale human and mouse brain single-cell transcriptome analysis demonstrate enrichment of such genes into neuronal subtypes and are also enriched for subtype of non-neuronal glial cell types (astrocyte, p < 6.40 × 10–11, oligodendrocyte, p < 1.31 × 10–09). Conclusion Among the ASD genes enriched with pathogenic de novo LOF variants (i.e. KANK1, PLXNB1), a subgroup has restricted transcriptional regulation in non-neuronal cell types that are evolutionarily conserved. This association strongly suggests the involvement of subtype of non-neuronal glial cells in the pathogenesis of ASD and the need to explore other biological pathways for this disorder

University of Toronto Research Repository

PubMed Central

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Author: Bankapur Asma R.
Ben-Hur Asa
Bonneau Richard
Casadio Rita
Clark Wyatt T.
D’Andrea Daniel
Funk Christopher S.
Jiang Yuxiang
Kahanda Indika
Koo Da Chen Emily
Lepore Rosalba
Lin Alexandra
Martelli Pier Luigi
Oron Tal Ronnen
Penfold-Brown Duncan
Profiti Giuseppe
Sahraeian Sayed M. E.
Shasha Dennis
Verspoor Karin M.
Youngs Noah
Publication venue: Springer Nature
Publication date: 20/09/2018
Field of study

Irish Universities

An event-driven approach for studying gene block evolution in bacteria

Author: Andrews
Asma R. Bankapur
Aziz
Cherry
Dandekar
David C. Ream
Dayhoff
Downing
Enault
Enault
Fang
Fani
Fernandez Moran
Fondi
Fulton
González
Grishin
Grishin
Henikoff
Horowitz
Iddo Friedberg
Jun
Keseler
Langille
Langille
Larkin
Lawrence
Marcotte
Martin
Nitschké
Omelchenko
Overbeek
Overbeek
Overbeek
Pasek
Pellegrini
Powell
Price
Price
Pál
Ralling
Remm
Rocha
Salgado
Self
Srinivasan
Stahl
Steward
Szklarczyk
Tatusov
Ward
Wolf
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref

Single Cell Transcriptome Identifies FCGR3B Upregulated Subtype of Alveolar Macrophages in Patients with Critical COVID-19

Author: Ahmed A
Akter H
Al Heialy S
Al Mashshadani M
Alkhajeh A
Almidani O
Alsheikh-Ali A
Bankapur Asma
Begum G
Berdiev BB
Casanova JL
Deesi Z
Gaudet M
Hameid RA
Islam A
Kandasamy RK
Karuvantevida N
Khansaheb HH
Kuebler WM
Loney T
Nassir N
Nowotny N
Rahman P
Shabestari SAS
Tambi R
Tayoun AA
Uddin KMF
Uddin M
Woodbury-Smith M
Zehra B
Publication venue: Cell Press
Publication date
Field of study

Newcastle University E-Prints

Recommended from our members

An expanded evaluation of protein function prediction methods shows an improvement in accuracy.

Author: Altenhoff Adrian
Bankapur Asma R
Ben-Hur Asa
Bhat Prajwal
Bkc Dukka
Bonneau Richard
Bryson Kevin
Cao Renzhi
Casadio Rita
Cejuela Juan M
Chapman Samuel
Chen Ching-Tai
Cheng Jianlin
Cibrian-Uhalte Elena
Clark Wyatt T
Cozzetto Domenico
D'Andrea Daniel
Das Sayoni
Dawson Natalie L
Denny Paul
Dessimoz Christophe
Dogan Tunca
ElShal Sarah
Falda Marco
Fang Hai
Feng Shou
Ferrari Carlo
Fontana Paolo
Foulger Rebecca E
Funk Christopher S
Gabaldon Toni
Gillis Jesse
Ginter Filip
Giollo Manuel
Goldberg Tatyana
Gong Qingtian
Gough Julian
Hakala Kai
Hamp Tobias
Hieta Reija
Holm Liisa
Hsu Wen-Lian
Jiang Yuxiang
Jones David T
Kaewphan Suwisa
Kahanda Indika
Khan Ishita K
Kihara Daisuke
Koo Da Chen Emily
Koskinen Patrik
Lavezzo Enrico
Lee David
Lees Jonathan G
Legge Duncan
Lepore Rosalba
Li Biao
Lin Alexandra
Lovering Ruth C
Magrane Michele
Marcet-Houben Marina
Martelli Pier Luigi
Mehryary Farrokh
Melidoni Anna N
Minneci Federico
Mutowo-Meullenet Prudence
Nepusz Tamás
Ning Wei
Oates Matt
Ofer Dan
Oron Tal Ronnen
Paccanaro Alberto
Pavlidis Paul
Penfold-Brown Duncan
Pichler Klemens
Profiti Giuseppe
Rappoport Nadav
Richter Lothar
Romero Alfonso E
Sahraeian Sayed ME
Salakoski Tapio
Salamov Asaf
Sasidharan Rajkumar
Sedeño-Cortés Adriana E
Shasha Dennis
Shypitsyna Aleksandra
Sillitoe Ian
Skunca Nives
Smithers Ben
Stern Amos
Supek Fran
Tian Weidong
Toppo Stefano
Tranchevent Léon-Charles
Törönen Petri
Verspoor Karin M
Yang Haixuan
Youngs Noah
Zakeri Pooya
Zhong Zhaolong
Zhou Yuanpeng
Publication venue: eScholarship, University of California
Publication date: 01/09/2016
Field of study

BackgroundA major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.ResultsWe conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2.ConclusionsThe top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent

eScholarship - University of California