Search CORE

31 research outputs found

Gain-of-Function Variomics and Multi-Omics Network Biology For Precision Medicine

Author: Awasthi Sharad
Bisht Deepa
Coban Akdemir Zeynep H
Ghosh Sumanta
Li Mark M
Sahni Nidhi
Sheynkman Gloria M
Yi S Stephen
Publication venue: DigitalCommons@TMC
Publication date: 01/01/2023
Field of study

Traditionally, disease causal mutations were thought to disrupt gene function. However, it becomes more clear that many deleterious mutations could exhibit a gain-of-function (GOF) behavior. Systematic investigation of such mutations has been lacking and largely overlooked. Advances in next-generation sequencing have identified thousands of genomic variants that perturb the normal functions of proteins, further contributing to diverse phenotypic consequences in disease. Elucidating the functional pathways rewired by GOF mutations will be crucial for prioritizing disease-causing variants and their resultant therapeutic liabilities. In distinct cell types (with varying genotypes), precise signal transduction controls cell decision, including gene regulation and phenotypic output. When signal transduction goes awry due to GOF mutations, it would give rise to various disease types. Quantitative and molecular understanding of network perturbations by GOF mutations may provide explanations for \u27missing heritability in previous genome-wide association studies. We envision that it will be instrumental to push current paradigm toward a thorough functional and quantitative modeling of all GOF mutations and their mechanistic molecular events involved in disease development and progression. Many fundamental questions pertaining to genotype-phenotype relationships remain unresolved. For example, which GOF mutations are key for gene regulation and cellular decisions? What are the GOF mechanisms at various regulation levels? How do interaction networks undergo rewiring upon GOF mutations? Is it possible to leverage GOF mutations to reprogram signal transduction in cells, aiming to cure disease? to begin to address these questions, we will cover a wide range of topics regarding GOF disease mutations and their characterization by multi-omic networks. We highlight the fundamental function of GOF mutations and discuss the potential mechanistic effects in the context of signaling networks. We also discuss advances in bioinformatic and computational resources, which will dramatically help with studies on the functional and phenotypic consequences of GOF mutations

DigitalCommons@The Texas Medical Center

Ad-Syn-Net: Systematic Identification of alzheimer\u27s Disease-Associated Mutation and Co-Mutation Vulnerabilities Via Deep Learning

Author: Coban Akdemir Zeynep H
Gao Ruixuan
Huang Jason H
Jiang Xiaoqian
Pan Xingxin
Sahni Nidhi
Sheynkman Gloria M
Wu Erxi
Yi S Stephen
Publication venue: DigitalCommons@TMC
Publication date: 19/03/2023
Field of study

Alzheimer\u27s disease (AD) is one of the most challenging neurodegenerative diseases because of its complicated and progressive mechanisms, and multiple risk factors. Increasing research evidence demonstrates that genetics may be a key factor responsible for the occurrence of the disease. Although previous reports identified quite a few AD-associated genes, they were mostly limited owing to patient sample size and selection bias. There is a lack of comprehensive research aimed to identify AD-associated risk mutations systematically. to address this challenge, we hereby construct a large-scale AD mutation and co-mutation framework (\u27AD-Syn-Net\u27), and propose deep learning models named Deep-SMCI and Deep-CMCI configured with fully connected layers that are capable of predicting cognitive impairment of subjects effectively based on genetic mutation and co-mutation profiles. Next, we apply the customized frameworks to data sets to evaluate the importance scores of the mutations and identified mutation effectors and co-mutation combination vulnerabilities contributing to cognitive impairment. Furthermore, we evaluate the influence of mutation pairs on the network architecture to dissect the genetic organization of AD and identify novel co-mutations that could be responsible for dementia, laying a solid foundation for proposing future targeted therapy for AD precision medicine. Our deep learning model codes are available open access here: https://github.com/Pan-Bio/AD-mutation-effectors

DigitalCommons@The Texas Medical Center

Recommended from our members

Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing

Author: Allard Guy
Anvar Seyed Yahya
Ariyurek Yavuz
de Klerk Eleonora
den Dunnen Johan T.
Johansson Hans E.
Sheynkman Gloria M.
Tseng Elizabeth
Turner Stephen W.
Vermaat Martijn
Yin Raymund H.
‘t Hoen Peter A. C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Background: The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing. Results: In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells. Conclusions: Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing. Electronic supplementary material The online version of this article (10.1186/s13059-018-1418-0) contains supplementary material, which is available to authorized users

Harvard University - DASH

Directory of Open Access Journals

Leiden University Scholary Publications

Radboud Repository

FigShare

Enhanced protein isoform characterization through long-read proteogenomics

Author: Castaldi Peter J.
Chatzipantsiou Christina
Conesa Ana
Dai Yunxiang
Deslattes Mays Anne
Jeffery Erin D.
Jordan Ben T.
Kaur Simi
Luckey Chance John
Mehlferber Madison M.
Miller Rachel M.
Millikin Robert J.
Sheynkman Gloria M.
Shortreed Michael R.
Smith Lloyd M.
Tiberi Simone
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

[Background] The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms.[Results] We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis.[Conclusions] Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.This work was supported by a National Institutes of Health (NIH) grant R35GM142647 (G.M.S.), NIH grant R35GM126914 (L.M.S.), and Jackson Laboratory (A.D.M.). The codeathon which initiated the project was supported by the NIH STRIDES Initiative at the NIH.Peer reviewe

PubMed Central

Digital.CSIC

Full-length transcript sequencing of human and mouse cerebral cortex identifies widespread isoform diversity and alternative splicing.

Author: Ahmed Zeshan
Bray Nicholas J
Castanho Isabel
Collier David A
Davies Jonathan P
Dempster Emma L
Gandal Michael J
Hannon Eilis
Jeffery Erin D
Jeffries Aaron R
Jops Connor
Jordan Ben T
Leung Szi Kay
Mill Jonathan
Moore Karen
O'Neill Paul
Prabhakar Shyam
Schalkwyk Leonard
Sheynkman Gloria M
Tseng Elizabeth
Publication venue: 'Elsevier BV'
Publication date: 01/01/2021
Field of study

Alternative splicing is a post-transcriptional regulatory mechanism producing distinct mRNA molecules from a single pre-mRNA with a prominent role in the development and function of the central nervous system. We used long-read isoform sequencing to generate full-length transcript sequences in the human and mouse cortex. We identify novel transcripts not present in existing genome annotations, including transcripts mapping to putative novel (unannotated) genes and fusion transcripts incorporating exons from multiple genes. Global patterns of transcript diversity are similar between human and mouse cortex, although certain genes are characterized by striking differences between species. We also identify developmental changes in alternative splicing, with differential transcript usage between human fetal and adult cortex. Our data confirm the importance of alternative splicing in the cortex, dramatically increasing transcriptional diversity and representing an important mechanism underpinning gene regulation in the brain. We provide transcript-level data for human and mouse cortex as a resource to the scientific community

University of Essex Research Repository

Online Research @ Cardiff

Directory of Open Access Journals

Systematic assessment of long-read RNA-seq methods for transcript identification and quantification

Author: Adams Matthew S
Balderrama-Gutierrez Gabriela
Barnes If
Behera Amit K
Berry Andrew
Birol Inanc
Bostan Hamed
Brooks Angela N
Brooks Ashley M
Capella Salvador
Carbonell-Sala Sílvia
Carninci Piero
Chen Ying
Conesa Ana
De María Maite
Denslow Nancy D
Dhillon Namrita
Diekhans Mark
Du Mei RM
Fai Au Kin
Felton Colette
Fernandez-Gonzalez Jose M
Ferrández-Peral Luis
Frankish Adam
Garcia-Reyero Natàlia
Goetz Stefan
Gonzalez Jose M
Guigó Roderic
Göke Jonathan
Hafezqorani Saber
Hasan Çelik Muhammed
Hernández-Ferrer Carles
Herwig Ralf
Hunt Toby
Hunter Margaret E
Jerryd Meade Marcus
Kawaji Hideya
Kei Wan Yuk
Kondratova Liudmyla
Lagarde Julien
Laird Smith Melissa
Lee Joseph
Li Haoran
Liang Li Jian
Liang Cindy E
Lienhard Matthias
Liu Tianyuan
Loveland Jane E
Martinez-Martin Alessandra
Menor Carlos
Mestre-Tomás Jorge
Mikheenko Alla
Ming Nip Ka
Moraga Amador David A
Mortazavi Ali
Mudge Jonathan M
Mulligan Dennis
Panayotova Nedka G
Paniagua Alejandro
Pardo-Palacios Francisco J
Pertea Mihaela
Prjibelski Andrey D
Reese Fairlie
Repchevsky Dmitry
Ritchie Matthew E
Rouchka Eric
Saint-John Brandon
Sapena Enrique
Sheynkman Gloria M
Sheynkman Leon
Sim Andre D
Suner Marie-Marthe
Takahashi Hazuki
Tang Alison D
Tilgner Hagen U
Vollmers Christopher
Wang Changqing
Wang Dingjie
Williams Brian
Wold Barbara J
Wong Brandon Y
Yang Chen
Youngworth Ingrid Ashley
Publication venue: bioXRiv
Publication date: 27/07/2023
Field of study

The Long-read RNA-Seq Genome Annotation Assessment Project (LRGASP) Consortium was formed to evaluate the effectiveness of long-read approaches for transcriptome analysis. The consortium generated over 427 million long-read sequences from cDNA and direct RNA datasets, encompassing human, mouse, and manatee species, using different protocols and sequencing platforms. These data were utilized by developers to address challenges in transcript isoform detection and quantification, as well as de novo transcript isoform identification. The study revealed that libraries with longer, more accurate sequences produce more accurate transcripts than those with increased read depth, whereas greater read depth improved quantification accuracy. In well-annotated genomes, tools based on reference sequences demonstrated the best performance. When aiming to detect rare and novel transcripts or when using reference-free approaches, incorporating additional orthogonal data and replicate samples are advised. This collaborative study offers a benchmark for current practices and provides direction for future method development in transcriptome analysis

UCL Discovery

Systematic assessment of long-read RNA-seq methods for transcript identification and quantification

Author: Adams Matthew S.
Au Kin Fai
Balderrama-Gutierrez Gabriela
Barnes If
Behera Amit K.
Berry Andrew E.
Birol Inanc
Bostan Hamed
Brooks Angela N.
Brooks Ashley M.
Capella-Gutierrez Salvador
Carbonell-Sala Sílvia
Carninci Piero
Chen Ying
Conesa Ana
Cousineau Alyssa
De María Maite
Denslow Nancy D.
Dhillon Namrita
Diekhans Mark
Du Mei R. M.
Felton Colette
Fernandez-Gonzalez Jose M.
Ferrández-Peral Luis
Frankish Adam
Garcia-Reyero Natàlia
Gonzalez Martinez Jose M.
Guigó Roderic
Göke Jonathan
Götz Stefan
Hafezqorani Saber
Hernández-Ferrer Carles
Herwig Ralf
Hunt Toby
Hunter Margaret E.
Kawaji Hideya
Kondratova Liudmyla
Lagarde Julien
Lee Joseph
Li Haoran
Li Jian-Liang
Liang Cindy E.
Lienhard Matthias
Liu Tianyuan
Loveland Jane E.
Maehr Rene
Martinez-Martin Alessandra
Meade Marcus Jerryd
Menor Carlos
Mestre-Tomás Jorge
Mikheenko Alla
Moraga Amador David A.
Mortazavi Ali
Mudge Jonathan M.
Mulligan Dennis
Nip Ka Ming
Panayotova Nedka G.
Paniagua Alejandro
Pardo-Palacios Francisco J.
Pertea Mihaela
Prjibelski Andrey D.
Reese Fairlie
Ren Xingjie
Repchevsky Dmitry
Ritchie Matthew E.
Rouchka Eric
Saint-John Brandon
Sapena Enrique
Shen Yin
Sheynkman Gloria M.
Sheynkman Leon
Sim Andre D.
Smith Melissa Laird
Suner Marie-Marthe
Takahashi Hazuki
Tang Alison D.
Tilgner Hagen U.
Vollmers Christopher
Wan Yuk Kei
Wang Changqing
Wang Dingjie
Williams Brian
Wold Barbara J.
Wong Brandon Y.
Yang Chen
Youngworth Ingrid A.
Çelik Muhammed Hasan
Publication venue: Nature Research
Publication date: 07/06/2024
Field of study

The Long-read RNA-Seq Genome Annotation Assessment Project Consortium was formed to evaluate the effectiveness of long-read approaches for transcriptome analysis. Using different protocols and sequencing platforms, the consortium generated over 427 million long-read sequences from complementary DNA and direct RNA datasets, encompassing human, mouse and manatee species. Developers utilized these data to address challenges in transcript isoform detection, quantification and de novo transcript detection. The study revealed that libraries with longer, more accurate sequences produce more accurate transcripts than those with increased read depth, whereas greater read depth improved quantification accuracy. In well-annotated genomes, tools based on reference sequences demonstrated the best performance. Incorporating additional orthogonal data and replicate samples is advised when aiming to detect rare and novel transcripts or using reference-free approaches. This collaborative study offers a benchmark for current practices and provides direction for future method development in transcriptome analysis

Online Research @ Cardiff

A reference map of the human binary protein interactome.

Author: Aloy Patrick
Babor Mariana
Bader Gary D.
Balcha Dawit
Basha Omer
Begg Bridget E.
Bian Wenting
Bowman-Colin Christian
Brignall Ruth
Cafarelli Tiziana
Calderwood Michael A.
Campos-Laborie Francisco J.
Charloteaux Benoit
Chin Suet-Feung
Choi Dongsic
Choi Soon Gang
Colabella Claudia
Coppin Georges
Coté Atina G.
D'Amata Cassandra
Daley Meaghan
De Las Rivas Javier
De Ridder David
De Rouck Steffi
Deimling Steven
Desbuleux Alice
Dricot Amélie
Duran-Frigola Miquel
Ennajdaoui Hanane
Gaudet Suzanne
Gebbia Marinella
Goebels Florian
Goehring Liana
Gopal Anjali
Haddad Ghazal
Hao Tong
Hardy Madeleine F.
Hatchi Elodie
Helmy Mohamed
Hill David E.
Jacob Yves
Kassa Yoseph
Kim Dae-Kyum
Kishore Nishka
Knapp Jennifer J.
Kovács István A.
Lambourne Luke
Landini Serena
Lemmens Irma
Li Roujia
Luck Katja
MacWilliams Andrew
Markey Dylan
Mee Miles W.
Mellor Joseph C.
Paulson Joseph N.
Pollis Carl
Pons Carles
Rak Janusz
Rangarajan Sudharshan
Rasla John
Rayhan Ashyad
Richardson Aaron D.
Rolland Thomas
Roth Frederick P.
San-Miguel Adriana
Schlabach Sadie
Shen Yun
Sheykhkarimli Dayag
Sheynkman Gloria M.
Simonovsky Eyal
Spirohn Kerstin
Tavernier Jan
Taşan Murat
Teeking Bridget
Tejeda Alexander
Tropepe Vincent
Twizere Jean-Claude
van Lieshout Natascha
Vidal Marc
Wang Yang
Weatheritt Robert J.
Weile Jochen
Xia Yu
Yadav Anupama
Yang Xinping
Yeger-Lotem Esti
Zhong Quan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Global insights into cellular organization and genome function require comprehensive understanding of the interactome networks that mediate genotype-phenotype relationships(1,2). Here we present a human 'all-by-all' reference interactome map of human binary protein interactions, or 'HuRI'. With approximately 53,000 protein-protein interactions, HuRI has approximately four times as many such interactions as there are high-quality curated interactions from small-scale studies. The integration of HuRI with genome(3), transcriptome(4) and proteome(5) data enables cellular function to be studied within most physiological or pathological cellular contexts. We demonstrate the utility of HuRI in identifying the specific subcellular roles of protein-protein interactions. Inferred tissue-specific networks reveal general principles for the formation of cellular context-specific functions and elucidate potential molecular mechanisms that might underlie tissue-specific phenotypes of Mendelian diseases. HuRI is a systematic proteome-wide reference that links genomic variation to phenotypic outcomes

Crossref

Ghent University Academic Bibliography

Open Repository and Bibliography - Liège

HAL-Pasteur