Search CORE

159 research outputs found

The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics

Author: Diana Murray
Emmett Sprecher
Haiyuan Yu
Mark Gerstein
Philip M Kim
Valery Trifonov
Publication venue: Public Library of Science
Publication date: 01/01/2007
Field of study

It has been a long-standing goal in systems biology to find relations between the topological properties and functional features of protein networks. However, most of the focus in network studies has been on highly connected proteins (“hubs”). As a complementary notion, it is possible to define bottlenecks as proteins with a high betweenness centrality (i.e., network nodes that have many “shortest paths” going through them, analogous to major bridges and tunnels on a highway map). Bottlenecks are, in fact, key connector proteins with surprising functional and dynamic properties. In particular, they are more likely to be essential proteins. In fact, in regulatory and other directed networks, betweenness (i.e., “bottleneck-ness”) is a much more significant indicator of essentiality than degree (i.e., “hub-ness”). Furthermore, bottlenecks correspond to the dynamic components of the interaction network—they are significantly less well coexpressed with their neighbors than nonbottlenecks, implying that expression dynamics is wired into the network topology

CiteSeerX

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

iRegNet3D: three-dimensional integrated regulatory network for the genomic analysis of coding and non-coding disease mutations

Author: Cooper David Neil
Liang Siqi
Mort Matthew
Stenson Peter D.
Tippens Nathaniel D.
Yu Haiyuan
Zhou Yaoda
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/01/2017
Field of study

The mechanistic details of most disease-causing mutations remain poorly explored within the context of regulatory networks. We present a high-resolution three-dimensional integrated regulatory network (iRegNet3D) in the form of a web tool, where we resolve the interfaces of all known transcription factor (TF)-TF, TF-DNA and chromatinchromatin interactions for the analysis of both coding and non-coding disease-associated mutations to obtain mechanistic insights into their functional impact. Using iRegNet3D, we find that disease-associated mutations may perturb the regulatory network through diverse mechanisms including chromatin looping. iRegNet3D promises to be an indispensable tool in large-scale sequencing and disease association studies

Crossref

Online Research @ Cardiff

Springer - Publisher Connector

PubMed Central

Springer OAI

FigShare

nature biotechnology VOLUME

Author: Bram Thijssen
Haiyuan Yu
Jishnu Das
Steven M Lipkin
Xiaomu Wei
Xiujuan Wang
Publication venue
Publication date: 06/03/2020
Field of study

To better understand the molecular mechanisms and genetic basis of human disease, we systematically examine relationships between 3,949 genes, 62,663 mutations and 3,453 associated disorders by generating a three-dimensional, structurally resolved human interactome. This network consists of 4,222 high-quality binary protein-protein interactions with their atomic-resolution interfaces. We find that in-frame mutations (missense point mutations and in-frame insertions and deletions) are enriched on the interaction interfaces of proteins associated with the corresponding disorders, and that the disease specificity for different mutations of the same gene can be explained by their location within an interface. We also predict 292 candidate genes for 694 unknown disease-to-gene associations with proposed molecular mechanism hypotheses. This work indicates that knowledge of how in-frame disease mutations alter specific interactions is critical to understanding pathogenesis. Structurally resolved interaction networks should be valuable tools for interpreting the wealth of data being generated by large-scale structural genomics and disease association studies. Over the past few decades, a tremendous amount of resources and effort have been invested in mapping human disease loci genetically and later physically 1 . Since the completion of the human genome sequence, especially with advances in genome-wide association studies and ongoing cancer genome sequencing projects, an impressive list of disease-associated genes and their mutations have been produced 2 . However, it has rarely been possible to translate this wealth of information on individual mutations and their association with disease into biological or therapeutic insights 3 . Most of the drugs approved by the US Food and Drug Administration today are palliative 4 -they merely treat symptoms, rather than targeting specific genes or pathways responsible, even if associated genes are known. One main reason for this lack of success is the complex genotype-tophenotype relationships among diseases and their associated genes and mutations. In particular, (i) the same gene can be associated with multiple disorders (gene pleiotropy); and (ii) mutations in any one of many genes can cause the same clinical disorder (locus heterogeneity). For example, mutations in TP53 are linked to 32 clinically distinguishable forms of cancer and cancer-related disorders, whereas mutations in any of at least 12 different genes can lead to long QT syndrome. With the publication of several large-scale protein-protein interaction networks in human 5-8 , researchers have recently begun to use complex cellular networks to explore these genotype-to-phenotype relationships 2,9 , on the basis that many proteins function by interacting with other proteins. However, most analyses model proteins as graph-theoretical nodes, ignoring the structural details of individual proteins and the spatial constraints of their interactions. Here, we investigate on a large-scale the underlying molecular mechanisms for the complex genotype-to-phenotype relationships by integrating three-dimensional (3D) atomic-level protein structure information with high-quality large-scale protein-protein interaction data. Within the framework of this structurally resolved protein interactome, we examine the relationships among human diseases and their associated genes and mutations. RESULTS Structurally resolved protein interactome for human disease We first combined 12,577 reliable literature-curated binary interactions filtered from six widely used databases 10-15 (Online Methods) and 8,173 well-verified, high-throughput, yeast two-hybrid (Y2H) interactions Next, we structurally resolved the interfaces of these interactions using a homology modeling approach 16 . We used both iPfam 17 and 3did 18 to identify the interfaces of two interacting proteins by mapping them to known atomic-resolution 3D structures of interactions in the Protein Data Bank (PDB) Finally, to compile a comprehensive list of disease-associated genes and their mutations, we combined information from both Online Mendelian Inheritance in Man (OMIM

CiteSeerX

Genome-wide functional annotation and structural verification of metabolic ORFeome of Chlamydomonas reinhardtii

Author: Balaji Santhanam
Balcha Dawit
Fan Changyu
Ghamsari Lila
Hao Tong
Papin Jason A
Salehi-Ashtiani Kourosh
Shen Yun
Yang Xinping
Yu Haiyuan
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Recent advances in the field of metabolic engineering have been expedited by the availability of genome sequences and metabolic modelling approaches. The complete sequencing of the <it>C. reinhardtii</it> genome has made this unicellular alga a good candidate for metabolic engineering studies; however, the annotation of the relevant genes has not been validated and the much-needed metabolic ORFeome is currently unavailable. We describe our efforts on the functional annotation of the ORF models released by the Joint Genome Institute (JGI), prediction of their subcellular localizations, and experimental verification of their structural annotation at the genome scale. Results We assigned enzymatic functions to the translated JGI ORF models of <it>C. reinhardtii</it> by reciprocal BLAST searches of the putative proteome against the UniProt and AraCyc enzyme databases. The best match for each translated ORF was identified and the EC numbers were transferred onto the ORF models. Enzymatic functional assignment was extended to the paralogs of the ORFs by clustering ORFs using BLASTCLUST. In total, we assigned 911 enzymatic functions, including 886 EC numbers, to 1,427 transcripts. We further annotated the enzymatic ORFs by prediction of their subcellular localization. The majority of the ORFs are predicted to be compartmentalized in the cytosol and chloroplast. We verified the structure of the metabolism-related ORF models by reverse transcription-PCR of the functionally annotated ORFs. Following amplification and cloning, we carried out 454FLX and Sanger sequencing of the ORFs. Based on alignment of the 454FLX reads to the ORF predicted sequences, we obtained more than 90% coverage for more than 80% of the ORFs. In total, 1,087 ORF models were verified by 454 and Sanger sequencing methods. We obtained expression evidence for 98% of the metabolic ORFs in the algal cells grown under constant light in the presence of acetate. Conclusions We functionally annotated approximately 1,400 JGI predicted metabolic ORFs that can facilitate the reconstruction and refinement of a genome-scale metabolic network. The unveiling of the metabolic potential of this organism, along with structural verification of the relevant ORFs, facilitates the selection of metabolic engineering targets with applications in bioenergy and biopharmaceuticals. The ORF clones are a resource for downstream studies.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Network medicine links SARS-CoV-2/COVID-19 infection to brain microvascular injury and neuroinflammation in dementia-like cognitive impairment

Author: Cheng Feixiong
Hou Yuan
Jehi Lara
Kallianpur Asha
Leverenz James B.
Liu Yunlong
Mehra Reena
Pieper Andrew A.
Xu Jielin
Yu Haiyuan
Zhou Yadi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2021
Field of study

Background Dementia-like cognitive impairment is an increasingly reported complication of SARS-CoV-2 infection. However, the underlying mechanisms responsible for this complication remain unclear. A better understanding of causative processes by which COVID-19 may lead to cognitive impairment is essential for developing preventive and therapeutic interventions. Methods In this study, we conducted a network-based, multimodal omics comparison of COVID-19 and neurologic complications. We constructed the SARS-CoV-2 virus-host interactome from protein-protein interaction assay and CRISPR-Cas9-based genetic assay results and compared network-based relationships therein with those of known neurological manifestations using network proximity measures. We also investigated the transcriptomic profiles (including single-cell/nuclei RNA-sequencing) of Alzheimer’s disease (AD) marker genes from patients infected with COVID-19, as well as the prevalence of SARS-CoV-2 entry factors in the brains of AD patients not infected with SARS-CoV-2. Results We found significant network-based relationships between COVID-19 and neuroinflammation and brain microvascular injury pathways and processes which are implicated in AD. We also detected aberrant expression of AD biomarkers in the cerebrospinal fluid and blood of patients with COVID-19. While transcriptomic analyses showed relatively low expression of SARS-CoV-2 entry factors in human brain, neuroinflammatory changes were pronounced. In addition, single-nucleus transcriptomic analyses showed that expression of SARS-CoV-2 host factors (BSG and FURIN) and antiviral defense genes (LY6E, IFITM2, IFITM3, and IFNAR1) was elevated in brain endothelial cells of AD patients and healthy controls relative to neurons and other cell types, suggesting a possible role for brain microvascular injury in COVID-19-mediated cognitive impairment. Overall, individuals with the AD risk allele APOE E4/E4 displayed reduced expression of antiviral defense genes compared to APOE E3/E3 individuals. Conclusion Our results suggest significant mechanistic overlap between AD and COVID-19, centered on neuroinflammation and microvascular injury. These results help improve our understanding of COVID-19-associated neurological manifestations and provide guidance for future development of preventive or treatment interventions, although causal relationship and mechanistic pathways between COVID-19 and AD need future investigations

IUPUIScholarWorks

Directory of Open Access Journals

PubMed Central

Edgetic perturbation models of human inherited disorders

Author: Amélie Dricot
Benoit Charloteaux
Changyu Fan
Chenwei Lin
Danny Mou
David E Hill
David Szeto
Denis Dupuy
Dreze M
Fabien Heuze
Haiyuan Yu
Han Yan
Kavitha Venkatesan
Marc Vidal
Michael E Cusick
Muhammed A Yildirim
Nicolas Simonis
Niels Klitgord
Qian‐Ru Li
Quan Zhong
Robert Brasseur
Stanley Tam
Stuart Milstein
Tong Hao
Venus Swearingen
Ye Y
Publication venue: Nature Publishing Group
Publication date: 01/01/2009
Field of study

Cellular functions are mediated through complex systems of macromolecules and metabolites linked through biochemical and physical interactions, represented in interactome models as ‘nodes' and ‘edges', respectively. Better understanding of genotype-to-phenotype relationships in human disease will require modeling of how disease-causing mutations affect systems or interactome properties. Here we investigate how perturbations of interactome networks may differ between complete loss of gene products (‘node removal') and interaction-specific or edge-specific (‘edgetic') alterations. Global computational analyses of ∼50 000 known causative mutations in human Mendelian disorders revealed clear separations of mutations probably corresponding to those of node removal versus edgetic perturbations. Experimental characterization of mutant alleles in various disorders identified diverse edgetic interaction profiles of mutant proteins, which correlated with distinct structural properties of disease proteins and disease mechanisms. Edgetic perturbations seem to confer distinct functional consequences from node removal because a large fraction of cases in which a single gene is linked to multiple disorders can be modeled by distinguishing edgetic network perturbations. Edgetic network perturbation models might improve both the understanding of dissemination of disease alleles in human populations and the development of molecular therapeutic strategies

Crossref

Harvard University - DASH

PubMed Central

Open Repository and Bibliography - Liège

DI-fusion

Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations

Author: Beltran Juan F.
Clark Andrew G.
Cooper David N.
Das Jishnu
Fragoza Robert
Keinan Alon
Liang Jin
Liang Siqi
Mort Matthew
Rivera-Erick Christen A.
Schimenti John C.
Stenson Peter D.
Tran Tina N.
Wang Ting-Yi
Wei Xiaomu
Wierbowski Shayne D.
Yao Li
Ye Kaixiong
Yu Haiyuan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/09/2019
Field of study

Each human genome carries tens of thousands of coding variants. The extent to which this variation is functional and the mechanisms by which they exert their influence remains largely unexplored. To address this gap, we leverage the ExAC database of 60,706 human exomes to investigate experimentally the impact of 2009 missense single nucleotide variants (SNVs) across 2185 protein-protein interactions, generating interaction profiles for 4797 SNV-interaction pairs, of which 421 SNVs segregate at > 1% allele frequency in human populations. We find that interaction-disruptive SNVs are prevalent at both rare and common allele frequencies. Furthermore, these results suggest that 10.5% of missense variants carried per individual are disruptive, a higher proportion than previously reported; this indicates that each individual’s genetic makeup may be significantly more complex than expected. Finally, we demonstrate that candidate disease-associated mutations can be identified through shared interaction perturbations between variants of interest and known disease mutations

Online Research @ Cardiff