71 research outputs found
Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction
© 2014 Ashworth et al. Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of nonmodel organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer
Structure-based predictions broadly link transcription factor mutations to gene expression changes in cancers
© 2014 The Author(s). Thousands of unique mutations in transcription factors (TFs) arise in cancers, and the functional and biological roles of relatively few of these have been characterized. Here, we used structure-based methods developed specifically for DNA-binding proteins to systematically predict the consequences of mutations in several TFs that are frequently mutated in cancers. The explicit consideration of protein-DNA interactions was crucial to explain the roles and prevalence of mutations in TP53 and RUNX1 in cancers, and resulted in a higher specificity of detection for known p53-regulated genes among genetic associations between TP53 genotypes and genome-wide expression in The Cancer Genome Atlas, compared to existing methods of mutation assessment. Biophysical predictions also indicated that the relative prevalence of TP53 missense mutations in cancer is proportional to their thermodynamic impacts on protein stability and DNA binding, which is consistent with the selection for the loss of p53 transcriptional function in cancers. Structure and thermodynamics-based predictions of the impacts of missense mutations that focus on specific molecular functions may be increasingly useful for the precise and large-scale inference of aberrant molecular phenotypes in cancer and other complex diseases
Evolution of context dependent regulation by expansion of feast/famine regulatory proteins
BACKGROUND: Expansion of transcription factors is believed to have played a crucial role in evolution of all organisms by enabling them to deal with dynamic environments and colonize new environments. We investigated how the expansion of the Feast/Famine Regulatory Protein (FFRP) or Lrp-like proteins into an eight-member family in Halobacterium salinarum NRC-1 has aided in niche-adaptation of this archaeon to a complex and dynamically changing hypersaline environment.RESULTS: We mapped genome-wide binding locations for all eight FFRPs, investigated their preference for binding different effector molecules, and identified the contexts in which they act by analyzing transcriptional responses across 35 growth conditions that mimic different environmental and nutritional conditions this organism is likely to encounter in the wild. Integrative analysis of these data constructed an FFRP regulatory network with conditionally active states that reveal how interrelated variations in DNA-binding domains, effector-molecule preferences, and binding sites in target gene promoters have tuned the functions of each FFRP to the environments in which they act. We demonstrate how conditional regulation of similar genes by two FFRPs, AsnC (an activator) and VNG1237C (a repressor), have striking environment-specific fitness consequences for oxidative stress management and growth, respectively.CONCLUSIONS: This study provides a systems perspective into the evolutionary process by which gene duplication within a transcription factor family contributes to environment-specific adaptation of an organism
A nonsynonymous SNP within PCDH15 is associated with lipid traits in familial combined hyperlipidemia
Familial combined hyperlipidemia (FCHL) is a common lipid disorder characterized by the presence of multiple lipoprotein phenotypes that increase the risk of premature coronary heart disease. In a previous study, we identified an intragenic microsatellite marker within the protocadherin 15 (PCDH15) gene to be associated with high triglycerides (TGs) in Finnish dyslipidemic families. In this study we analyzed all four known nonsynonymous SNPs within PCDH15 in 1,268 individuals from Finnish and Dutch multigenerational families with FCHL. Association analyses of quantitative traits for SNPs were performed using the QTDT test. The nonsynonymous SNP rs10825269 resulted in a P = 0.0006 for the quantitative TG trait. Additional evidence for association was observed with the same SNP for apolipoprotein B levels (apo-B) (P = 0.0001) and total cholesterol (TC) levels (P = 0.001). None of the other three SNPs tested showed a significant association with any lipid-related trait. We investigated the expression of PCDH15 in different human tissues and observed that PCDH15 is expressed in several tissues including liver and pancreas. In addition, we measured the plasma lipid levels in mice with loss-of-function mutations in Pcdh15 (Pcdh15av-Tg and Pcdh15av-3J) to investigate possible abnormalities in their lipid profile. We observed a significant difference in plasma TG and TC concentrations for the Pcdh15av-3J carriers when compared with the wild type (P = 0.013 and P = 0.044, respectively). Our study suggests that PCDH15 is associated with lipid abnormalities
Phenotypic Evidence of Emerging Ivermectin Resistance in Onchocerca volvulus
Onchocerciasis, commonly known as river blindness, is caused by the filarial nematode Onchocerca volvulus and is transmitted by a blackfly vector. Over 37 million people are thought to be infected, with over 90 million at risk. Infection predominantly occurs in sub-Saharan Africa. Foci also exist in the Arabian Peninsula and Central and South America. Ivermectin, the sole pharmaceutical available for mass chemotherapy, has been used on a community basis for annual or semi-annual treatment since 1987. Multiple treatments with ivermectin kill the microfilariae that are responsible for the pathology of onchocerciasis. More importantly, ivermectin suppresses the reproductive activity of the adult female worms, thus delaying or preventing the repopulation of the skin with new microfilariae and thereby reducing transmission. This study extends earlier reports of sub-optimal responses to ivermectin by examining repopulation levels of microfilaria one year after treatment, worm burdens per nodule, the age structure of adult female worms recovered from nodules, and the reproductive status of adult female worms 90 days after ivermectin treatment. In some communities which have shown a pattern of sub-optimal response to treatment, the data is consistent with an emergence of ivermectin non response or resistance manifested by a loss of the effect of ivermectin on the suppression of parasite reproduction
Adipose Co-expression networks across Finns and Mexicans identify novel triglyceride-associated genes
BACKGROUND: High serum triglyceride (TG) levels is an established risk factor for coronary heart disease (CHD). Fat is stored in the form of TGs in human adipose tissue. We hypothesized that gene co-expression networks in human adipose tissue may be correlated with serum TG levels and help reveal novel genes involved in TG regulation. METHODS: Gene co-expression networks were constructed from two Finnish and one Mexican study sample using the blockwiseModules R function in Weighted Gene Co-expression Network Analysis (WGCNA). Overlap between TG-associated networks from each of the three study samples were calculated using a Fisher’s Exact test. Gene ontology was used to determine known pathways enriched in each TG-associated network. RESULTS: We measured gene expression in adipose samples from two Finnish and one Mexican study sample. In each study sample, we observed a gene co-expression network that was significantly associated with serum TG levels. The TG modules observed in Finns and Mexicans significantly overlapped and shared 34 genes. Seven of the 34 genes (ARHGAP30, CCR1, CXCL16, FERMT3, HCST, RNASET2, SELPG) were identified as the key hub genes of all three TG modules. Furthermore, two of the 34 genes (ARHGAP9, LST1) reside in previous TG GWAS regions, suggesting them as the regional candidates underlying the GWAS signals. CONCLUSIONS: This study presents a novel adipose gene co-expression network with 34 genes significantly correlated with serum TG across populations
A Systems Genetics Approach Implicates USF1, FADS3, and Other Causal Candidate Genes for Familial Combined Hyperlipidemia
We hypothesized that a common SNP in the 3' untranslated region of the upstream transcription factor 1 (USF1), rs3737787, may affect lipid traits by influencing gene expression levels, and we investigated this possibility utilizing the Mexican population, which has a high predisposition to dyslipidemia. We first associated rs3737787 genotypes in Mexican Familial Combined Hyperlipidemia (FCHL) case/control fat biopsies, with global expression patterns. To identify sets of co-expressed genes co-regulated by similar factors such as transcription factors, genetic variants, or environmental effects, we utilized weighted gene co-expression network analysis (WGCNA). Through WGCNA in the Mexican FCHL fat biopsies we identified two significant Triglyceride (TG)-associated co-expression modules. One of these modules was also associated with FCHL, the other FCHL component traits, and rs3737787 genotypes. This USF1-regulated FCHL-associated (URFA) module was enriched for genes involved in lipid metabolic processes. Using systems genetics procedures we identified 18 causal candidate genes in the URFA module. The FCHL causal candidate gene fatty acid desaturase 3 (FADS3) was associated with TGs in a recent Caucasian genome-wide significant association study and we replicated this association in Mexican FCHL families. Based on a USF1-regulated FCHL-associated co-expression module and SNP rs3737787, we identify a set of causal candidate genes for FCHL-related traits. We then provide evidence from two independent datasets supporting FADS3 as a causal gene for FCHL and elevated TGs in Mexicans
Transcriptomic Characterization of Temperature Stress Responses in Larval Zebrafish
Temperature influences nearly all biochemical, physiological and life history activities of fish, but the molecular mechanisms underlying the temperature acclimation remains largely unknown. Previous studies have identified many temperature-regulated genes in adult tissues; however, the transcriptional responses of fish larvae to temperature stress are not well understood. In this study, we characterized the transcriptional responses in larval zebrafish exposed to cold or heat stress using microarray analysis. In comparison with genes expressed in the control at 28°C, a total of 2680 genes were found to be affected in 96 hpf larvae exposed to cold (16°C) or heat (34°C) for 2 and 48h and most of these genes were expressed in a temperature-specific and temporally regulated manner. Bioinformatic analysis identified multiple temperature-regulated biological processes and pathways. Biological processes overrepresented among the earliest genes induced by temperature stress include regulation of transcription, nucleosome assembly, chromatin organization and protein folding. However, processes such as RNA processing, cellular metal ion homeostasis and protein transport and were enriched in genes up-regulated under cold exposure for 48 h. Pathways such as mTOR signalling, p53 signalling and circadian rhythm were enriched among cold-induced genes, while adipocytokine signalling, protein export and arginine and praline metabolism were enriched among heat-induced genes. Although most of these biological processes and pathways were specifically regulated by cold or heat, common responses to both cold and heat stresses were also found. Thus, these findings provide new interesting clues for elucidation of mechanisms underlying the temperature acclimation in fish
Integrative Analysis of Low- and High-Resolution eQTL
The study of expression quantitative trait loci (eQTL) is a powerful way of detecting transcriptional regulators at a genomic scale and for elucidating how natural genetic variation impacts gene expression. Power and genetic resolution are heavily affected by the study population: whereas recombinant inbred (RI) strains yield greater statistical power with low genetic resolution, using diverse inbred or outbred strains improves genetic resolution at the cost of lower power. In order to overcome the limitations of both individual approaches, we combine data from RI strains with genetically more diverse strains and analyze hippocampus eQTL data obtained from mouse RI strains (BXD) and from a panel of diverse inbred strains (Mouse Diversity Panel, MDP). We perform a systematic analysis of the consistency of eQTL independently obtained from these two populations and demonstrate that a significant fraction of eQTL can be replicated. Based on existing knowledge from pathway databases we assess different approaches for using the high-resolution MDP data for fine mapping BXD eQTL. Finally, we apply this framework to an eQTL hotspot on chromosome 1 (Qrr1), which has been implicated in a range of neurological traits. Here we present the first systematic examination of the consistency between eQTL obtained independently from the BXD and MDP populations. Our analysis of fine-mapping approaches is based on ‘real life’ data as opposed to simulated data and it allows us to propose a strategy for using MDP data to fine map BXD eQTL. Application of this framework to Qrr1 reveals that this eQTL hotspot is not caused by just one (or few) ‘master regulators’, but actually by a set of polymorphic genes specific to the central nervous system
- …