1,132 research outputs found

    Gene connectivity, function, and sequence conservation: predictions from modular yeast co-expression networks

    Get PDF
    BACKGROUND: Genes and proteins are organized into functional modular networks in which the network context of a gene or protein has implications for cellular function. Highly connected hub proteins, largely responsible for maintaining network connectivity, have been found to be much more likely to be essential for yeast survival. RESULTS: Here we investigate the properties of weighted gene co-expression networks formed from multiple microarray datasets. The constructed networks approximate scale-free topology, but this is not universal across all datasets. We show strong positive correlations between gene connectivity within the whole network and gene essentiality as well as gene sequence conservation. We demonstrate the preservation of a modular structure of the networks formed, and demonstrate that, within some of these modules, it is possible to observe a strong correlation between connectivity and essentiality or between connectivity and conservation within the modules particularly within modules containing larger numbers of essential genes. CONCLUSION: Application of these techniques can allow a finer scale prediction of relative gene importance for a particular process within a group of similarly expressed genes

    Gene expression profiling in C57BL/6J and A/J mouse inbred strains reveals gene networks specific for brain regions independent of genetic background

    Get PDF
    Abstract Background We performed gene expression profiling of the amygdala and hippocampus taken from inbred mouse strains C57BL/6J and A/J. The selected brain areas are implicated in neurobehavioral traits while these mouse strains are known to differ widely in behavior. Consequently, we hypothesized that comparing gene expression profiles for specific brain regions in these strains might provide insight into the molecular mechanisms of human neuropsychiatric traits. We performed a whole-genome gene expression experiment and applied a systems biology approach using weighted gene co-expression network analysis. Results We were able to identify modules of co-expressed genes that distinguish a strain or brain region. Analysis of the networks that are most informative for hippocampus and amygdala revealed enrichment in neurologically, genetically and psychologically related pathways. Close examination of the strain-specific gene expression profiles, however, revealed no functional relevance but a significant enrichment of single nucleotide polymorphisms in the probe sequences used for array hybridization. This artifact was not observed for the modules of co-expressed genes that distinguish amygdala and hippocampus. Conclusions The brain-region specific modules were found to be independent of genetic background and are therefore likely to represent biologically relevant molecular networks that can be studied to complement our knowledge about pathways in neuropsychiatric disease

    Integrated Weighted Gene Co-expression Network Analysis with an Application to Chronic Fatigue Syndrome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Systems biologic approaches such as Weighted Gene Co-expression Network Analysis (WGCNA) can effectively integrate gene expression and trait data to identify pathways and candidate biomarkers. Here we show that the additional inclusion of genetic marker data allows one to characterize network relationships as causal or reactive in a chronic fatigue syndrome (CFS) data set.</p> <p>Results</p> <p>We combine WGCNA with genetic marker data to identify a disease-related pathway and its causal drivers, an analysis which we refer to as "Integrated WGCNA" or IWGCNA. Specifically, we present the following IWGCNA approach: 1) construct a co-expression network, 2) identify trait-related modules within the network, 3) use a trait-related genetic marker to prioritize genes within the module, 4) apply an integrated gene screening strategy to identify candidate genes and 5) carry out causality testing to verify and/or prioritize results. By applying this strategy to a CFS data set consisting of microarray, SNP and clinical trait data, we identify a module of 299 highly correlated genes that is associated with CFS severity. Our integrated gene screening strategy results in 20 candidate genes. We show that our approach yields biologically interesting genes that function in the same pathway and are causal drivers for their parent module. We use a separate data set to replicate findings and use Ingenuity Pathways Analysis software to functionally annotate the candidate gene pathways.</p> <p>Conclusion</p> <p>We show how WGCNA can be combined with genetic marker data to identify disease-related pathways and the causal drivers within them. The systems genetics approach described here can easily be used to generate testable genetic hypotheses in other complex disease studies.</p

    HYTHIRM Radiance Modeling and Image Analyses in Support of STS-119, STS-125 and STS-128 Space Shuttle Hypersonic Re-entries

    Get PDF
    We provide the first geometrically accurate (i.e., 3-D) temperature maps of the entire windward surface of the Space Shuttle during hypersonic reentry. To accomplish this task we began with estimated surface temperatures derived from CFD models at integral high Mach numbers and used them, the Shuttle's surface properties and reasonable estimates of the sensor-to-target geometry to predict the emitted spectral radiance from the surface (in units of W sr-1 m-2 nm-1). These data were converted to sensor counts using properties of the sensor (e.g. aperture, spectral band, and various efficiencies), the expected background, and the atmosphere transmission to inform the optimal settings for the near-infrared and midwave IR cameras on the Cast Glance aircraft. Once these data were collected, calibrated, edited, registered and co-added we formed both 2-D maps of the scene in the above units and 3-D maps of the bottom surface in temperature that could be compared with not only the initial inputs but also thermocouple data from the Shuttle itself. The 3-D temperature mapping process was based on the initial radiance modeling process. Here temperatures were guessed for each node in a well-resolved 3-D framework, a radiance model was produced and compared to the processed imagery, and corrections to the temperature were estimated until the iterative process converged. This process did very well in characterizing the temperature structure of the large asymmetric boundary layer transition the covered much of the starboard bottom surface of STS-119 Discovery. Both internally estimated accuracies and differences with CFD models and thermocouple measurements are at most a few percent. The technique did less well characterizing the temperature structure of the turbulent wedge behind the trip due to limitations in understanding the true sensor resolution. (Note: Those less inclined to read the entire paper are encouraged to read an Executive Summary provided at the end.

    The cerebellum ages slowly according to the epigenetic clock

    Get PDF
    Studies that elucidate why some human tissues age faster than others may shed light on how we age, and ultimately suggest what interventions may be possible. Here we utilize a recent biomarker of aging (referred to as epigenetic clock) to assess the epigenetic ages of up to 30 anatomic sites from supercentenarians (subjects who reached an age of 110 or older) and younger subjects. Using three novel and three published human DNA methylation data sets, we demonstrate that the cerebellum ages more slowly than other parts of the human body. We used both transcriptional data and genetic data to elucidate molecular mechanisms which may explain this finding. The two largest superfamilies of helicases (SF1 and SF2) are significantly over-represented (p=9.2x10-9) among gene transcripts that are over-expressed in the cerebellum compared to other brain regions from the same subject. Furthermore, SNPs that are associated with epigenetic age acceleration in the cerebellum tend to be located near genes from helicase superfamilies SF1 and SF2 (enrichment p=5.8x10-3). Our genetic and transcriptional studies of epigenetic age acceleration support the hypothesis that the slow aging rate of the cerebellum is due to processes that involve RNA helicases

    Maintenance of age in human neurons generated by microRNA-based neuronal conversion of fibroblasts

    Get PDF
    Aging is a major risk factor in many forms of late-onset neurodegenerative disorders. The ability to recapitulate age-related characteristics of human neurons in culture will offer unprecedented opportunities to study the biological processes underlying neuronal aging. Here, we show that using a recently demonstrated microRNA-based cellular reprogramming approach, human fibroblasts from postnatal to near centenarian donors can be efficiently converted into neurons that maintain multiple age-associated signatures. Application of an epigenetic biomarker of aging (referred to as epigenetic clock) to DNA methylation data revealed that the epigenetic ages of fibroblasts were highly correlated with corresponding age estimates of reprogrammed neurons. Transcriptome and microRNA profiles reveal genes differentially expressed between young and old neurons. Further analyses of oxidative stress, DNA damage and telomere length exhibit the retention of age-associated cellular properties in converted neurons from corresponding fibroblasts. Our results collectively demonstrate the maintenance of age after neuronal conversion. DOI: http://dx.doi.org/10.7554/eLife.18648.00

    Biomarkers for Early and Late Stage Chronic Allograft Nephropathy by Proteogenomic Profiling of Peripheral Blood

    Get PDF
    Despite significant improvements in life expectancy of kidney transplant patients due to advances in surgery and immunosuppression, Chronic Allograft Nephropathy (CAN) remains a daunting problem. A complex network of cellular mechanisms in both graft and peripheral immune compartments complicates the non-invasive diagnosis of CAN, which still requires biopsy histology. This is compounded by non-immunological factors contributing to graft injury. There is a pressing need to identify and validate minimally invasive biomarkers for CAN to serve as early predictors of graft loss and as metrics for managing long-term immunosuppression.We used DNA microarrays, tandem mass spectroscopy proteomics and bioinformatics to identify genomic and proteomic markers of mild and moderate/severe CAN in peripheral blood of two distinct cohorts (n = 77 total) of kidney transplant patients with biopsy-documented histology.Gene expression profiles reveal over 2400 genes for mild CAN, and over 700 for moderate/severe CAN. A consensus analysis reveals 393 (mild) and 63 (moderate/severe) final candidates as CAN markers with predictive accuracy of 80% (mild) and 92% (moderate/severe). Proteomic profiles show over 500 candidates each, for both stages of CAN including 302 proteins unique to mild and 509 unique to moderate/severe CAN.This study identifies several unique signatures of transcript and protein biomarkers with high predictive accuracies for mild and moderate/severe CAN, the most common cause of late allograft failure. These biomarkers are the necessary first step to a proteogenomic classification of CAN based on peripheral blood profiling and will be the targets of a prospective clinical validation study

    Protein expression based multimarker analysis of breast cancer samples

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Tissue microarray (TMA) data are commonly used to validate the prognostic accuracy of tumor markers. For example, breast cancer TMA data have led to the identification of several promising prognostic markers of survival time. Several studies have shown that TMA data can also be used to cluster patients into clinically distinct groups. Here we use breast cancer TMA data to cluster patients into distinct prognostic groups.</p> <p>Methods</p> <p>We apply weighted correlation network analysis (WGCNA) to TMA data consisting of 26 putative tumor biomarkers measured on 82 breast cancer patients. Based on this analysis we identify three groups of patients with low (5.4%), moderate (22%) and high (50%) mortality rates, respectively. We then develop a simple threshold rule using a subset of three markers (p53, Na-KATPase-β1, and TGF β receptor II) that can approximately define these mortality groups. We compare the results of this correlation network analysis with results from a standard Cox regression analysis.</p> <p>Results</p> <p>We find that the rule-based grouping variable (referred to as WGCNA*) is an independent predictor of survival time. While WGCNA* is based on protein measurements (TMA data), it validated in two independent Affymetrix microarray gene expression data (which measure mRNA abundance). We find that the WGCNA patient groups differed by 35% from mortality groups defined by a more conventional stepwise Cox regression analysis approach.</p> <p>Conclusions</p> <p>We show that correlation network methods, which are primarily used to analyze the relationships between gene products, are also useful for analyzing the relationships between patients and for defining distinct patient groups based on TMA data. We identify a rule based on three tumor markers for predicting breast cancer survival outcomes.</p

    Gene network interconnectedness and the generalized topological overlap measure

    Get PDF
    BACKGROUND: Network methods are increasingly used to represent the interactions of genes and/or proteins. Genes or proteins that are directly linked may have a similar biological function or may be part of the same biological pathway. Since the information on the connection (adjacency) between 2 nodes may be noisy or incomplete, it can be desirable to consider alternative measures of pairwise interconnectedness. Here we study a class of measures that are proportional to the number of neighbors that a pair of nodes share in common. For example, the topological overlap measure by Ravasz et al. [1] can be interpreted as a measure of agreement between the m = 1 step neighborhoods of 2 nodes. Several studies have shown that two proteins having a higher topological overlap are more likely to belong to the same functional class than proteins having a lower topological overlap. Here we address the question whether a measure of topological overlap based on higher-order neighborhoods could give rise to a more robust and sensitive measure of interconnectedness. RESULTS: We generalize the topological overlap measure from m = 1 step neighborhoods to m ≥ 2 step neighborhoods. This allows us to define the m-th order generalized topological overlap measure (GTOM) by (i) counting the number of m-step neighbors that a pair of nodes share and (ii) normalizing it to take a value between 0 and 1. Using theoretical arguments, a yeast co-expression network application, and a fly protein network application, we illustrate the usefulness of the proposed measure for module detection and gene neighborhood analysis. CONCLUSION: Topological overlap can serve as an important filter to counter the effects of spurious or missing connections between network nodes. The m-th order topological overlap measure allows one to trade-off sensitivity versus specificity when it comes to defining pairwise interconnectedness and network modules

    A Gene Co-Expression Network in Whole Blood of Schizophrenia Patients Is Independent of Antipsychotic-Use and Enriched for Brain-Expressed Genes

    Get PDF
    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1), is located in, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network
    corecore