70 research outputs found

    PDNAsite:identification of DNA-binding site from protein sequence by incorporating spatial and sequence context

    Get PDF
    Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community

    EL_PSSM-RT:DNA-binding residue prediction by integrating ensemble learning with PSSM Relation Transformation

    Get PDF
    Background: Prediction of DNA-binding residue is important for understanding the protein-DNA recognition mechanism. Many computational methods have been proposed for the prediction, but most of them do not consider the relationships of evolutionary information between residues. Results: In this paper, we first propose a novel residue encoding method, referred to as the Position Specific Score Matrix (PSSM) Relation Transformation (PSSM-RT), to encode residues by utilizing the relationships of evolutionary information between residues. PDNA-62 and PDNA-224 are used to evaluate PSSM-RT and two existing PSSM encoding methods by five-fold cross-validation. Performance evaluations indicate that PSSM-RT is more effective than previous methods. This validates the point that the relationship of evolutionary information between residues is indeed useful in DNA-binding residue prediction. An ensemble learning classifier (EL_PSSM-RT) is also proposed by combining ensemble learning model and PSSM-RT to better handle the imbalance between binding and non-binding residues in datasets. EL_PSSM-RT is evaluated by five-fold cross-validation using PDNA-62 and PDNA-224 as well as two independent datasets TS-72 and TS-61. Performance comparisons with existing predictors on the four datasets demonstrate that EL_PSSM-RT is the best-performing method among all the predicting methods with improvement between 0.02-0.07 for MCC, 4.18-21.47% for ST and 0.013-0.131 for AUC. Furthermore, we analyze the importance of the pair-relationships extracted by PSSM-RT and the results validates the usefulness of PSSM-RT for encoding DNA-binding residues. Conclusions: We propose a novel prediction method for the prediction of DNA-binding residue with the inclusion of relationship of evolutionary information and ensemble learning. Performance evaluation shows that the relationship of evolutionary information between residues is indeed useful in DNA-binding residue prediction and ensemble learning can be used to address the data imbalance issue between binding and non-binding residues. A web service of EL_PSSM-RT ( http://hlt.hitsz.edu.cn:8080/PSSM-RT_SVM/ ) is provided for free access to the biological research community

    Understanding the Sequence-Dependence of DNA Groove Dimensions: Implications for DNA Interactions

    Get PDF
    BACKGROUND: The B-DNA major and minor groove dimensions are crucial for DNA-protein interactions. It has long been thought that the groove dimensions depend on the DNA sequence, however this relationship has remained elusive. Here, our aim is to elucidate how the DNA sequence intrinsically shapes the grooves. METHODOLOGY/PRINCIPAL FINDINGS: The present study is based on the analysis of datasets of free and protein-bound DNA crystal structures, and from a compilation of NMR (31)P chemical shifts measured on free DNA in solution on a broad range of representative sequences. The (31)P chemical shifts can be interpreted in terms of the BI↔BII backbone conformations and dynamics. The grooves width and depth of free and protein-bound DNA are found to be clearly related to the BI/BII backbone conformational states. The DNA propensity to undergo BI↔BII backbone transitions is highly sequence-dependent and can be quantified at the dinucleotide level. This dual relationship, between DNA sequence and backbone behavior on one hand, and backbone behavior and groove dimensions on the other hand, allows to decipher the link between DNA sequence and groove dimensions. It also firmly establishes that proteins take advantage of the intrinsic DNA groove properties. CONCLUSIONS/SIGNIFICANCE: The study provides a general framework explaining how the DNA sequence shapes the groove dimensions in free and protein-bound DNA, with far-reaching implications for DNA-protein indirect readout in both specific and non specific interactions

    Analysis of Alzheimer's disease severity across brain regions by topological analysis of gene co-expression networks

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Alzheimer's disease (AD) is a progressive neurodegenerative disorder involving variations in the transcriptome of many genes. AD does not affect all brain regions simultaneously. Identifying the differences among the affected regions may shed more light onto the disease progression. We developed a novel method involving the differential topology of gene coexpression networks to understand the association among affected regions and disease severity.</p> <p>Methods</p> <p>We analysed microarray data of four regions - entorhinal cortex (EC), hippocampus (HIP), posterior cingulate cortex (PCC) and middle temporal gyrus (MTG) from AD affected and normal subjects. A coexpression network was built for each region and the topological overlap between them was examined. Genes with zero topological overlap between two region-specific networks were used to characterise the differences between the two regions.</p> <p>Results and conclusion</p> <p>Results indicate that MTG shows early AD pathology compared to the other regions. We postulate that if the MTG gets affected later in the disease, post-mortem analyses of individuals with end-stage AD will show signs of early AD in the MTG, while the EC, HIP and PCC will have severe pathology. Such knowledge is useful for data collection in clinical studies where sample selection is a limiting factor as well as highlighting the underlying biology of disease progression.</p

    Validation of two generic patient-reported outcome measures in patients with type 2 diabetes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Prior to using a generic patient-reported outcome measure (PRO), the measure should be validated within the target population. The purpose of the current study was to validate two generic measures in patients with type 2 diabetes.</p> <p>Methods</p> <p>Patients with type 2 diabetes in Scotland and England completed two generic measures: EQ-5D and Psychological General Well-Being Index (PGWB). Two diabetes-specific measures were administered: ADS and DSC-R. Analyses assessed reliability and validity.</p> <p>Results</p> <p>There were 130 participants (53 Scotland; 77 England; 64% male; mean age = 55.7 years). Responses on the EQ-5D and PGWB reflected moderate impairment consistent with previous diabetes samples: mean EQ-5D Index score, 0.75; EQ-5D VAS, 68.8; PGWB global score, 67.9. All scales of the PGWB demonstrated good internal consistency reliability (Cronbach's alpha = 0.77 to 0.97). The EQ-5D and PGWB demonstrated convergent validity through significant correlations with the ADS (r = 0.48 to 0.61), DSC-R scales (r = 0.33 to 0.81 except ophthalmology subscale), and Body Mass Index (r = 0.15 to 0.38). The EQ-5D and PGWB discriminated between groups of patients known to differ in diabetes-related characteristics (e.g., history of hypoglycemia).</p> <p>Conclusion</p> <p>Results support the use of the EQ-5D and PGWB among patients with type 2 diabetes, possibly in combination with condition-specific measures.</p

    The NSL Complex Regulates Housekeeping Genes in Drosophila

    Get PDF
    MOF is the major histone H4 lysine 16-specific (H4K16) acetyltransferase in mammals and Drosophila. In flies, it is involved in the regulation of X-chromosomal and autosomal genes as part of the MSL and the NSL complexes, respectively. While the function of the MSL complex as a dosage compensation regulator is fairly well understood, the role of the NSL complex in gene regulation is still poorly characterized. Here we report a comprehensive ChIP–seq analysis of four NSL complex members (NSL1, NSL3, MBD-R2, and MCRS2) throughout the Drosophila melanogaster genome. Strikingly, the majority (85.5%) of NSL-bound genes are constitutively expressed across different cell types. We find that an increased abundance of the histone modifications H4K16ac, H3K4me2, H3K4me3, and H3K9ac in gene promoter regions is characteristic of NSL-targeted genes. Furthermore, we show that these genes have a well-defined nucleosome free region and broad transcription initiation patterns. Finally, by performing ChIP–seq analyses of RNA polymerase II (Pol II) in NSL1- and NSL3-depleted cells, we demonstrate that both NSL proteins are required for efficient recruitment of Pol II to NSL target gene promoters. The observed Pol II reduction coincides with compromised binding of TBP and TFIIB to target promoters, indicating that the NSL complex is required for optimal recruitment of the pre-initiation complex on target genes. Moreover, genes that undergo the most dramatic loss of Pol II upon NSL knockdowns tend to be enriched in DNA Replication–related Element (DRE). Taken together, our findings show that the MOF-containing NSL complex acts as a major regulator of housekeeping genes in flies by modulating initiation of Pol II transcription

    Gene regulatory network reveals oxidative stress as the underlying molecular mechanism of type 2 diabetes and hypertension

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The prevalence of diabetes is increasing worldwide. It has been long known that increased rates of inflammatory diseases, such as obesity (OBS), hypertension (HT) and cardiovascular diseases (CVD) are highly associated with type 2 diabetes (T2D). T2D and/or OBS can develop independently, due to genetic, behavioral or lifestyle-related variables but both lead to oxidative stress generation. The underlying mechanisms by which theses complications arise and manifest together remain poorly understood. Protein-protein interactions regulate nearly every living process. Availability of high-throughput genomic data has enabled unprecedented views of gene and protein co-expression, co-regulations and interactions in cellular systems.</p> <p>Methods</p> <p>The present work, applied a systems biology approach to develop gene interaction network models, comprised of high throughput genomic and PPI data for T2D. The genes differentially regulated through T2D were 'mined' and their 'wirings' were studied to get a more complete understanding of the overall gene network topology and their role in disease progression.</p> <p>Results</p> <p>By analyzing the genes related to T2D, HT and OBS, a highly regulated gene-disease integrated network model has been developed that provides useful functional linkages among groups of genes and thus addressing how different inflammatory diseases are connected and propagated at genetic level. Based on the investigations around the 'hubs' that provided more meaningful insights about the cross-talk within gene-disease networks in terms of disease phenotype association with oxidative stress and inflammation, a hypothetical co-regulation disease mechanism model been proposed. The results from this study revealed that the oxidative stress mediated regulation cascade is the common mechanistic link among the pathogenesis of T2D, HT and other inflammatory diseases such as OBS.</p> <p>Conclusion</p> <p>The findings provide a novel comprehensive approach for understanding the pathogenesis of various co-associated chronic inflammatory diseases by combining the power of pathway analysis with gene regulatory network evaluation.</p

    C. elegans VANG-1 Modulates Life Span via Insulin/IGF-1-Like Signaling

    Get PDF
    The planar cell polarity (PCP) pathway is highly conserved from Drosophila to humans and a PCP-like pathway has recently been described in the nematode Caenorhabditis elegans. The developmental function of this pathway is to coordinate the orientation of cells or structures within the plane of an epithelium or to organize cell-cell intercalation required for correct morphogenesis. Here, we describe a novel role of VANG-1, the only C. elegans ortholog of the conserved PCP component Strabismus/Van Gogh. We show that two alleles of vang-1 and depletion of the protein by RNAi cause an increase of mean life span up to 40%. Consistent with the longevity phenotype vang-1 animals also show enhanced resistance to thermal- and oxidative stress and decreased lipofuscin accumulation. In addition, vang-1 mutants show defects like reduced brood size, decreased ovulation rate and prolonged reproductive span, which are also related to gerontogenes. The germline, but not the intestine or neurons, seems to be the primary site of vang-1 function. Life span extension in vang-1 mutants depends on the insulin/IGF-1-like receptor DAF-2 and DAF-16/FoxO transcription factor. RNAi against the phase II detoxification transcription factor SKN-1/Nrf2 also reduced vang-1 life span that might be explained by gradual inhibition of insulin/IGF-1-like signaling in vang-1. This is the first time that a key player of the PCP pathway is shown to be involved in the insulin/IGF-1-like signaling dependent modulation of life span in C. elegans
    corecore