27 research outputs found

    Linking disease-associated genes to regulatory networks via promoter organization

    Get PDF
    Pathway- or disease-associated genes may participate in more than one transcriptional co-regulation network. Such gene groups can be readily obtained by literature analysis or by high-throughput techniques such as microarrays or protein-interaction mapping. We developed a strategy that defines regulatory networks by in silico promoter analysis, finding potentially co-regulated subgroups without a priori knowledge. Pairs of transcription factor binding sites conserved in orthologous genes (vertically) as well as in promoter sequences of co-regulated genes (horizontally) were used as seeds for the development of promoter models representing potential co-regulation. This approach was applied to a Maturity Onset Diabetes of the Young (MODY)-associated gene list, which yielded two models connecting functionally interacting genes within MODY-related insulin/glucose signaling pathways. Additional genes functionally connected to our initial gene list were identified by database searches with these promoter models. Thus, data-driven in silico promoter analysis allowed integrating molecular mechanisms with biological functions of the cell

    Genome-wide analysis of regions similar to promoters of histone genes

    Get PDF
    Background: The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii/ identify in this way genes that have high likelihood of being coregulated with the histone genes. Results: We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters. Conclusions: This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that participate in regulation of histone and histone-coregulated gene transcription initiation. While these hypotheses still remain to be verified, we believe that these form a useful resource for researchers to further explore regulation of human histone genes and human genome. It is worthwhile to note that the regulatory regions of the human genome remain largely un-annotated even today and this study is an attempt to supplement our understanding of histone regulatory regions.Statistic

    MMP-20 Is Predominately a Tooth-Specific Enzyme with a Deep Catalytic Pocket that Hydrolyzes Type V Collagen †

    Get PDF
    Matrix metalloproteinase-20 (MMP-20, enamelysin) has a highly restricted pattern of expression. In healthy tissues, MMP-20 is observed in the enamel organ and pulp organ of developing teeth and is present only as an activated enzyme. To identify other tissues that may express MMP-20, we performed a systematic mouse tissue expression screen. Among the non-tooth tissues assayed, MMP-20 transcripts were only identified in minute quantities within the large intestine. The murine Mmp20 promoter was cloned, sequenced and assessed for potential tooth-specific regulatory elements. In silico analysis identified four promoter modules that were common to Mmp20 and at least two of three co-regulated predominantly tooth-specific genes that encode ameloblastin, amelogenin, and enamelin. We asked if the highly restricted MMP-20 expression pattern was associated with a broad substrate specificity that might preclude its expression in other tissues. An iterative mixture-based random doedecamer peptide library screen with Edman sequencing of MMP-20 cleavage products revealed that, among MMPs previously screened, MMP-20 had unique substrate preferences. These preferences indicate that MMP-20 has a deep and wide catalytic pocket that can accommodate substrates with large aromatic residues in the P1′ position. Based on matrices derived from the peptide library data, we identified and then confirmed that Type V collagen is an MMP-20 substrate. Since Type V collagen is not present in dental enamel but is an otherwise widely distributed collagen, and since only active MMP-20 has been observed in teeth, our data suggests that control of MMP-20 activity is primarily regulated by transcriptional means

    Statistical significance of cis-regulatory modules

    Get PDF
    BACKGROUND: It is becoming increasingly important for researchers to be able to scan through large genomic regions for transcription factor binding sites or clusters of binding sites forming cis-regulatory modules. Correspondingly, there has been a push to develop algorithms for the rapid detection and assessment of cis-regulatory modules. While various algorithms for this purpose have been introduced, most are not well suited for rapid, genome scale scanning. RESULTS: We introduce methods designed for the detection and statistical evaluation of cis-regulatory modules, modeled as either clusters of individual binding sites or as combinations of sites with constrained organization. In order to determine the statistical significance of module sites, we first need a method to determine the statistical significance of single transcription factor binding site matches. We introduce a straightforward method of estimating the statistical significance of single site matches using a database of known promoters to produce data structures that can be used to estimate p-values for binding site matches. We next introduce a technique to calculate the statistical significance of the arrangement of binding sites within a module using a max-gap model. If the module scanned for has defined organizational parameters, the probability of the module is corrected to account for organizational constraints. The statistical significance of single site matches and the architecture of sites within the module can be combined to provide an overall estimation of statistical significance of cis-regulatory module sites. CONCLUSION: The methods introduced in this paper allow for the detection and statistical evaluation of single transcription factor binding sites and cis-regulatory modules. The features described are implemented in the Search Tool for Occurrences of Regulatory Motifs (STORM) and MODSTORM software

    CLC-2 single nucleotide polymorphisms (SNPs) as potential modifiers of cystic fibrosis disease severity

    Get PDF
    BACKGROUND: Cystic fibrosis (CF) lung disease manifest by impaired chloride secretion leads to eventual respiratory failure. Candidate genes that may modify CF lung disease severity include alternative chloride channels. The objectives of this study are to identify single nucleotide polymorphisms (SNPs) in the airway epithelial chloride channel, CLC-2, and correlate these polymorphisms with CF lung disease. METHODS: The CLC-2 promoter, intron 1 and exon 20 were examined for SNPs in adult CF dF508/dF508 homozygotes with mild and severe lung disease (forced expiratory volume at one second (FEV1) > 70% and < 40%). RESULTS: PCR amplification of genomic CLC-2 and sequence analysis revealed 1 polymorphism in the hClC -2 promoter, 4 in intron 1, and none in exon 20. Fisher's analysis within this data set, did not demonstrate a significant relationship between the severity of lung disease and SNPs in the CLC-2 gene. CONCLUSIONS: CLC-2 is not a key modifier gene of CF lung phenotype. Further studies evaluating other phenotypes associated with CF may be useful in the future to assess the ability of CLC-2 to modify CF disease severity

    Beschreibung genomischer Regulationsschalter mit bioinformatischen Methoden Abschlussbericht

    No full text
    SIGLEAvailable from TIB Hannover: DtF QN1(92,42) / FIZ - Fachinformationszzentrum Karlsruhe / TIB - Technische InformationsbibliothekBundesministerium fuer Bildung, Wissenschaft, Forschung und Technologie, Bonn (Germany)DEGerman
    corecore