94 research outputs found

    Bioinformatic identification of novel putative photoreceptor specific cis-elements

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Cell specific gene expression is largely regulated by different combinations of transcription factors that bind <it>cis</it>-elements in the upstream promoter sequence. However, experimental detection of <it>cis</it>-elements is difficult, expensive, and time-consuming. This provides a motivation for developing bioinformatic methods to identify <it>cis</it>-elements that could prioritize future experimental studies. Here, we use motif discovery algorithms to predict transcription factor binding sites involved in regulating the differences between murine rod and cone photoreceptor populations.</p> <p>Results</p> <p>To identify highly conserved motifs enriched in promoters that drive expression in either rod or cone photoreceptors, we assembled a set of murine rod-specific, cone-specific, and non-photoreceptor background promoter sequences. These sets were used as input to a newly devised motif discovery algorithm called Iterative Alignment/Modular Motif Selection (IAMMS). Using IAMMS, we predicted 34 motifs that may contribute to rod-specific (19 motifs) or cone-specific (15 motifs) expression patterns. Of these, 16 rod- and 12 cone-specific motifs were found in clusters near the transcription start site. New findings include the observation that cone promoters tend to contain TATA boxes, while rod promoters tend to be TATA-less (exempting <it>Rho </it>and <it>Cnga1</it>). Additionally, we identify putative sites for IL-6 effectors (in rods) and RXR family members (in cones) that can explain experimental data showing changes to cell-fate by activating these signaling pathways during rod/cone development. Two of the predicted motifs (NRE and ROP2) have been confirmed experimentally to be involved in cell-specific expression patterns. We provide a full database of predictions as additional data that may contain further valuable information. IAMMS predictions are compared with existing motif discovery algorithms, DME and BioProspector. We find that over 60% of IAMMS predictions are confirmed by at least one other motif discovery algorithm.</p> <p>Conclusion</p> <p>We predict novel, putative <it>cis-</it>elements enriched in the promoter of rod-specific or cone-specific genes. These are candidate binding sites for transcription factors involved in maintaining functional differences between rod and cone photoreceptor populations.</p

    The dryas iulia genome supports multiple gains of a W chromosome from a B chromosome in butterflies

    Get PDF
    In butterflies and moths, which exhibit highly variable sex determination mechanisms, the homogametic Z chromosome is deeply conserved and is featured in many genome assemblies. The evolution and origin of the female W sex chromosome, however, remains mostly unknown. Previous studies have proposed that a ZZ/Z0 sex determination system is ancestral to Lepidoptera, and that W chromosomes may originate from sex-linked B chromosomes. Here, we sequence and assemble the female Dryas iulia genome into 32 highly contiguous ordered and oriented chromosomes, including the Z and W sex chromosomes. We then use sex-specific Hi-C, ATAC-seq, PRO-seq, and whole-genome DNA sequence data sets to test if features of the D. iulia W chromosome are consistent with a hypothesized B chromosome origin. We show that the putative W chromosome displays female-associated DNA sequence, gene expression, and chromatin accessibility to confirm the sex-linked function of the W sequence. In contrast with expectations from studies of homologous sex chromosomes, highly repetitive DNA content on the W chromosome, the sole presence of domesticated repetitive elements in functional DNA, and lack of sequence homology with the Z chromosome or autosomes is most consistent with a B chromosome origin for the W, although it remains challenging to rule out extensive sequence divergence. Synteny analysis of the D. iulia W chromosome with other female lepidopteran genome assemblies shows no homology between W chromosomes and suggests multiple, independent origins of the W chromosome from a B chromosome likely occurred in butterflies

    Chromosome fusion affects genetic diversity and evolutionary turnover of functional loci, but consistently depends on chromosome size

    Get PDF
    Major changes in chromosome number and structure are linked to a series of evolutionary phenomena, including intrinsic barriers to gene flow or suppression of recombination due to chromosomal rearrangements. However, chromosome rearrangements can also affect the fundamental dynamics of molecular evolution within populations by changing relationships between linked loci and altering rates of recombination. Here, we build chromosome-level assembly Eueides isabella and, together with a recent chromosome-level assembly of Dryas iulia, examine the evolutionary consequences of multiple chromosome fusions in Heliconius butterflies. These assemblies pinpoint fusion points on 10 of the 20 autosomal chromosomes and reveal striking differences in the characteristics of fused and unfused chromosomes. The ten smallest autosomes in D. iulia and E. isabella, which have each fused to a longer chromosome in Heliconius, have higher repeat and GC content, and longer introns than predicted by their chromosome length. When fused, these characteristics change to become more in line with chromosome length. The fusions also led to reduced diversity, which likely reflects increased background selection and selection against introgression between diverging populations, following a reduction in per-base recombination rate. We further show that chromosome size and fusion impact turnover rates of functional loci at a macroevolutionary scale. Together these results provide further evidence that chromosome fusion in Heliconius likely had dramatic effects on population level processes shaping rates of neutral and adaptive divergence. These effects may have impacted patterns of diversification in Heliconius, a classic example of an adaptive radiation

    Deconvolution of Expression for Nascent RNA sequencing data (DENR) highlights pre-RNA isoform diversity in human cells.

    Get PDF
    MOTIVATION: Quantification of isoform abundance has been extensively studied at the mature-RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. RESULTS: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA sequencing data (DENR), which models nascent RNA sequencing read counts as a mixture of user-provided isoforms. The baseline algorithm is enhanced by machine-learning predictions of active transcription start sites and an adjustment for the typical "shape profile" of read counts along a transcription unit. We show that DENR outperforms simple read-count-based methods for estimating gene and isoform abundances, and that transcription of multiple pre-RNA isoforms per gene is widespread, with frequent differences between cell types. In addition, we provide evidence that a majority of human isoform diversity derives from primary transcription rather than from post-transcriptional processes. AVAILABILITY: DENR and nascentRNASim are freely available at https://github.com/CshlSiepelLab/DENR (version v1.0.0) and https://github.com/CshlSiepelLab/nascentRNASim (version v0.3.0). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online

    Characterizing RNA stability genome-wide through combined analysis of PRO-seq and RNA-seq data.

    Get PDF
    BACKGROUND: The concentrations of distinct types of RNA in cells result from a dynamic equilibrium between RNA synthesis and decay. Despite the critical importance of RNA decay rates, current approaches for measuring them are generally labor-intensive, limited in sensitivity, and/or disruptive to normal cellular processes. Here, we introduce a simple method for estimating relative RNA half-lives that is based on two standard and widely available high-throughput assays: Precision Run-On sequencing (PRO-seq) and RNA sequencing (RNA-seq). RESULTS: Our method treats PRO-seq as a measure of transcription rate and RNA-seq as a measure of RNA concentration, and estimates the rate of RNA decay required for a steady-state equilibrium. We show that this approach can be used to assay relative RNA half-lives genome-wide, with good accuracy and sensitivity for both coding and noncoding transcription units. Using a structural equation model (SEM), we test several features of transcription units, nearby DNA sequences, and nearby epigenomic marks for associations with RNA stability after controlling for their effects on transcription. We find that RNA splicing-related features are positively correlated with RNA stability, whereas features related to miRNA binding and DNA methylation are negatively correlated with RNA stability. Furthermore, we find that a measure based on U1 binding and polyadenylation sites distinguishes between unstable noncoding and stable coding transcripts but is not predictive of relative stability within the mRNA or lincRNA classes. We also identify several histone modifications that are associated with RNA stability. CONCLUSION: We introduce an approach for estimating the relative half-lives of individual RNAs. Together, our estimation method and systematic analysis shed light on the pervasive impacts of RNA stability on cellular RNA concentrations

    Identification of gene co-regulatory modules and associated cis-elements involved in degenerative heart disease

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Cardiomyopathies, degenerative diseases of cardiac muscle, are among the leading causes of death in the developed world. Microarray studies of cardiomyopathies have identified up to several hundred genes that significantly alter their expression patterns as the disease progresses. However, the regulatory mechanisms driving these changes, in particular the networks of transcription factors involved, remain poorly understood. Our goals are (A) to identify modules of co-regulated genes that undergo similar changes in expression in various types of cardiomyopathies, and (B) to reveal the specific pattern of transcription factor binding sites, <it>cis</it>-elements, in the proximal promoter region of genes comprising such modules.</p> <p>Methods</p> <p>We analyzed 149 microarray samples from human hypertrophic and dilated cardiomyopathies of various etiologies. Hierarchical clustering and Gene Ontology annotations were applied to identify modules enriched in genes with highly correlated expression and a similar physiological function. To discover motifs that may underly changes in expression, we used the promoter regions for genes in three of the most interesting modules as input to motif discovery algorithms. The resulting motifs were used to construct a probabilistic model predictive of changes in expression across different cardiomyopathies.</p> <p>Results</p> <p>We found that three modules with the highest degree of functional enrichment contain genes involved in myocardial contraction (n = 9), energy generation (n = 20), or protein translation (n = 20). Using motif discovery tools revealed that genes in the contractile module were found to contain a TATA-box followed by a CACC-box, and are depleted in other GC-rich motifs; whereas genes in the translation module contain a pyrimidine-rich initiator, Elk-1, SP-1, and a novel motif with a GCGC core. Using a naïve Bayes classifier revealed that patterns of motifs are statistically predictive of expression patterns, with odds ratios of 2.7 (contractile), 1.9 (energy generation), and 5.5 (protein translation).</p> <p>Conclusion</p> <p>We identified patterns comprised of putative <it>cis</it>-regulatory motifs enriched in the upstream promoter sequence of genes that undergo similar changes in expression secondary to cardiomyopathies of various etiologies. Our analysis is a first step towards understanding transcription factor networks that are active in regulating gene expression during degenerative heart disease.</p

    Measurement of Charge Asymmetries in Charmless Hadronic in B Meson Decays

    Full text link
    We search for CP-violating asymmetries (Acp) in the B meson decays to K+- pi-+, K+- pi0, Ks pi+-, K+- eta', and omega pi+-. Using 9.66 million Upsilon(4S) decays collected with the CLEO detector, the statistical precision on Acp is in the range of \pm 0.12 to \pm 0.25 depending on decay mode. While CP-violating asymmetries of up to \pm 0.5 are possible within the Standard Model, the measured asymmetries are consistent with zero in all five decay modes studied.Comment: 10 pages, 3 figure
    corecore