159 research outputs found

    Studying Functions of All Yeast Genes Simultaneously

    Get PDF
    A method of studying the functions of all the genes of a given species of microorganism simultaneously has been developed in experiments on Saccharomyces cerevisiae (commonly known as baker's or brewer's yeast). It is already known that many yeast genes perform functions similar to those of corresponding human genes; therefore, by facilitating understanding of yeast genes, the method may ultimately also contribute to the knowledge needed to treat some diseases in humans. Because of the complexity of the method and the highly specialized nature of the underlying knowledge, it is possible to give only a brief and sketchy summary here. The method involves the use of unique synthetic deoxyribonucleic acid (DNA) sequences that are denoted as DNA bar codes because of their utility as molecular labels. The method also involves the disruption of gene functions through deletion of genes. Saccharomyces cerevisiae is a particularly powerful experimental system in that multiple deletion strains easily can be pooled for parallel growth assays. Individual deletion strains recently have been created for 5,918 open reading frames, representing nearly all of the estimated 6,000 genetic loci of Saccharomyces cerevisiae. Tagging of each deletion strain with one or two unique 20-nucleotide sequences enables identification of genes affected by specific growth conditions, without prior knowledge of gene functions. Hybridization of bar-code DNA to oligonucleotide arrays can be used to measure the growth rate of each strain over several cell-division generations. The growth rate thus measured serves as an index of the fitness of the strain

    MachiBase: a Drosophila melanogaster 5β€²-end mRNA transcription database

    Get PDF
    MachiBase (http://machibase.gi.k.u-tokyo.ac.jp/) provides a comprehensive and freely accessible resource regarding Drosophila melanogaster 5β€²-end mRNA transcription at different developmental states, supporting studies on the variabilities of promoter transcriptional activities and gene-expression profiles in the fruitfly. The data were generated in conjunction with the recently developed high-throughput genome sequencer Illumina/Solexa using a newly developed 5β€²-end mRNA collection method

    Improving comparability between microarray probe signals by thermodynamic intensity correction

    Get PDF
    Signals from different oligonucleotide probes against the same target show great variation in intensities. However, detection of differences along a sequence e.g. to reveal intron/exon architecture, transcription boundary as well as simple absent/present calls depends on comparisons between different probes. It is therefore of great interest to correct for the variation between probes. Much of this variation is sequence dependent. We demonstrate that a thermodynamic model for hybridization of either DNA or RNA to a DNA microarray, which takes the sequence-dependent probe affinities into account significantly reduces the signal fluctuation between probes targeting the same gene transcript. For a test set of tightly tiled yeast genes, the model reduces the variance by up to a factor ∼1/3. As a consequence of this reduction, the model is shown to yield a more accurate determination of transcription start sites for a subset of yeast genes. In another application, we identify present/absent calls for probes hybridized to the sequenced Escherichia coli strain O157:H7 EDL933. The model improves the correct calls from 85 to 95% relative to raw intensity measures. The model thus makes applications which depend on comparisons between probes aimed at different sections of the same target more reliable

    Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

    Get PDF
    Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

    Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome

    Get PDF
    Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome

    Non-Coding RNA Prediction and Verification in Saccharomyces cerevisiae

    Get PDF
    Non-coding RNA (ncRNA) play an important and varied role in cellular function. A significant amount of research has been devoted to computational prediction of these genes from genomic sequence, but the ability to do so has remained elusive due to a lack of apparent genomic features. In this work, thermodynamic stability of ncRNA structural elements, as summarized in a Z-score, is used to predict ncRNA in the yeast Saccharomyces cerevisiae. This analysis was coupled with comparative genomics to search for ncRNA genes on chromosome six of S. cerevisiae and S. bayanus. Sets of positive and negative control genes were evaluated to determine the efficacy of thermodynamic stability for discriminating ncRNA from background sequence. The effect of window sizes and step sizes on the sensitivity of ncRNA identification was also explored. Non-coding RNA gene candidates, common to both S. cerevisiae and S. bayanus, were verified using northern blot analysis, rapid amplification of cDNA ends (RACE), and publicly available cDNA library data. Four ncRNA transcripts are well supported by experimental data (RUF10, RUF11, RUF12, RUF13), while one additional putative ncRNA transcript is well supported but the data are not entirely conclusive. Six candidates appear to be structural elements in 5β€² or 3β€² untranslated regions of annotated protein-coding genes. This work shows that thermodynamic stability, coupled with comparative genomics, can be used to predict ncRNA with significant structural elements

    Automatic Annotation of Spatial Expression Patterns via Sparse Bayesian Factor Models

    Get PDF
    Advances in reporters for gene expression have made it possible to document and quantify expression patterns in 2D–4D. In contrast to microarrays, which provide data for many genes but averaged and/or at low resolution, images reveal the high spatial dynamics of gene expression. Developing computational methods to compare, annotate, and model gene expression based on images is imperative, considering that available data are rapidly increasing. We have developed a sparse Bayesian factor analysis model in which the observed expression diversity of among a large set of high-dimensional images is modeled by a small number of hidden common factors. We apply this approach on embryonic expression patterns from a Drosophila RNA in situ image database, and show that the automatically inferred factors provide for a meaningful decomposition and represent common co-regulation or biological functions. The low-dimensional set of factor mixing weights is further used as features by a classifier to annotate expression patterns with functional categories. On human-curated annotations, our sparse approach reaches similar or better classification of expression patterns at different developmental stages, when compared to other automatic image annotation methods using thousands of hard-to-interpret features. Our study therefore outlines a general framework for large microscopy data sets, in which both the generative model itself, as well as its application for analysis tasks such as automated annotation, can provide insight into biological questions

    GAMETOPHYTE DEFECTIVE 1, a Putative Subunit of RNases P/MRP, Is Essential for Female Gametogenesis and Male Competence in Arabidopsis

    Get PDF
    RNA biogenesis, including biosynthesis and maturation of rRNA, tRNA and mRNA, is a fundamental process that is critical for cell growth, division and differentiation. Previous studies showed that mutations in components involved in RNA biogenesis resulted in abnormalities in gametophyte and leaf development in Arabidopsis. In eukaryotes, RNases P/MRP (RNase mitochondrial RNA processing) are important ribonucleases that are responsible for processing of tRNA, and transcription of small non-coding RNAs. Here we report that Gametophyte Defective 1 (GAF1), a gene encoding a predicted protein subunit of RNases P/MRP, AtRPP30, plays a role in female gametophyte development and male competence. Embryo sacs were arrested at stages ranging from FG1 to FG7 in gaf1 mutant, suggesting that the progression of the gametophytic division during female gametogenesis was impaired in gaf1 mutant. In contrast, pollen development was not affected in gaf1. However, the fitness of the mutant pollen tube was weaker than that of the wild-type, leading to reduced transmission through the male gametes. GAF1 is featured as a typical RPP30 domain protein and interacts physically with AtPOP5, a homologue of RNases P/MRP subunit POP5 of yeast. Together, our data suggest that components of the RNases P/MRP family, such as RPP30, play important roles in gametophyte development and function in plants

    Cis-by-Trans Regulatory Divergence Causes the Asymmetric Lethal Effects of an Ancestral Hybrid Incompatibility Gene

    Get PDF
    The Dobzhansky and Muller (D-M) model explains the evolution of hybrid incompatibility (HI) through the interaction between lineage-specific derived alleles at two or more loci. In agreement with the expectation that HI results from functional divergence, many protein-coding genes that contribute to incompatibilities between species show signatures of adaptive evolution, including Lhr, which encodes a heterochromatin protein whose amino acid sequence has diverged extensively between Drosophila melanogaster and D. simulans by natural selection. The lethality of D. melanogaster/D. simulans F1 hybrid sons is rescued by removing D. simulans Lhr, but not D. melanogaster Lhr, suggesting that the lethal effect results from adaptive evolution in the D. simulans lineage. It has been proposed that adaptive protein divergence in Lhr reflects antagonistic coevolution with species-specific heterochromatin sequences and that defects in LHR protein localization cause hybrid lethality. Here we present surprising results that are inconsistent with this coding-sequence-based model. Using Lhr transgenes expressed under native conditions, we find no evidence that LHR localization differs between D. melanogaster and D. simulans, nor do we find evidence that it mislocalizes in their interspecific hybrids. Rather, we demonstrate that Lhr orthologs are differentially expressed in the hybrid background, with the levels of D. simulans Lhr double that of D. melanogaster Lhr. We further show that this asymmetric expression is caused by cis-by-trans regulatory divergence of Lhr. Therefore, the non-equivalent hybrid lethal effects of Lhr orthologs can be explained by asymmetric expression of a molecular function that is shared by both orthologs and thus was presumably inherited from the ancestral allele of Lhr. We present a model whereby hybrid lethality occurs by the interaction between evolutionarily ancestral and derived alleles

    Functional conservation of the Drosophila hybrid incompatibility gene Lhr

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Hybrid incompatibilities such as sterility and lethality are commonly modeled as being caused by interactions between two genes, each of which has diverged separately in one of the hybridizing lineages. The gene <it>Lethal hybrid rescue </it>(<it>Lhr</it>) encodes a rapidly evolving heterochromatin protein that causes lethality of hybrid males in crosses between <it>Drosophila melanogaster </it>females and <it>D. simulans </it>males. Previous genetic analyses showed that hybrid lethality is caused by <it>D. simulans Lhr </it>but not by <it>D. melanogaster Lhr</it>, confirming a critical prediction of asymmetry in the evolution of a hybrid incompatibility gene.</p> <p>Results</p> <p>Here we have examined the functional properties of <it>Lhr </it>orthologs from multiple Drosophila species, including interactions with other heterochromatin proteins, localization to heterochromatin, and ability to complement hybrid rescue in <it>D. melanogaster</it>/<it>D. simulans </it>hybrids. We find that these properties are conserved among most <it>Lhr </it>orthologs, including <it>Lhr </it>from <it>D. melanogaster</it>, <it>D. simulans </it>and the outgroup species <it>D. yakuba</it>.</p> <p>Conclusions</p> <p>We conclude that evolution of the hybrid lethality properties of <it>Lhr </it>between <it>D. melanogaster </it>and <it>D. simulans </it>did not involve extensive loss or gain of functions associated with protein interactions or localization to heterochromatin.</p
    • …
    corecore