96 research outputs found

    A Method for the Simultaneous Estimation of Selection Intensities in Overlapping Genes

    Get PDF
    Inferring the intensity of positive selection in protein-coding genes is important since it is used to shed light on the process of adaptation. Recently, it has been reported that overlapping genes, which are ubiquitous in all domains of life, seem to exhibit inordinate degrees of positive selection. Here, we present a new method for the simultaneous estimation of selection intensities in overlapping genes. We show that the appearance of positive selection is caused by assuming that selection operates independently on each gene in an overlapping pair, thereby ignoring the unique evolutionary constraints on overlapping coding regions. Our method uses an exact evolutionary model, thereby voiding the need for approximation or intensive computation. We test the method by simulating the evolution of overlapping genes of different types as well as under diverse evolutionary scenarios. Our results indicate that the independent estimation approach leads to the false appearance of positive selection even though the gene is in reality subject to negative selection. Finally, we use our method to estimate selection in two influenza A genes for which positive selection was previously inferred. We find no evidence for positive selection in both cases

    Absence of pathogenic mitochondrial DNA mutations in mouse brain tumors

    Get PDF
    BACKGROUND: Somatic mutations in the mitochondrial genome occur in numerous tumor types including brain tumors. These mutations are generally found in the hypervariable regions I and II of the displacement loop and unlikely alter mitochondrial function. Two hypervariable regions of mononucleotide repeats occur in the mouse mitochondrial genome, i.e., the origin of replication of the light strand (O(L)) and the Arg tRNA. METHODS: In this study we examined the entire mitochondrial genome in a series of chemically induced brain tumors in the C57BL/6J strain and spontaneous brain tumors in the VM mouse strain. The tumor mtDNA was compared to that of mtDNA in brain mitochondrial populations from the corresponding syngeneic mouse host strain. RESULTS: Direct sequencing revealed a few homoplasmic base pair insertions, deletions, and substitutions in the tumor cells mainly in regions of mononucleotide repeats. A heteroplasmic mutation in the 16srRNA gene was detected in a spontaneous metastatic VM brain tumor. CONCLUSION: None of the mutations were considered pathogenic, indicating that mtDNA somatic mutations do not likely contribute to the initiation or progression of these diverse mouse brain tumors

    Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST

    Get PDF
    BACKGROUND: TBLASTN is a mode of operation for BLAST that aligns protein sequences to a nucleotide database translated in all six frames. We present the first description of the modern implementation of TBLASTN, focusing on new techniques that were used to implement composition-based statistics for translated nucleotide searches. Composition-based statistics use the composition of the sequences being aligned to generate more accurate E-values, which allows for a more accurate distinction between true and false matches. Until recently, composition-based statistics were available only for protein-protein searches. They are now available as a command line option for recent versions of TBLASTN and as an option for TBLASTN on the NCBI BLAST web server. RESULTS: We evaluate the statistical and retrieval accuracy of the E-values reported by a baseline version of TBLASTN and by two variants that use different types of composition-based statistics. To test the statistical accuracy of TBLASTN, we ran 1000 searches using scrambled proteins from the mouse genome and a database of human chromosomes. To test retrieval accuracy, we modernize and adapt to translated searches a test set previously used to evaluate the retrieval accuracy of protein-protein searches. We show that composition-based statistics greatly improve the statistical accuracy of TBLASTN, at a small cost to the retrieval accuracy. CONCLUSION: TBLASTN is widely used, as it is common to wish to compare proteins to chromosomes or to libraries of mRNAs. Composition-based statistics improve the statistical accuracy, and therefore the reliability, of TBLASTN results. The algorithms used by TBLASTN are not widely known, and some of the most important are reported here. The data used to test TBLASTN are available for download and may be useful in other studies of translated search algorithms

    The Prevalence and Regulation of Antisense Transcripts in Schizosaccharomyces pombe

    Get PDF
    A strand-specific transcriptome sequencing strategy, directional ligation sequencing or DeLi-seq, was employed to profile antisense transcriptome of Schizosaccharomyces pombe. Under both normal and heat shock conditions, we found that polyadenylated antisense transcripts are broadly expressed while distinct expression patterns were observed for protein-coding and non-coding loci. Dominant antisense expression is enriched in protein-coding genes involved in meiosis or stress response pathways. Detailed analyses further suggest that antisense transcripts are independently regulated with respect to their sense transcripts, and diverse mechanisms might be potentially involved in the biogenesis and degradation of antisense RNAs. Taken together, antisense transcription may have profound impacts on global gene regulation in S. pombe
    corecore