74 research outputs found

    PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments

    Get PDF
    PAL2NAL is a web server that constructs a multiple codon alignment from the corresponding aligned protein sequences. Such codon alignments can be used to evaluate the type and rate of nucleotide substitutions in coding DNA for a wide range of evolutionary analyses, such as the identification of levels of selective constraint acting on genes, or to perform DNA-based phylogenetic studies. The server takes a protein sequence alignment and the corresponding DNA sequences as input. In contrast to other existing applications, this server is able to construct codon alignments even if the input DNA sequence has mismatches with the input protein sequence, or contains untranslated regions and polyA tails. The server can also deal with frame shifts and inframe stop codons in the input models, and is thus suitable for the analysis of pseudogenes. Another distinct feature is that the user can specify a subregion of the input alignment in order to specifically analyze functional domains or exons of interest. The PAL2NAL server is available at

    Identification and Analysis of Genes and Pseudogenes within Duplicated Regions in the Human and Mouse Genomes

    Get PDF
    The identification and classification of genes and pseudogenes in duplicated regions still constitutes a challenge for standard automated genome annotation procedures. Using an integrated homology and orthology analysis independent of current gene annotation, we have identified 9,484 and 9,017 gene duplicates in human and mouse, respectively. On the basis of the integrity of their coding regions, we have classified them into functional and inactive duplicates, allowing us to define the first consistent and comprehensive collection of 1,811 human and 1,581 mouse unprocessed pseudogenes. Furthermore, of the total of 14,172 human and mouse duplicates predicted to be functional genes, as many as 420 are not included in current reference gene databases and therefore correspond to likely novel mammalian genes. Some of these correspond to partial duplicates with less than half of the length of the original source genes, yet they are conserved and syntenic among different mammalian lineages. The genes and unprocessed pseudogenes obtained here will enable further studies on the mechanisms involved in gene duplication as well as of the fate of duplicated genes

    Selective maintenance of Drosophila tandemly arranged duplicated genes during evolution

    Get PDF
    Genes occurring in conserved, tandemly-arrayed clusters in Drosophila melanogaster are co-expressed to a much higher extent than other duplicated genes

    Non-random retention of protein-coding overlapping genes in Metazoa

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Although the overlap of transcriptional units occurs frequently in eukaryotic genomes, its evolutionary and biological significance remains largely unclear. Here we report a comparative analysis of overlaps between genes coding for well-annotated proteins in five metazoan genomes (human, mouse, zebrafish, fruit fly and worm).</p> <p>Results</p> <p>For all analyzed species the observed number of overlapping genes is always lower than expected assuming functional neutrality, suggesting that gene overlap is negatively selected. The comparison to the random distribution also shows that retained overlaps do not exhibit random features: antiparallel overlaps are significantly enriched, while overlaps lying on the same strand and those involving coding sequences are highly underrepresented. We confirm that overlap is mostly species-specific and provide evidence that it frequently originates through the acquisition of terminal, non-coding exons. Finally, we show that overlapping genes tend to be significantly co-expressed in a breast cancer cDNA library obtained by 454 deep sequencing, and that different overlap types display different patterns of reciprocal expression.</p> <p>Conclusion</p> <p>Our data suggest that overlap between protein-coding genes is selected against in Metazoa. However, when retained it may be used as a species-specific mechanism for the reciprocal regulation of neighboring genes. The tendency of overlaps to involve non-coding regions of the genes leads to the speculation that the advantages achieved by an overlapping arrangement may be optimized by evolving regulatory non-coding transcripts.</p

    Differential lactate and cholesterol synthetic activities in XY and XX Sertoli cells

    Get PDF
    SRY, a sex-determining gene, induces testis development in chromosomally female (XX) individuals. However, mouse XX Sertoli cells carrying Sry (XX/Sry Sertoli cells) are incapable of fully supporting germ cell development, even when the karyotype of the germ cells is XY. While it has therefore been assumed that XX/Sry Sertoli cells are not functionally equivalent to XY Sertoli cells, it has remained unclear which specific functions are affected. To elucidate the functional difference, we compared the gene expression of XY and XX/Sry Sertoli cells. Lactate and cholesterol metabolisms, essential for nursing the developing germ cells, were down-regulated in XX/Sry cells, which appears to be caused at least in part by the differential expression of histone modification enzymes SMCX/SMCY (H3K4me3 demethylase) and UTX/UTY (H3K27me3 demethylase) encoded by the sex chromosomes. We suggest that down-regulation of lactate and cholesterol metabolism that may be due to altered epigenetic modification affects the nursing functions of XX/Sry Sertoli cells.This work was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI Grant Number 21249018 and 16H05142 (K. Mo.), Ministry of Education, Culture, Sports, Science, and Technology, Japan (MEXT) KAKENHI Grant Number 22132002 (K. Mo.), the Uehara Memorial Foundation, and Takeda Science Foundation (T.B.)

    Ad4BP/SF-1 regulates cholesterol synthesis to boost the production of steroids

    Get PDF
    Housekeeping metabolic pathways such as glycolysis are active in all cell types. In addition, many types of cells are equipped with cell-specific metabolic pathways. To properly perform their functions, housekeeping and cell-specific metabolic pathways must function cooperatively. However, the regulatory mechanisms that couple metabolic pathways remain largely unknown. Recently, we showed that the steroidogenic cell-specific nuclear receptor Ad4BP/ SF-1, which regulates steroidogenic genes, also regulates housekeeping glycolytic genes. Here, we identify cholesterogenic genes as the targets of Ad4BP/SF-1. Further, we reveal that Ad4BP/SF-1 regulates Hummr, a candidate mediator of cholesterol transport from endoplasmic reticula to mitochondria. Given that cholesterol is the starting material for steroidogenesis and is synthesized from acetyl-CoA, which partly originates from glucose, our results suggest that multiple biological processes involved in synthesizing steroid hormones are governed by Ad4BP/SF-1. To our knowledge, this study provides the first example where housekeeping and cell-specific metabolism are coordinated at the transcriptional level.This work was supported by Grants 16H05142 (K.M.), 17H06427 (K.M.), 16K08593 (T.B.), and 17J03270 (M.I.) from the Japan Society for the Promotion of Science (JSPS) KAKENHI; The Uehara Memorial Foundation (K.M.); Takeda Science Foundation (T.B.); The Shin-Nihon of Advanced Medical Research (T.B.).Supplementary information accompanies this paper at https://doi.org/10.1038/s42003-018-0020-z

    A network of conserved co-occurring motifs for the regulation of alternative splicing

    Get PDF
    Cis-acting short sequence motifs play important roles in alternative splicing. It is now possible to identify such sequence motifs as conserved sequence patterns in genome sequence alignments. Here, we report the systematic search for motifs in the neighboring introns of alternatively spliced exons by using comparative analysis of mammalian genome alignments. We identified 11 conserved sequence motifs that might be involved in the regulation of alternative splicing. These motifs are not only significantly overrepresented near alternatively spliced exons, but they also co-occur with each other, thus, forming a network of cis-elements, likely to be the basis for context-dependent regulation. Based on this finding, we applied the motif co-occurrence to predict alternatively skipped exons. We verified exon skipping in 29 cases out of 118 predictions (25%) by EST and mRNA sequences in the databases. For the predictions not verified by the database sequences, we confirmed exon skipping in 10 additional cases by using both RT–PCR experiments and the publicly available RNA-Seq data. These results indicate that even more alternative splicing events will be found with the progress of large-scale and high-throughput analyses for various tissue samples and developmental stages

    Erratum: Corrigendum: Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution

    Get PDF
    International Chicken Genome Sequencing Consortium. The Original Article was published on 09 December 2004. Nature432, 695–716 (2004). In Table 5 of this Article, the last four values listed in the ‘Copy number’ column were incorrect. These should be: LTR elements, 30,000; DNA transposons, 20,000; simple repeats, 140,000; and satellites, 4,000. These errors do not affect any of the conclusions in our paper. Additional information. The online version of the original article can be found at 10.1038/nature0315
    corecore