10 research outputs found

    The errors, insights and lessons of famous AI predictions – and what they mean for the future

    Get PDF
    Predicting the development of artificial intelligence (AI) is a difficult project – but a vital one, according to some analysts. AI predictions already abound: but are they reliable? This paper will start by proposing a decomposition schema for classifying them. Then it constructs a variety of theoretical tools for analysing, judging and improving them. These tools are demonstrated by careful analysis of five famous AI predictions: th

    Comparative genome analysis and gene finding in Candida species using CGOB.

    Get PDF
    The Candida Gene Order Browser (CGOB) was developed as a tool to visualize and analyze synteny relationships in multiple Candida species, and to provide an accurate, manually curated set of orthologous Candida genes for evolutionary analyses. Here, we describe major improvements to CGOB. The underlying structure of the database has been changed significantly. Genomic features are now based directly on genome annotations rather than on protein sequences, which allows non-protein features such as centromere locations in Candida albicans and tRNA genes in all species to be included. The data set has been expanded to 13 species, including genomes of pathogens (C. albicans, C. parapsilosis, C. tropicalis, and C. orthopsilosis), and those of xylose-degrading species with important biotechnological applications (C. tenuis, Scheffersomyces stipitis, and Spathaspora passalidarum). Updated annotations of C. parapsilosis, C. dubliniensis, and Debaryomyces hansenii have been incorporated. We discovered more than 1,500 previously unannotated genes among the 13 genomes, ranging in size from 29 to 3,850 amino acids. Poorly conserved and rapidly evolving genes were also identified. Re-analysis of the mating type loci of the xylose degraders suggests that C. tenuis is heterothallic, whereas both Spa. passalidarum and S. stipitis are homothallic. As well as hosting the browser, the CGOB website (http://cgob.ucd.ie) gives direct access to all the underlying genome annotations, sequences, and curated orthology data

    Systematic discovery of unannotated genes in 11 yeast species using a database of orthologous genomic segments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In standard BLAST searches, no information other than the sequences of the query and the database entries is considered. However, in situations where two genes from different species have only borderline similarity in a BLAST search, the discovery that the genes are located within a region of conserved gene order (synteny) can provide additional evidence that they are orthologs. Thus, for interpreting borderline search results, it would be useful to know whether the syntenic context of a database hit is similar to that of the query. This principle has often been used in investigations of particular genes or genomic regions, but to our knowledge it has never been implemented systematically.</p> <p>Results</p> <p>We made use of the synteny information contained in the Yeast Gene Order Browser database for 11 yeast species to carry out a systematic search for protein-coding genes that were overlooked in the original annotations of one or more yeast genomes but which are syntenic with their orthologs. Such genes tend to have been overlooked because they are short, highly divergent, or contain introns. The key features of our software - called SearchDOGS - are that the database entries are classified into sets of genomic segments that are already known to be orthologous, and that very weak BLAST hits are retained for further analysis if their genomic location is similar to that of the query. Using SearchDOGS we identified 595 additional protein-coding genes among the 11 yeast species, including two new genes in <it>Saccharomyces cerevisiae</it>. We found additional genes for the mating pheromone a-factor in six species including <it>Kluyveromyces lactis</it>.</p> <p>Conclusions</p> <p>SearchDOGS has proven highly successful for identifying overlooked genes in the yeast genomes. We anticipate that our approach can be adapted for study of further groups of species, such as bacterial genomes. More generally, the concept of doing sequence similarity searches against databases to which external information has been added may prove useful in other settings.</p

    Evolutionary erosion of yeast sex chromosomes by mating-type switching accidents

    No full text
    We investigate yeast sex chromosome evolution by comparing genome sequences from 16 species in the family Saccharomycetaceae, including data from genera Tetrapisispora, Kazachstania, Naumovozyma, and Torulaspora. We show that although most yeast species contain a mating-type (MAT) locus and silent HML and HMR loci structurally analogous to those of Saccharomyces cerevisiae, their detailed organization is highly variable and indicates that the MAT locus is a deletion hotspot. Over evolutionary time, chromosomal genes located immediately beside MAT have continually been deleted, truncated, or transposed to other places in the genome in a process that is gradually shortening the distance between MAT and HML. Each time a gene beside MAT is removed by deletion or transposition, the next gene on the chromosome is brought into proximity with MAT and is in turn put at risk for removal. This process has also continually replaced the triplicated sequence regions, called Z and X, that allow HML and HMR to be used as templates for DNA repair at MAT during mating-type switching. We propose that the deletion and transposition events are caused by evolutionary accidents during mating-type switching, combined with natural selection to keep MAT and HML on the same chromosome. The rate of deletion accelerated greatly after whole-genome duplication, probably because genes were redundant and could be deleted without requiring transposition. We suggest that, despite its mutational cost, switching confers an evolutionary benefit by providing a way for an isolated germinating spore to reform spores if the environment is too poor

    Gene finding in Candida species

    Get PDF
    The Candida Gene Order Browser (CGOB) was developed as a tool to visualize and analyze synteny relationships in multiple Candida species, and to provide an accurate, manually curated set of orthologous Candida genes for evolutionary analyses. Here, we describe major improvements to CGOB. The underlying structure of the database has been changed significantly. Genomic features are now based directly on genome annotations rather than on protein sequences, which allows non-protein features such as centromere locations in Candida albicans and tRNA genes in all species to be included. The data set has been expanded to 13 species, including genomes of pathogens (C. albicans, C. parapsilosis, C. tropicalis, and C. orthopsilosis), and those of xylose-degrading species with important biotechnological applications (C. tenuis, Scheffersomyces stipitis, and Spathaspora passalidarum). Updated annotations of C. parapsilosis, C. dubliniensis, and Debaryomyces hansenii have been incorporated. We discovered more than 1,500 previously unannotated genes among the 13 genomes, ranging in size from 29 to 3,850 amino acids. Poorly conserved and rapidly evolving genes were also identified. Re-analysis of the mating type loci of the xylose degraders suggests that C. tenuis is heterothallic, whereas both Spa. passalidarum and S. stipitis are homothallic. As well as hosting the browser, the CGOB website (http://cgob.ucd.ie) gives direct access to all the underlying genome annotations, sequences, and curated orthology data.European Research CouncilScience Foundation IrelandWellcome Trust Computacional Infection Biology PhD programmeAuthor has checked copyrightDM. 9/12/201
    corecore