15 research outputs found

    A New Chicken Genome Assembly Provides Insight into Avian Genome Structure

    Get PDF
    The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts

    A putative RA-like region in the brain of the scale-backed antbird, Willisornis poecilinotus (Furnariides, Suboscines, Passeriformes, Thamnophilidae)

    No full text
    The memorization and production of song in songbirds share important parallels with the process of speech acquisition in humans. In songbirds, these processes are dependent on a group of specialized telencephalic nuclei known as the song system: HVC (used as a proper name), RA (robust nucleus of arcopallium), LMAN (lateral magnocellular nucleus of the nidopallium) and striatal Area X. A recent study suggested that the arcopallium of the Sayornis phoebe, a non vocal learner suboscine species, contains a nucleus with some properties similar to those of songbird RA, suggesting that the song system may have been present in the last common ancestor of these groups. Here we report morphological and gene expression evidence that a region with some properties similar to RA is present in another suboscine, the Amazonian endemic Willisornis poecilinotus. Specifically, a discrete domain with a distinct Nissl staining pattern and that expresses the RA marker RGS4 was found in the arcopallium where the oscine RA is localized. Our findings, combined with the previous report on the S. phoebe, suggest that an arcopallial region with some RA-like properties was present in the ancestor of both Suboscines infraorders Tyranni and Furnarii, and is possibly an ancestral feature of Passeriformes

    Relating enhancer genetic variation across mammals to complex phenotypes using machine learning

    No full text
    Protein-coding differences between species often fail to explain phenotypic diversity, suggesting the involvement of genomic elements that regulate gene expression such as enhancers. Identifying associations between enhancers and phenotypes is challenging because enhancer activity can be tissue-dependent and functionally conserved despite low sequence conservation. We developed the Tissue-Aware Conservation Inference Toolkit (TACIT) to associate candidate enhancers with species' phenotypes using predictions from machine learning models trained on specific tissues. Applying TACIT to associate motor cortex and parvalbumin-positive interneuron enhancers with neurological phenotypes revealed dozens of enhancer-phenotype associations, including brain size-associated enhancers that interact with genes implicated in microcephaly or macrocephaly. TACIT provides a foundation for identifying enhancers associated with the evolution of any convergently evolved phenotype in any large group of species with aligned genomes

    Dense sampling of bird diversity increases power of comparative genomics

    Get PDF
    Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity(1-4). Sparse taxon sampling has previously been proposed to confound phylogenetic inference(5), and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families-including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species. A dataset of the genomes of 363 species from the Bird 10,000 Genomes Project shows increased power to detect shared and lineage-specific variation, demonstrating the importance of phylogenetically diverse taxon sampling in whole-genome sequencing.Peer reviewe

    Evolutionary constraint and innovation across hundreds of placental mammals.

    No full text
    Zoonomia is the largest comparative genomics resource for mammals produced to date. By aligning genomes for 240 species, we identify bases that, when mutated, are likely to affect fitness and alter disease risk. At least 332 million bases (~10.7%) in the human genome are unusually conserved across species (evolutionarily constrained) relative to neutrally evolving repeats, and 4552 ultraconserved elements are nearly perfectly conserved. Of 101 million significantly constrained single bases, 80% are outside protein-coding exons and half have no functional annotations in the Encyclopedia of DNA Elements (ENCODE) resource. Changes in genes and regulatory elements are associated with exceptional mammalian traits, such as hibernation, that could inform therapeutic development. Earth\u27s vast and imperiled biodiversity offers distinctive power for identifying genetic variants that affect genome function and organismal phenotypes
    corecore