35 research outputs found
Detecting heterozygosity in shotgun genome assemblies: Lessons from obligately outcrossing nematodes
The majority of nematodes are gonochoristic (dioecious) with distinct male and female sexes, but the best-studied species, Caenorhabditis elegans, is a self-fertile hermaphrodite. The sequencing of the genomes of C. elegans and a second hermaphrodite, C. briggsae, was facilitated in part by the low amount of natural heterozygosity, which typifies selfing species. Ongoing genome projects for gonochoristic Caenorhabditis species seek to approximate this condition by intense inbreeding prior to sequencing. Here we show that despite this inbreeding, the heterozygous fraction of the whole genome shotgun assemblies of three gonochoristic Caenorhabditis species, C. brenneri, C. remanei, and C. japonica, is considerable. We first demonstrate experimentally that independently assembled sequence variants in C. remanei and C. brenneri are allelic. We then present gene-based approaches for recognizing heterozygous regions of WGS assemblies. We also develop a simple method for quantifying heterozygosity that can be applied to assemblies lacking gene annotations. Consistently we find that ∼10% and 30% of the C. remanei and C. brenneri genomes, respectively, are represented by two alleles in the assemblies. Heterozygosity is restricted to autosomes and its retention is accompanied by substantial inbreeding depression, suggesting that it is caused by multiple recessive deleterious alleles and not merely by chance. Both the overall amount and chromosomal distribution of heterozygous DNA is highly variable between assemblies of close relatives produced by identical methodologies, and allele frequencies have continued to change after strains were sequenced. Our results highlight the impact of mating systems on genome sequencing projects
Recommended from our members
Enabling precision medicine in neonatology, an integrated repository for preterm birth research.
Preterm birth, or the delivery of an infant prior to 37 weeks of gestation, is a significant cause of infant morbidity and mortality. In the last decade, the advent and continued development of molecular profiling technologies has enabled researchers to generate vast amount of 'omics' data, which together with integrative computational approaches, can help refine the current knowledge about disease mechanisms, diagnostics, and therapeutics. Here we describe the March of Dimes' Database for Preterm Birth Research (http://www.immport.org/resources/mod), a unique resource that contains a variety of 'omics' datasets related to preterm birth. The database is open publicly, and as of January 2018, links 13 molecular studies with data across tens of thousands of patients from 6 measurement modalities. The data in the repository are highly diverse and include genomic, transcriptomic, immunological, and microbiome data. Relevant datasets are augmented with additional molecular characterizations of almost 25,000 biological samples from public databases. We believe our data-sharing efforts will lead to enhanced research collaborations and coordination accelerating the overall pace of discovery in preterm birth research
Recommended from our members
MetaCyto: A Tool for Automated Meta-analysis of Mass and Flow Cytometry Data
While meta-analysis has demonstrated increased statistical power and more robust estimations in studies, the application of this commonly accepted methodology to cytometry data has been challenging. Different cytometry studies often involve diverse sets of markers. Moreover, the detected values of the same marker are inconsistent between studies due to different experimental designs and cytometer configurations. As a result, the cell subsets identified by existing auto-gating methods cannot be directly compared across studies. We developed MetaCyto for automated meta-analysis of both flow and mass cytometry (CyTOF) data. By combining clustering methods with a silhouette scanning method, MetaCyto is able to identify commonly labeled cell subsets across studies, thus enabling meta-analysis. Applying MetaCyto across a set of ten heterogeneous cytometry studies totaling 2,926 samples enabled us to identify multiple cell populations exhibiting differences in abundance between demographic groups
Study protocol for the multicentre cohorts of Zika virus infection in pregnant women, infants, and acute clinical cases in Latin America and the Caribbean: The ZIKAlliance consortium
Background: The European Commission (EC) Horizon 2020 (H2020)-funded ZIKAlliance Consortium designed a multicentre study including pregnant women (PW), children (CH) and natural history (NH) cohorts. Clinical sites were selected over a wide geographic range within Latin America and the Caribbean, taking into account the dynamic course of the ZIKV epidemic. Methods: Recruitment to the PW cohort will take place in antenatal care clinics. PW will be enrolled regardless of symptoms and followed over the course of pregnancy, approximately every 4 weeks. PW will be revisited at delivery (or after miscarriage/abortion) to assess birth outcomes, including microcephaly and other congenital abnormalities according to the evolving definition of congenital Zika syndrome (CZS). After birth, children will be followed for 2 years in the CH cohort. Follow-up visits are scheduled at ages 1-3, 4-6, 12, and 24 months to assess neurocognitive and developmental milestones. In addition, a NH cohort for the characterization of symptomatic rash/fever illness was designed, including follow-up to capture persisting health problems. Blood, urine, and other biological materials will be collected, and tested for ZIKV and other relevant arboviral diseases (dengue, chikungunya, yellow fever) using RT-PCR or serological methods. A virtual, decentralized biobank will be created. Reciprocal clinical monitoring has been established between partner sites. Substudies of ZIKV seroprevalence, transmissio
Full-genome evolutionary histories of selfing, splitting, and selection in Caenorhabditis.
Simplification and Desexualization of Gene Expression in Self-Fertile Nematodes
SummaryEvolutionary transitions between sexual modes could be potent forces in genome evolution [1, 2]. Several Caenorhabditis nematode species have evolved self-fertile hermaphrodites from the obligately outcrossing females of their ancestors [3]. We explored the relationship between sexual mode and global gene expression by comparing two selfing species, C. elegans and C. briggsae, with three phylogenetically informative outcrossing relatives, C. remanei, C. brenneri, and C. japonica. Adult transcriptome assemblies from the selfing species are consistently and strikingly smaller than those of the outcrossing species. Against this background of overall simplification, genes conserved in multiple outcrossing species with strong sex-biased expression are even more likely to be missing from the genomes of the selfing species. In addition, the sexual regulation of remaining transcripts has diverged markedly from the ancestral pattern in both selfing lineages, though in distinct ways. Thus, both the complexity and the sexual specialization of transciptomes are rapidly altered in response to the evolution of self-fertility. These changes may result from the combination of relaxed sexual selection and a recently reported genetic mechanism favoring genome shrinkage [4] in partial selfers
Reproductive Mode and the Evolution of Genome Size and Structure in Caenorhabditis Nematodes.
The self-fertile nematode worms Caenorhabditis elegans, C. briggsae, and C. tropicalis evolved independently from outcrossing male-female ancestors and have genomes 20-40% smaller than closely related outcrossing relatives. This pattern of smaller genomes for selfing species and larger genomes for closely related outcrossing species is also seen in plants. We use comparative genomics, including the first high quality genome assembly for an outcrossing member of the genus (C. remanei) to test several hypotheses for the evolution of genome reduction under a change in mating system. Unlike plants, it does not appear that reductions in the number of repetitive elements, such as transposable elements, are an important contributor to the change in genome size. Instead, all functional genomic categories are lost in approximately equal proportions. Theory predicts that self-fertilization should equalize the effective population size, as well as the resulting effects of genetic drift, between the X chromosome and autosomes. Contrary to this, we find that the self-fertile C. briggsae and C. elegans have larger intergenic spaces and larger protein-coding genes on the X chromosome when compared to autosomes, while C. remanei actually has smaller introns on the X chromosome than either self-reproducing species. Rather than being driven by mutational biases and/or genetic drift caused by a reduction in effective population size under self reproduction, changes in genome size in this group of nematodes appear to be caused by genome-wide patterns of gene loss, most likely generated by genomic adaptation to self reproduction per se
Full-genome evolutionary histories of selfing, splitting, and selection in Caenorhabditis
Recommended from our members
Rapid genome shrinkage in a self-fertile nematode reveals sperm competition proteins
To reveal impacts of sexual mode on genome content, we compared chromosome-scale assemblies of the outcrossing nematode Caenorhabditis nigoni to its self-fertile sibling species, C. briggsaeC. nigoni's genome resembles that of outcrossing relatives but encodes 31% more protein-coding genes than C. briggsaeC. nigoni genes lacking C. briggsae orthologs were disproportionately small and male-biased in expression. These include the male secreted short (mss) gene family, which encodes sperm surface glycoproteins conserved only in outcrossing species. Sperm from mss-null males of outcrossing C. remanei failed to compete with wild-type sperm, despite normal fertility in noncompetitive mating. Restoring mss to C. briggsae males was sufficient to enhance sperm competitiveness. Thus, sex has a pervasive influence on genome content that can be used to identify sperm competition factors