11 research outputs found

    Epistatic and Combinatorial Effects of Pigmentary Gene Mutations in the Domestic Pigeon

    Get PDF
    SummaryUnderstanding the molecular basis of phenotypic diversity is a critical challenge in biology, yet we know little about the mechanistic effects of different mutations and epistatic relationships among loci that contribute to complex traits. Pigmentation genetics offers a powerful model for identifying mutations underlying diversity and for determining how additional complexity emerges from interactions among loci. Centuries of artificial selection in domestic rock pigeons (Columba livia) have cultivated tremendous variation in plumage pigmentation through the combined effects of dozens of loci. The dominance and epistatic hierarchies of key loci governing this diversity are known through classical genetic studies [1–6], but their molecular identities and the mechanisms of their genetic interactions remain unknown. Here we identify protein-coding and cis-regulatory mutations in Tyrp1, Sox10, and Slc45a2 that underlie classical color phenotypes of pigeons and present a mechanistic explanation of their dominance and epistatic relationships. We also find unanticipated allelic heterogeneity at Tyrp1 and Sox10, indicating that color variants evolved repeatedly though mutations in the same genes. These results demonstrate how a spectrum of coding and regulatory mutations in a small number of genes can interact to generate substantial phenotypic diversity in a classic Darwinian model of evolution [7]

    A DNA Replication Mechanism Can Explain Structural Variation at the Pigeon Recessive Red Locus

    No full text
    For species to adapt to their environment, evolution must act upon genetic variation that is present in the population. Elucidating the molecular mechanisms that give rise to this variation is thus of crucial importance for understanding how organisms evolve. In addition to variation caused by point mutations, structural variation (deletions, duplications, inversions, translocations) is also an important source of variety. Mechanisms involving recombination, transposition and retrotransposition, and replication have been proposed for generating structural variation, and each are capable of explaining certain rearrangements. In this study, we conduct a detailed analysis of two partially overlapping rearrangements (e1 and e2 allele) in domestic rock pigeon (Columba livia) which are both associated with the recessive red phenotype. We find that a replicative mechanism is best able to explain the complex architecture of the e1 allele, and is also compatible with the simpler architecture of the e2 allele as well

    Wham: Identifying Structural Variants of Biological Consequence

    No full text
    <div><p>Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools–Lumpy, Delly and SoftSearch–and demonstrate Wham’s ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from github (<a href="https://github.com/zeeev/wham" target="_blank">https://github.com/zeeev/wham</a>), with documentation on a wiki (<a href="http://zeeev.github.io/wham/" target="_blank">http://zeeev.github.io/wham/</a>). For community support please post questions to <a href="https://www.biostars.org/" target="_blank">https://www.biostars.org/</a>.</p></div

    Genotyping assays.

    No full text
    <p><b>A)</b> Comparison of Genome STRiP (GS) genotypes vs. Delly, Lumpy and Wham. The x-axis lists the GS genotype. Different colors denote the zygosity of the Delly, Lumpy, and Wham genotypes. <b>B)</b> The fraction of Chromosome 1 deletions for the NA12878, NA1277, and NA12882 trio that conform to Mendelian inheritance patterns. <b>C)</b> The CEPH/Utah Pedigree 1463 allele frequency (AF) spectrum represented as an empirical cumulative distribution function (ECDF). This curve is derived from Chromosome 1 deletions. FB, Freebayes; UG, Unified Genotyper [<a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1004572#pcbi.1004572.ref049" target="_blank">49</a>].</p

    Wham detects structural variation in vaccinia virus populations.

    No full text
    <p><b>A)</b> Read depth normalized within each sample is plotted across the ~200 kb vaccinia genome (excluding inverted terminal repeats) for either the parental strain (top panel) or an adapted strain (middle and bottom panels, called by Wham or Lumpy, respectively). Arrows highlight the positions of K3L CNV and E3L deletion. The black lines represent the breakpoints of every SV call after filtering (see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1004572#sec016" target="_blank">Supporting Information</a>). <b>B)</b> Wham calls in the adapted strain near the K3L duplication breakpoint are shown as black triangles above the viral genes in colored boxes. The height of the triangle represents split-read (SR) count supporting the call. Sanger sequencing positions relative to the reference sequence are listed below. Asterisks (*) indicate Wham calls that match the exact breakpoint determined by Sanger sequencing (see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1004572#pcbi.1004572.s004" target="_blank">S3 Table</a> for Wham and Lumpy breakpoints). <b>C)</b> Wham calls in the adapted strain near the E3L deletion are shown above the genes, and Sanger sequence confirmed positions below, as in B. The arrow indicates the position of the 11K promoter driving β-gal expression. For breakpoints in grey, the height of the triangle indicates the relative mate-pair count from Wham, as these positions do not have SR support.</p

    Sensitivity and false discovery rates (FDR) for simulated data.

    No full text
    <p>The sensitivity and FDR of Delly, Lumpy, SoftSearch and Wham for simulated deletions, duplications, insertions and inversions. The sensitivity is measured for each category at depths of 10x and 50x. SVs ranging from 50 bp to 1 Mb are grouped into four left-closed size intervals. <b>A)</b> The sensitivity of the three tools is faceted on size, depth and SV type. At 10x Wham has noticeably better sensitivity for deletions and duplications in the smallest size class. Wham’s sensitivity is higher than Delly and Lumpy for insertions at 10x and gains sensitivity at 50x. <b>B)</b> The FDR for each type of SV faceted by depth and the amount of slop added to each confidence interval. In the 25 bp slop category, each confidence interval was extended in both directions by 25 bp. At 10x depth Wham has the highest FDR across all SV classes and Lumpy has the lowest. At 50x Delly has heightened FDR for deletions and Lumpy has a much higher FDR for insertions. Shrinking the confidence intervals increases the FDR for Delly and Lumpy, but not Wham. <b>C)</b> Breakpoint sensitivity for deletions. The confidence intervals, provided by the three tools are ignored and slop is incrementally added to the predicted breakpoints. Wham has the highest sensitivity when 1–10 bp of slop is added. <b>D)</b> Genotype sensitivity for the homozygous non-reference simulated SVs. Delly and Wham have similar sensitivity for deletions and duplications while both tools fail to correctly genotype duplications.</p

    Benchmarking Delly, Lumpy, SoftSearch and Wham against NA12878 and CHM1 datasets.

    No full text
    <p><b>A)</b> The sensitivity and FDR for filtered NA12878 Phase III deletion calls across four size intervals. The number of true positives and the number NA12878 calls are listed above sensitivity, while the total number of false positives and total calls for each tool is listed above FDR. Most true positives and false positives are within the 150–1,000 bp interval. <b>B)</b> The sensitivity and FDR for CHM1 deletions. <b>C)</b> The size distribution of the true positive calls that overlap the CHM1 deletions. One thousand true positives were randomly sampled from each tool and the truth set (CHM1-DEL).</p

    Identification of the <i>e1</i> allele using Wham’s LRT.

    No full text
    <p><b>A)</b> Wham’s LRT interrogates allele frequency differences between recessive red and wild type birds. Genomic scaffolds are denoted by different colors and are sorted by size in increasing order. The highest LRT score (dashed vertical line) is a 7.5-kb deletion upstream of the <i>Sox10</i> gene, which encodes a transcription factor that is critical to the melanin synthesis pathway. Only LRT values above 1.5 are shown in A. <b>B)</b> The quantile-quantile plot after converting Wham’s likelihood ratio values to p-values. <b>C)</b> Scaffold974 association tests from SNPs, Delly SV calls and WHAM SV calls.</p
    corecore