11 research outputs found

    allele_counts_all_samples

    No full text
    This contains allele counts for known ploidy and presumed diploid samples genotyped by GT-seq

    Estimating microhaplotype allele frequencies from low-coverage or pooled sequencing data

    No full text
    Abstract Background Microhaplotypes have the potential to be more cost-effective than SNPs for applications that require genetic panels of highly variable loci. However, development of microhaplotype panels is hindered by a lack of methods for estimating microhaplotype allele frequency from low-coverage whole genome sequencing or pooled sequencing (pool-seq) data. Results We developed new methods for estimating microhaplotype allele frequency from low-coverage whole genome sequence and pool-seq data. We validated these methods using datasets from three non-model organisms. These methods allowed estimation of allele frequency and expected heterozygosity at depths routinely achieved from pooled sequencing. Conclusions These new methods will allow microhaplotype panels to be designed using low-coverage WGS and pool-seq data to discover and evaluate candidate loci. The python script implementing the two methods and documentation are available at https://www.github.com/delomast/mhFromLowDepSeq

    Data from Zebrafish embryonic development is induced by carp sperm

    No full text
    Data collected during the experiments describe

    Efficient population representation with more genetic markers increases performance of a steelhead (Oncorhynchus mykiss) genetic stock identification baseline

    No full text
    Abstract Genetic stock identification (GSI) is an important fisheries management tool to identify the origin of fish harvested in mixed stock fisheries. Periodic updates of genetic baselines can improve performance via the addition of unsampled or under‐sampled populations and the inclusion of more informative markers. We used a combination of baselines to evaluate how population representation, marker number, and marker type affected the performance and accuracy of genetic stock assignments (self‐assignment, bias, and holdout group tests) for steelhead (Oncorhynchus mykiss) in the Snake River basin. First, we compared the performance of an existing genetic baseline with a newly developed one which had a reduced number of individuals from more populations using the same set of markers. Self‐assignment rates were significantly higher (p < 0.001; +5.4%) for the older, larger baseline, bias did not differ significantly between the two, but there was a significant improvement in performance for the new baseline in holdout results (p < 0.001; mean increase of 25.0%). Second, we compared the performance of the new baseline with increased numbers of genetic markers (~2x increase of single‐nucleotide polymorphisms; SNPs) for the same set of baseline individuals. In this comparison, results produced significantly higher rates of self‐assignment (p < 0.001; +9.7%) but neither bias nor leave‐one‐out were significantly affected. Third, we compared 334 SNPs versus opportunistically discovered microhaplotypes from the same amplicons for the new baseline, and showed the latter produced significantly higher rates of self‐assignment (p < 0.01; +2.6%), similar bias, but slightly lower holdout performance (−0.1%). Combined, we show the performance of genetic baselines can be improved via representative and efficient sampling, that increased marker number consistently improved performance over the original baseline, and that opportunistic discovery of microhaplotypes can lead to small improvements in GSI performance

    Parentage-based tagging improves escapement estimates for ESA-listed adult Chinook Salmon and Steelhead in the Snake River basin

    No full text
    Parentage-based tagging (PBT) is a non-lethal, genetic tagging method that has been successfully applied in hatchery supplemented populations to manage hatchery broodstock and monitor hatchery harvest and straying rates. We show that PBT can also improve the accuracy of escapement estimates by significantly reducing the number of hatchery-origin fish falsely classified as natural-origin. Unlike conventional abundance estimates, which use physical marks and tags to distinguish hatchery individuals from their wild counterparts, PBT identifies origin independent of physical form. We applied PBT to populations of Chinook Salmon (Oncorhynchus tshawytscha) and Steelhead (O. mykiss) which are classified as threatened under the Endangered Species Act and subject to extensive hatchery supplementation efforts. For spawn years 2014-2018, 16,511 adipose-intact Chinook Salmon and 21,953 adipose-intact Steelhead were sampled, and PBT identified 19.6% of returning Chinook Salmon and 8.3% of Steelhead were of hatchery-origin, despite having no physical or mechanical marks. The 90% confidence intervals for escapement estimates of natural-origin Chinook Salmon and Steelhead made with and without corrections using PBT were non-overlapping for nine of ten comparisons indicating that failing to account for unmarked, untagged hatchery-origin fish would result in a significant overestimation of natural abundance.The accepted manuscript in pdf format is listed with the files at the bottom of this page. The presentation of the authors' names and (or) special characters in the title of the manuscript may differ slightly between what is listed on this page and what is listed in the pdf file of the accepted manuscript; that in the pdf file of the accepted manuscript is what was submitted by the author
    corecore