49 research outputs found

    Secondary Evolve and Resequencing : An Experimental Confirmation of Putative Selection Targets without Phenotyping

    Get PDF
    Evolve and resequencing (E&R) studies investigate the genomic responses of adaptation during experimental evolution. Because replicate populations evolve in the same controlled environment, consistent responses to selection across replicates are frequently used to identify reliable candidate regions that underlie adaptation to a new environment. However, recent work demonstrated that selection signatures can be restricted to one or a few replicate(s) only. These selection signatures frequently have weak statistical support, and given the difficulties of functional validation, additional evidence is needed before considering them as candidates for functional analysis. Here, we introduce an experimental procedure to validate candidate loci with weak or replicate-specific selection signature(s). Crossing an evolved population from a primary E&R experiment to the ancestral founder population reduces the frequency of candidate alleles that have reached a high frequency. We hypothesize that genuine selection targets will experience a repeatable frequency increase after the mixing with the ancestral founders if they are exposed to the same environment (secondary E&R experiment). Using this approach, we successfully validate two overlapping selection targets, which showed a mutually exclusive selection signature in a primary E&R experiment of Drosophila simulans adapting to a novel temperature regime. We conclude that secondary E&R experiments provide a reliable confirmation of selection signatures that either are not replicated or show only a low statistical significance in a primary E&R experiment unless epistatic interactions predominate. Such experiments are particularly helpful to prioritize candidate loci for time-consuming functional follow-up investigations.Peer reviewe

    Combining evidence of selection with association analysis increases power to detect regions influencing complex traits in dairy cattle

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Hitchhiking mapping and association studies are two popular approaches to map genotypes to phenotypes. In this study we combine both approaches to complement their specific strengths and weaknesses, resulting in a method with higher statistical power and fewer false positive signals. We applied our approach to dairy cattle as they underwent extremely successful selection for milk production traits and since an excellent phenotypic record is available. We performed whole genome association tests with a new mixed model approach to account for stratification, which we validated via Monte Carlo simulations. Selection signatures were inferred with the integrated haplotype score and a locus specific permutation based integrated haplotype score that works with a folded frequency spectrum and provides a formal test of signifance to identify selection signatures.</p> <p>Results</p> <p>About 1,600 out of 34,851 SNPs showed signatures of selection and the locus specific permutation based integrated haplotype score showed overall good accordance with the whole genome association study. Each approach provides distinct information about the genomic regions that influence complex traits. Combining whole genome association with hitchhiking mapping yielded two significant loci for the trait protein yield. These regions agree well with previous results from other selection signature scans and whole genome association studies in cattle.</p> <p>Conclusion</p> <p>We show that the combination of whole genome association and selection signature mapping based on the same SNPs increases the power to detect loci influencing complex traits. The locus specific permutation based integrated haplotype score provides a formal test of significance in selection signature mapping. Importantly it does not rely on knowledge of ancestral and derived allele states.</p

    Genome-wide association studies using copy number variants in Brown Swiss Dairy cattle.

    Get PDF
    Detecting Copy Number Variation (CNV) in cattle provides the opportunity to study their association with quantitative traits (Winchester et al., 2009; Zhang et al., 2009; Hou et al., 2011; Clop et al., 2012; de Almeida et al., 2016;). The aim of this study was to map CNVs in 1,410 Brown Swiss males and females using Illumina BovineHD Genotyping BeadChip data and to perform a genome-wide association analysis for production functional and health traits. After quality control, CNVs were called with the GoldenHelix SVS 8.3.1 and PennCNV software and were summarized to CNV regions (CNVRs) at a population level, using BEDTools. Additionally, common CNVRs between the two software were set as consensus. CNV-association studies were executed with the CNVRuler software using a linear regression model. Genes within significant associated CNVRs for each trait were annotated with a GO analysis using the DAVID Bioinformatics Resources 6.7.The quality control filtered out 294 samples. The GoldenHelix SVS 8.3.1 software identified 25,030 CNVs summarized to 398 CNVRs while PennCNV identified 62,341 CNVs summarized to 5,578 CNVRs. A total of 127 CNVRs were identified to be significantly associated with one or more of the evaluated traits. The result of this study is a comprehensive genomic analysis of the Brown Swiss breed, which enriches the bovine CNV map in its genome. Finally, the results of the association studies deliver new information for quantitative traits considered in selection programs of the Brown Swiss breed

    A high-resolution CNV map across Brown Swiss cattle populations.

    Get PDF
    Genomic studies and their use in selection programs are having a strong impact in dairy cattle selection (E. Liu et al., 2010). The first aim was to create a high resolution map of CNV regions (CNVRs) in Brown Swiss cattle and the characterization of identified CNVs as markers for quantitative and population genetic studies. CNVs were called in a set of 164 sires with PennCNV and genoCN. PennCNV identified 2,377 CNVRs comprising 1,162 and 1,131 gain and loss events, respectively, and 84 regions of complex nature. GenoCN detected 41,519 CNVRs comprising 3,475 and 34,485 gain and loss events, respectively, and 3,559 regions of complex ones. Consensus calls between algorithms were summarized to CNVRs at the population level. GenoCN was also used to identify total allelic content in consensus CNVRs. Moreover, population haplotype frequencies were calculated. Linkage disequilibrium (LD) was established between CNVs and SNPs in and around CNVRs. In this study the potential contribution of CNVs as genetic markers for genome wide association studies (GWAS) has been assessed thanks to PIC and LD values. The next aim is to investigate genomic structural variation in cattle using dense SNP information in more than 1000 samples of the Italian and Swiss Brown Swiss breed genotyped on HD Bovine BeadChips. Today there is still no CNV map available across Brown Swiss populations belonging to different countries. This study therefore expands the catalogue of CNVRs in the bovine genome, delivers an international based high-resolution map of CNVRs specific to Brown Swiss dairy cattle and will lastly provide information for GEBV estimation with CNVs

    Comparison of variant calling methods for whole genome sequencing data in dairy cattle

    Get PDF
    Accurate identification of SNPs from next-generation sequencing data is crucial for high-quality downstream analysis. Whole genome sequence data of 65 key ancestors of genotyped Swiss dairy populations were available for investigation (24 billion reads, 96.8% mapped to UMD31, 12x coverage). Four publically available variant calling programmes were assessed and different levels of pre-calling handling for each method were tested and compared. SNP concordance was examined with Illumina’s BovineHD Genotyping BeadChip¼. Depending on variant calling software used, between 16,894,054 and 22,048,382 SNP were identified (multi-sample calling). A total of 14,644,310 SNP were identified by all four variant callers (multi-sample calling). InDel counts ranged from 1,997,791 to 2,857,754; 1,708,649 InDels were identified by all four variant callers. A minimum of pre-calling data handling resulted in the highest non-reference sensitivity and the lowest non-reference discrepancy rates

    The importance of identity-by-state information for the accuracy of genomic selection

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>It is commonly assumed that prediction of genome-wide breeding values in genomic selection is achieved by capitalizing on linkage disequilibrium between markers and QTL but also on genetic relationships. Here, we investigated the reliability of predicting genome-wide breeding values based on population-wide linkage disequilibrium information, based on identity-by-descent relationships within the known pedigree, and to what extent linkage disequilibrium information improves predictions based on identity-by-descent genomic relationship information.</p> <p>Methods</p> <p>The study was performed on milk, fat, and protein yield, using genotype data on 35 706 SNP and deregressed proofs of 1086 Italian Brown Swiss bulls. Genome-wide breeding values were predicted using a genomic identity-by-state relationship matrix and a genomic identity-by-descent relationship matrix (averaged over all marker loci). The identity-by-descent matrix was calculated by linkage analysis using one to five generations of pedigree data.</p> <p>Results</p> <p>We showed that genome-wide breeding values prediction based only on identity-by-descent genomic relationships within the known pedigree was as or more reliable than that based on identity-by-state, which implicitly also accounts for genomic relationships that occurred before the known pedigree. Furthermore, combining the two matrices did not improve the prediction compared to using identity-by-descent alone. Including different numbers of generations in the pedigree showed that most of the information in genome-wide breeding values prediction comes from animals with known common ancestors less than four generations back in the pedigree.</p> <p>Conclusions</p> <p>Our results show that, in pedigreed breeding populations, the accuracy of genome-wide breeding values obtained by identity-by-descent relationships was not improved by identity-by-state information. Although, in principle, genomic selection based on identity-by-state does not require pedigree data, it does use the available pedigree structure. Our findings may explain why the prediction equations derived for one breed may not predict accurate genome-wide breeding values when applied to other breeds, since family structures differ among breeds.</p

    Nonsense-Mediated Decay Enables Intron Gain in Drosophila

    Get PDF
    Intron number varies considerably among genomes, but despite their fundamental importance, the mutational mechanisms and evolutionary processes underlying the expansion of intron number remain unknown. Here we show that Drosophila, in contrast to most eukaryotic lineages, is still undergoing a dramatic rate of intron gain. These novel introns carry significantly weaker splice sites that may impede their identification by the spliceosome. Novel introns are more likely to encode a premature termination codon (PTC), indicating that nonsense-mediated decay (NMD) functions as a backup for weak splicing of new introns. Our data suggest that new introns originate when genomic insertions with weak splice sites are hidden from selection by NMD. This mechanism reduces the sequence requirement imposed on novel introns and implies that the capacity of the spliceosome to recognize weak splice sites was a prerequisite for intron gain during eukaryotic evolution

    Meta-analysis of genome-wide association studies for cattle stature identifies common genes that regulate body size in mammals

    Get PDF
    peer-reviewedH.D.D., A.J.C., P.J.B. and B.J.H. would like to acknowledge the Dairy Futures Cooperative Research Centre for funding. H.P. and R.F. acknowledge funding from the German Federal Ministry of Education and Research (BMBF) within the AgroClustEr ‘Synbreed—Synergistic Plant and Animal Breeding’ (grant 0315527B). H.P., R.F., R.E. and K.-U.G. acknowledge the Arbeitsgemeinschaft SĂŒddeutscher RinderzĂŒchter, the Arbeitsgemeinschaft Österreichischer FleckviehzĂŒchter and ZuchtData EDV Dienstleistungen for providing genotype data. A. Bagnato acknowledges the European Union (EU) Collaborative Project LowInputBreeds (grant agreement 222623) for providing Brown Swiss genotypes. Braunvieh Schweiz is acknowledged for providing Brown Swiss phenotypes. H.P. and R.F. acknowledge the German Holstein Association (DHV) and the ConfederaciĂłn de Asociaciones de Frisona Española (CONCAFE) for sharing genotype data. H.P. was financially supported by a postdoctoral fellowship from the Deutsche Forschungsgemeinschaft (DFG) (grant PA 2789/1-1). D.B. and D.C.P. acknowledge funding from the Research Stimulus Fund (11/S/112) and Science Foundation Ireland (14/IA/2576). M.S. and F.S.S. acknowledge the Canadian Dairy Network (CDN) for providing the Holstein genotypes. P.S. acknowledges funding from the Genome Canada project entitled ‘Whole Genome Selection through Genome Wide Imputation in Beef Cattle’ and acknowledges WestGrid and Compute/Calcul Canada for providing computing resources. J.F.T. was supported by the National Institute of Food and Agriculture, US Department of Agriculture, under awards 2013-68004-20364 and 2015-67015-23183. A. Bagnato, F.P., M.D. and J.W. acknowledge EU Collaborative Project Quantomics (grant 516 agreement 222664) for providing Brown Swiss and Finnish Ayrshire sequences and genotypes. A.C.B. and R.F.V. acknowledge funding from the public–private partnership ‘Breed4Food’ (code BO-22.04-011- 001-ASG-LR) and EU FP7 IRSES SEQSEL (grant 317697). A.C.B. and R.F.V. acknowledge CRV (Arnhem, the Netherlands) for providing data on Dutch and New Zealand Holstein and Jersey bulls.Stature is affected by many polymorphisms of small effect in humans1. In contrast, variation in dogs, even within breeds, has been suggested to be largely due to variants in a small number of genes2,3. Here we use data from cattle to compare the genetic architecture of stature to those in humans and dogs. We conducted a meta-analysis for stature using 58,265 cattle from 17 populations with 25.4 million imputed whole-genome sequence variants. Results showed that the genetic architecture of stature in cattle is similar to that in humans, as the lead variants in 163 significantly associated genomic regions (P < 5 × 10−8) explained at most 13.8% of the phenotypic variance. Most of these variants were noncoding, including variants that were also expression quantitative trait loci (eQTLs) and in ChIP–seq peaks. There was significant overlap in loci for stature with humans and dogs, suggesting that a set of common genes regulates body size in mammals
    corecore