24 research outputs found

    MetaGS: an accurate method to impute and combine SNP effects across populations using summary statistics

    Get PDF
    Background Meta-analysis describes a category of statistical methods that aim at combining the results of multiple studies to increase statistical power by exploiting summary statistics. Different industries that use genomic prediction do not share their raw data due to logistic or privacy restrictions, which can limit the size of their reference populations and creates a need for a practical meta-analysis method. Results We developed a meta-analysis, named MetaGS, that duplicates the results of multi-trait best linear unbiased prediction (mBLUP) analysis without accessing raw data. MetaGS exploits the correlations among different populations to produce more accurate population-specific single nucleotide polymorphism (SNP) effects. The method improves SNP effect estimations for a given population depending on its relations to other populations. MetaGS was tested on milk, fat and protein yield data of Australian Holstein and Jersey cattle and it generated very similar genomic estimated breeding values to those produced using the mBLUP method for all traits in both breeds. One of the major difficulties when combining SNP effects across populations is the use of different variants for the populations, which limits the applications of meta-analysis in practice. We solved this issue by developing a method to impute missing summary statistics without using raw data. Our results showed that imputing summary statistics can be done with high accuracy (r > 0.9) even when more than 70% of the SNPs were missing with a minimal effect on prediction accuracy. Conclusions We demonstrated that MetaGS can replace the mBLUP model when raw data cannot be shared, which can lead to more flexible collaborations compared to the single-trait BLUP model

    Genetic Dissection of Heat Stress Tolerance in Faba Bean (Vicia faba L.) Using GWAS

    Get PDF
    Heat waves are expected to become more frequent and intense, which will impact faba bean cultivation globally. Conventional breeding methods are effective but take considerable time to achieve breeding goals, and, therefore, the identification of molecular markers associated with key genes controlling heat tolerance can facilitate and accelerate efficient variety development. We phenotyped 134 accessions in six open field experiments during summer seasons at Terbol, Lebanon, at Hudeiba, Sudan, and at Central Ferry, WA, USA from 2015 to 2018. These accessions were genotyped using genotyping by sequencing (GBS), and 10,794 high quality single nucleotide polymorphisms (SNPs) were discovered. These accessions were clustered in one diverse large group, although several discrete groups may exist surrounding it. Fifteen lines belonging to different botanical groups were identified as tolerant to heat. SNPs associated with heat tolerance using single-trait (ST) and multi-trait (MT) genome-wide association studies (GWASs) showed 9 and 11 significant associations, respectively. Through the annotation of the discovered significant SNPs, we found that SNPs from transcription factor helix–loop–helix bHLH143-like S-adenosylmethionine carrier, putative pentatricopeptide repeat-containing protein At5g08310, protein NLP8-like, and photosystem II reaction center PSB28 proteins are associated with heat tolerance

    Genome-wide association study of root-lesion nematodes Pratylenchus species and crown rot Fusarium culmorum in bread wheat

    Get PDF
    Triticum aestivum L., also known as common wheat, is affected by many biotic stresses. Root diseases are the most difficult to tackle due to the complexity of phenotypic evaluation and the lack of resistant sources compared to other biotic stress factors. Soil-borne pathogens such as the root-lesion nematodes caused by the Pratylenchus species and crown rot caused by various Fusarium species are major wheat root diseases, causing substantial yield losses globally. A set of 189 advanced spring bread wheat lines obtained from the International Maize and Wheat Improvement Center (CIMMYT) were genotyped with 4056 single nucleotide polymorphisms (SNP) markers and screened for root-lesion nematodes and crown rot resistance. Population structure revealed that the genotypes could be divided into five subpopulations. Genome-Wide Association Studies were carried out for both resistances to Pratylenchus and Fusarium species. Based on our results, 11 different SNPs on chromosomes 1A, 1B, 2A, 3A, 4A, 5B, and 5D were significantly associated with root-lesion nematode resistance. Seven markers demonstrated association with P. neglectus, while the remaining four were linked to P. thornei resistance. In the case of crown rot, eight different markers on chromosomes 1A, 2B, 3A, 4B, 5B, and 7D were associated with Fusarium crown rot resistance. Identification and screening of root diseases is a challenging task; therefore, the newly identified resistant sources/genotypes could be exploited by breeders to be incorporated in breeding programs. The use of the identified markers in marker-assisted selection could enhance the selection process and cultivar development with root-lesion nematode and crown rot resistance

    Genomic regions associated with herbicide tolerance in a worldwide faba bean (Vicia faba L.) collection

    Get PDF
    Weeds represent one of the major constraints for faba bean crop. The identification of molecular markers associated with key genes imparting tolerance to herbicides can facilitate and fasten the efficient and effective development of herbicide tolerant cultivars. We phenotyped 140 faba bean genotypes in three open field experiments at two locations in Lebanon and Morocco against three herbicide treatments (T1 metribuzin 250 g ai/ha; T2 imazethapyr 75 g ai/ha; T3 untreated) and one in greenhouse where T1 and T3 were applied. The same set was genotyped using genotyping by sequencing (GBS) which yield 10,794 high quality single nucleotide polymorphisms (SNPs). ADMIXTURE software was used to infer the population structure which revealed two ancestral subpopulations. To identify SNPs associated with phenological and yield related traits under herbicide treatments, Single-trait (ST) and Multi-trait (MT) Genome Wide Association Studies (GWAS) were fitted using GEMMA software, showing 10 and 14 highly significant associations, respectively. Genomic sequences containing herbicide tolerance associated SNPs were aligned against the NCBI database using BLASTX tool using default parameters to annotate candidate genes underlying the causal variants. SNPs from acidic endochitinase, LRR receptor-like serine/threonine-protein kinase RCH1, probable serine/threonine-protein kinase NAK, malate dehydrogenase, photosystem I core protein PsaA and MYB-related protein P-like were significantly associated with herbicide tolerance traits

    Optimization of sequence alignment for simple sequence repeat regions

    Get PDF
    Abstract Background Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. Conclusions The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.</p

    Decomposing Additive Genetic Variance Revealed Novel Insights into Trait Evolution in Synthetic Hexaploid Wheat

    No full text
    Whole genome duplication (WGD) is an evolutionary phenomenon, which causes significant changes to genomic structure and trait architecture. In recent years, a number of studies decomposed the additive genetic variance explained by different sets of variants. However, they investigated diploid populations only and none of the studies examined any polyploid organism. In this research, we extended the application of this approach to polyploids, to differentiate the additive variance explained by the three subgenomes and seven sets of homoeologous chromosomes in synthetic allohexaploid wheat (SHW) to gain a better understanding of trait evolution after WGD. Our SHW population was generated by crossing improved durum parents (Triticum turgidum; 2n = 4x = 28, AABB subgenomes) with the progenitor species Aegilops tauschii (syn Ae. squarrosa, T. tauschii; 2n = 2x = 14, DD subgenome). The population was phenotyped for 10 fungal/nematode resistance traits as well as two abiotic stresses. We showed that the wild D subgenome dominated the additive effect and this dominance affected the A more than the B subgenome. We provide evidence that this dominance was not inflated by population structure, relatedness among individuals or by longer linkage disequilibrium blocks observed in the D subgenome within the population used for this study. The cumulative size of the three homoeologs of the seven chromosomal groups showed a weak but significant positive correlation with their cumulative explained additive variance. Furthermore, an average of 69% for each chromosomal group's cumulative additive variance came from one homoeolog that had the highest explained variance within the group across all 12 traits. We hypothesize that structural and functional changes during diploidization may explain chromosomal group relations as allopolyploids keep balanced dosage for many genes. Our results contribute to a better understanding of trait evolution mechanisms in polyploidy, which will facilitate the effective utilization of wheat wild relatives in breeding
    corecore