89 research outputs found
In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?
Background: There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (~23.8 Gb/C). [br/]
Methodology/Principal Findings: A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). [br/]
Conclusions/Significance: This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome
Association mapping of spot blotch resistance in wild barley
Spot blotch, caused by Cochliobolus sativus, is an important foliar disease of barley. The disease has been controlled for over 40 years through the deployment of cultivars with durable resistance derived from the line NDB112. Pathotypes of C. sativus with virulence for the NDB112 resistance have been detected in Canada; thus, many commercial cultivars are vulnerable to spot blotch epidemics. To increase the diversity of spot blotch resistance in cultivated barley, we evaluated 318 diverse wild barley accessions comprising the Wild Barley Diversity Collection (WBDC) for reaction to C. sativus at the seedling stage and utilized an association mapping (AM) approach to identify and map resistance loci. A high frequency of resistance was found in the WBDC as 95% (302/318) of the accessions exhibited low infection responses. The WBDC was genotyped with 558 Diversity Array Technology (DArT®) and 2,878 single nucleotide polymorphism (SNP) markers and subjected to structure analysis before running the AM procedure. Thirteen QTL for spot blotch resistance were identified with DArT and SNP markers. These QTL were found on chromosomes 1H, 2H, 3H, 5H, and 7H and explained from 2.3 to 3.9% of the phenotypic variance. Nearly half of the identified QTL mapped to chromosome bins where spot blotch resistance loci were previously reported, offering some validation for the AM approach. The other QTL mapped to unique genomic regions and may represent new spot blotch resistance loci. This study demonstrates that AM is an effective technique for identifying and mapping QTL for disease resistance in a wild crop progenitor
A sequence-based genetic linkage map as a reference for Brassica rapa pseudochromosome assembly
<p>Abstract</p> <p>Background</p> <p><it>Brassica rapa </it>is an economically important crop and a model plant for studies concerning polyploidization and the evolution of extreme morphology. The multinational <it>B. rapa </it>Genome Sequencing Project (BrGSP) was launched in 2003. In 2008, next generation sequencing technology was used to sequence the <it>B. rapa </it>genome. Several maps concerning <it>B. rapa </it>pseudochromosome assembly have been published but their coverage of the genome is incomplete, anchoring approximately 73.6% of the scaffolds on to chromosomes. Therefore, a new genetic map to aid pseudochromosome assembly is required.</p> <p>Results</p> <p>This study concerns the construction of a reference genetic linkage map for <it>Brassica rapa</it>, forming the backbone for anchoring sequence scaffolds of the <it>B. rapa </it>genome resulting from recent sequencing efforts. One hundred and nineteen doubled haploid (DH) lines derived from microspore cultures of an F1 cross between a Chinese cabbage (<it>B. rapa </it>ssp. <it>pekinensis</it>) DH line (Z16) and a rapid cycling inbred line (L144) were used to construct the linkage map. PCR-based insertion/deletion (InDel) markers were developed by re-sequencing the two parental lines. The map comprises a total of 507 markers including 415 InDels and 92 SSRs. Alignment and orientation using SSR markers in common with existing <it>B. rapa </it>linkage maps allowed ten linkage groups to be identified, designated A01-A10. The total length of the linkage map was 1234.2 cM, with an average distance of 2.43 cM between adjacent marker loci. The lengths of linkage groups ranged from 71.5 cM to 188.5 cM for A08 and A09, respectively. Using the developed linkage map, 152 scaffolds were anchored on to the chromosomes, encompassing more than 82.9% of the <it>B. rapa </it>genome. Taken together with the previously available linkage maps, 183 scaffolds were anchored on to the chromosomes and the total coverage of the genome was 88.9%.</p> <p>Conclusions</p> <p>The development of this linkage map is vital for the integration of genome sequences and genetic information, and provides a useful resource for the international <it>Brassica </it>research community.</p
Genetic Diversity and Linkage Disequilibrium in Chinese Bread Wheat (Triticum aestivum L.) Revealed by SSR Markers
Two hundred and fifty bread wheat lines, mainly Chinese mini core accessions, were assayed for polymorphism and linkage disequilibrium (LD) based on 512 whole-genome microsatellite loci representing a mean marker density of 5.1 cM. A total of 6,724 alleles ranging from 1 to 49 per locus were identified in all collections. The mean PIC value was 0.650, ranging from 0 to 0.965. Population structure and principal coordinate analysis revealed that landraces and modern varieties were two relatively independent genetic sub-groups. Landraces had a higher allelic diversity than modern varieties with respect to both genomes and chromosomes in terms of total number of alleles and allelic richness. 3,833 (57.0%) and 2,788 (41.5%) rare alleles with frequencies of <5% were found in the landrace and modern variety gene pools, respectively, indicating greater numbers of rare variants, or likely new alleles, in landraces. Analysis of molecular variance (AMOVA) showed that A genome had the largest genetic differentiation and D genome the lowest. In contrast to genetic diversity, modern varieties displayed a wider average LD decay across the whole genome for locus pairs with r2>0.05 (P<0.001) than the landraces. Mean LD decay distance for the landraces at the whole genome level was <5 cM, while a higher LD decay distance of 5–10 cM in modern varieties. LD decay distances were also somewhat different for each of the 21 chromosomes, being higher for most of the chromosomes in modern varieties (<5∼25 cM) compared to landraces (<5∼15 cM), presumably indicating the influences of domestication and breeding. This study facilitates predicting the marker density required to effectively associate genotypes with traits in Chinese wheat genetic resources
Nucleotide diversity and molecular evolution of the WAG-2 gene in common wheat (Triticum aestivum L) and its relatives
In this work, we examined the genetic diversity and evolution of the WAG-2 gene based on new WAG-2 alleles isolated from wheat and its relatives. Only single nucleotide polymorphisms (SNP) and no insertions and deletions (indels) were found in exon sequences of WAG-2 from different species. More SNPs and indels occurred in introns than in exons. For exons, exons+introns and introns, the nucleotide polymorphism π decreased from diploid and tetraploid genotypes to hexaploid genotypes. This finding indicated that the diversity of WAG-2 in diploids was greater than in hexaploids because of the strong selection pressure on the latter. All dn/ds ratios were < 1.0, indicating that WAG-2 belongs to a conserved gene affected by negative selection. Thirty-nine of the 57 particular SNPs and eight of the 10 indels were detected in diploid species. The degree of divergence in intron length among WAG-2 clones and phylogenetic tree topology suggested the existence of three homoeologs in the A, B or D genome of common wheat. Wheat AG-like genes were divided into WAG-1 and WAG-2 clades. The latter clade contained WAG-2, OsMADS3 and ZMM2 genes, indicating functional homoeology among them
Species Discrimination, Population Structure and Linkage Disequilibrium in Eucalyptus camaldulensis and Eucalyptus tereticornis Using SSR Markers
Eucalyptus camaldulensis and E. tereticornis are closely related species commonly cultivated for pulp wood in many tropical countries including India. Understanding the genetic structure and linkage disequilibrium (LD) existing in these species is essential for the improvement of industrially important traits. Our goal was to evaluate the use of simple sequence repeat (SSR) loci for species discrimination, population structure and LD analysis in these species. Investigations were carried out with the most common alleles in 93 accessions belonging to these two species using 62 SSR markers through cross amplification. The polymorphic information content (PIC) ranged from 0.44 to 0.93 and 0.36 to 0.93 in E. camaldulensis and E. tereticornis respectively. A clear delineation between the two species was evident based on the analysis of population structure and species-specific alleles. Significant genotypic LD was found in E. camaldulensis, wherein out of 135 significant pairs, 17 pairs showed r2≥0.1. Similarly, in E. tereticornis, out of 136 significant pairs, 18 pairs showed r2≥0.1. The extent of LD decayed rapidly showing the significance of association analyses in eucalypts with higher resolution markers. The availability of whole genome sequence for E. grandis and the synteny and co-linearity in the genome of eucalypts, will allow genome-wide genotyping using microsatellites or single nucleotide polymorphims
Genetic diversity, linkage disequilibrium and power of a large grapevine (Vitis vinifera L) diversity panel newly designed for association studies
UMR-AGAP Equipe DAVV (Diversité, adaptation et amélioration de la vigne) ; équipe ID (Intégration de Données)International audienceAbstractBackgroundAs for many crops, new high-quality grapevine varieties requiring less pesticide and adapted to climate change are needed. In perennial species, breeding is a long process which can be speeded up by gaining knowledge about quantitative trait loci linked to agronomic traits variation. However, due to the long juvenile period of these species, establishing numerous highly recombinant populations for high resolution mapping is both costly and time-consuming. Genome wide association studies in germplasm panels is an alternative method of choice, since it allows identifying the main quantitative trait loci with high resolution by exploiting past recombination events between cultivars. Such studies require adequate panel design to represent most of the available genetic and phenotypic diversity. Assessing linkage disequilibrium extent and panel power is also needed to determine the marker density required for association studies.ResultsStarting from the largest grapevine collection worldwide maintained in Vassal (France), we designed a diversity panel of 279 cultivars with limited relatedness, reflecting the low structuration in three genetic pools resulting from different uses (table vs wine) and geographical origin (East vs West), and including the major founders of modern cultivars. With 20 simple sequence repeat markers and five quantitative traits, we showed that our panel adequately captured most of the genetic and phenotypic diversity existing within the entire Vassal collection. To assess linkage disequilibrium extent and panel power, we genotyped single nucleotide polymorphisms: 372 over four genomic regions and 129 distributed over the whole genome. Linkage disequilibrium, measured by correlation corrected for kinship, reached 0.2 for a physical distance between 9 and 458 Kb depending on genetic pool and genomic region, with varying size of linkage disequilibrium blocks. This panel achieved reasonable power to detect associations between traits with high broad-sense heritability (> 0.7) and causal loci with intermediate allelic frequency and strong effect (explaining > 10 % of total variance).ConclusionsOur association panel constitutes a new, highly valuable resource for genetic association studies in grapevine, and deserves dissemination to diverse field and greenhouse trials to gain more insight into the genetic control of many agronomic traits and their interaction with the environment
Genetic Variation of HvCBF Genes and Their Association with Salinity Tolerance in Tibetan Annual Wild Barley
The evaluation of both the genetic variation and the identification of salinity tolerant accessions of Tibetan annual wild barley (hereafter referred to as Tibetan barley) (Hordeum vulgare L. ssp. Spontaneum and H. vulgare L. ssp. agriocrithum) are essential for discovering and exploiting novel alleles involved in salinity tolerance. In this study, we examined tissue dry biomass and the Na+ and K+ contents of 188 Tibetan barley accessions in response to salt stress. We investigated the genetic variation of transcription factors HvCBF1, HvCBF3 and HvCBF4 within these accessions, conducting association analysis between these three genes and the respective genotypic salt tolerance. Salt stress significantly reduced shoot and root dry weight by 27.6% to 73.1% in the Tibetan barley lines. HvCBF1, HvCBF3 and HvCBF4 showed diverse sequence variation in amplicon as evident by the identification of single nucleotide polymorphisms (SNPs) and 3, 8 and 13 haplotypes, respectively. Furthermore, the decay of Linkage disequilibrium (LD) of chromosome 5 was 8.9 cM (r2<0.1). Marker bpb-4891 and haplotype 13 (Ps 610) of the HvCBF4 gene were significantly (P<0.05) and highly significantly (P<0.001) associated with salt tolerance. However, HvCBF1 and HvCBF3 genes were not associated with salinity tolerance. The accessions from haplotype 13 of the HvCBF4 gene showed high salinity tolerance, maintaining significantly lower Na+/K+ ratios and higher dry weight. It is thus proposed that these Tibetan barley accessions could be of value for enhancing salinity tolerance in cultivated barley
Genetic variants of HvCbf14 are statistically associated with frost tolerance in a European germplasm collection of Hordeum vulgare
Two quantitative trait loci (Fr-H1 and Fr-H2) for frost tolerance (FT) have been discovered on the long arm of chromosome 5H in barley. Two tightly linked groups of CBF genes, known to play a key role in the FT regulatory network in A. thaliana, have been found to co-segregate with Fr-H2. Here, we investigate the allelic variations of four barley CBF genes (HvCbf3, HvCbf6, HvCbf9 and HvCbf14) in a panel of European cultivars, landraces and H. spontaneum accessions. In the cultivars a reduction of nucleotide and haplotype diversities in CBFs compared with the landraces and the wild ancestor H. spontaneum, was evident. In particular, in cultivars the loss of HvCbf9 genetic variants was higher compared to other sequences. In order to verify if the pattern of CBF genetic variants correlated with the level of FT, an association procedure was adopted. The pairwise analysis of linkage disequilibrium (LD) among the genetic variants in four CBF genes was computed to evaluate the resolution of the association procedure. The pairwise plotting revealed a low level of LD in cultivated varieties, despite the tight physical linkage of CBF genes analysed. A structured association procedure based on a general liner model was implemented, including the variants in CBFs, of Vrn-H1, and of two reference genes not involved in FT (α-Amy1 and Gapdh) and considering the phenotypic data for FT. Association analysis recovered two nucleotide variants of HvCbf14 and one nucleotide variant of Vrn-H1 as statistically associated to FT
High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species
<p>Abstract</p> <p>Background</p> <p>High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera.</p> <p>Results</p> <p>We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of <it>Eucalyptus </it>from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for <it>E. grandis</it>. A systematic assessment of <it>in silico </it>SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous <it>in silico </it>constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species.</p> <p>SNP reliability was high across nine <it>Eucalyptus </it>species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased.</p> <p>Conclusions</p> <p>This study indicates that the GGGT performs well both within and across species of <it>Eucalyptus </it>notwithstanding its nucleotide diversity ≥2%. The development of a much larger array of informative SNPs across multiple <it>Eucalyptus </it>species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in <it>Eucalyptus</it>.</p
- …