5 research outputs found
Genome-wide SNPs and re-sequencing of growth habit and inflorescence genes in barley: implications for association mapping in germplasm arrays varying in size and structure
<p>Abstract</p> <p>Background</p> <p>Considerations in applying association mapping (AM) to plant breeding are population structure and size: not accounting for structure and/or using small populations can lead to elevated false-positive rates. The principal determinants of population structure in cultivated barley are growth habit and inflorescence type. Both are under complex genetic control: growth habit is controlled by the epistatic interactions of several genes. For inflorescence type, multiple loss-of-function alleles in one gene lead to the same phenotype. We used these two traits as models for assessing the effectiveness of AM. This research was initiated using the CAP Core germplasm array (n = 102) assembled at the start of the Barley Coordinated Agricultural Project (CAP). This array was genotyped with 4,608 SNPs and we re-sequenced genes involved in morphology, growth and development. Larger arrays of breeding germplasm were subsequently genotyped and phenotyped under the auspices of the CAP project. This provided sets of 247 accessions phenotyped for growth habit and 2,473 accessions phenotyped for inflorescence type. Each of the larger populations was genotyped with 3,072 SNPs derived from the original set of 4,608.</p> <p>Results</p> <p>Significant associations with SNPs located in the vicinity of the loci involved in growth habit and inflorescence type were found in the CAP Core. Differentiation of true and spurious associations was not possible without <it>a priori </it>knowledge of the candidate genes, based on re-sequencing. The re-sequencing data were used to define allele types of the determinant genes based on functional polymorphisms. In a second round of association mapping, these synthetic markers based on allele types gave the most significant associations. When the synthetic markers were used as anchor points for analysis of interactions, we detected other known-function genes and candidate loci involved in the control of growth habit and inflorescence type. We then conducted association analyses - with SNP data only - in the larger germplasm arrays. For both vernalization sensitivity and inflorescence type, the most significant associations in the larger data sets were found with SNPs coincident with the synthetic markers used in the CAP Core and with SNPs detected via interaction analysis in the CAP Core.</p> <p>Conclusions</p> <p>Small and highly structured collections of germplasm, such as the CAP Core, are cost-effectively phenotyped and genotyped with high-throughput markers. They are also useful for characterizing allelic diversity at loci in germplasm of interest. Our results suggest that discovery-oriented exercises in AM in such small arrays may generate a large number of false-positives. However, if haplotypes in candidate genes are available, they may be used as anchors in an analysis of interactions to identify other candidate regions harboring genes determining target traits. Using larger germplasm arrays, genome regions where the principal genes determining vernalization sensitivity and row type are located were identified.</p
Validation of in silico-predicted genic SNPs in white clover (Trifolium repens L.), an outbreeding allopolyploid species
White clover (Trifolium repens L.) is an obligate outbreeding allotetraploid forage legume. Gene-associated SNPs provide the optimum genetic system for improvement of such crop species. An EST resource obtained from multiple cDNA libraries constructed from numerous genotypes of a single cultivar has been used for in silico SNP discovery and validation. A total of 58 from 236 selected sequence clusters (24.5%) were fully validated as containing polymorphic SNPs by genotypic analysis across the parents and progeny of several two-way pseudo-testcross mapping families. The clusters include genes belonging to a broad range of predicted functional categories. Polymorphic SNP-containing ESTs have also been used for comparative genomic analysis by comparison with whole genome data from model legume species, as well as Arabidopsis thaliana. A total of 29 (50%) of the 58 clusters detected putative ortholoci with known chromosomal locations in Medicago truncatula, which is closely related to white clover within the Trifolieae tribe of the Fabaceae. This analysis provides access to translational data from model species. The efficiency of in silico SNP discovery in white clover is limited by paralogous and homoeologous gene duplication effects, which are resolved unambiguously by the transmission test. This approach will also be applicable to other agronomically important cross-pollinating allopolyploid plant species