Repositorio Institucional – Biblioteca Digital

A fast and cost-effective approach to develop and map EST-SSR markers: oak as a case study

Author: Alberto F.
Bodénès C.
Buonamici A.
Cabané C.
Chancerel E.
Cherubini M.
Daruvar A., de
Dumoulin P.Y.
Durand J.
Frigerio J.M.
Gailing O.
Goicoechea P.G.
Guichoux E.
Herrán A.
Ikaran Z.
Koelewijn H.P.
Kremer A.
Mattioni C.
Plomion C.
Sebastiani F.
Ueno S.
Vendramin G.
Villani F.
Publication date: 01/01/2010

Background: Expressed Sequence Tags (ESTs) are a source of simple sequence repeats (SSRs) that can be used to develop molecular markers for genetic studies. The availability of ESTs for Quercus robur and Quercus petraea provided a unique opportunity to develop microsatellite markers to accelerate research aimed at studying adaptation of these long-lived species to their environment. As a first step toward the construction of an SSR-based linkage map of oak for quantitative trait locus (QTL) mapping, we describe the mining and survey of EST-SSRs as well as a fast and cost-effective approach (bin mapping) to assign these markers to an approximate map position. We also compared the level of polymorphism between genomic and EST-derived SSRs and addressed the transferability of EST-SSRs to Castanea sativa (chestnut). Results: A catalogue of 103,000 Sanger ESTs was assembled into 28,024 unigenes, of which 18.6% presented one or more SSR motifs. More than 42% of these SSRs corresponded to trinucleotides. Primer pairs were designed for 748 putative unigenes. Overall, 37.7% (283) were found to amplify a single polymorphic locus in a reference full-sib pedigree of Quercus robur. The usefulness of these loci for establishing a genetic map was assessed using a bin mapping approach. Bin maps were constructed for the male and female parental trees, for which framework linkage maps based on AFLP markers were available. The bin set consisted of 14 highly informative offspring selected based on the number and position of crossover sites. The female and male maps comprised 44 and 37 bins, with an average bin length of 16.5 cM and 20.99 cM, respectively. A total of 256 EST-SSRs were assigned to bins and their map position was further validated by linkage mapping. EST-SSRs were found to be less polymorphic than genomic SSRs, but their transferability rate to chestnut, a species phylogenetically related to oak, was higher.
Conclusion: We have generated a bin map for oak comprising 256 EST-SSRs. This resource constitutes a first step toward the establishment of a gene-based map for this genus that will facilitate the dissection of QTLs affecting complex traits of ecological importance.
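
The bin-assignment idea in the abstract above can be illustrated with a minimal sketch. It assumes that each bin is represented by its joint genotype pattern across the bin-set offspring (one character per offspring, "0" or "1" for the parental allele inherited, "-" for missing data) and that a new marker is placed in every bin whose pattern it matches. The function name, the pattern encoding, and the six-offspring toy data are hypothetical; the study itself used 14 offspring, and its actual software and handling of linkage phase are not described here.

```python
def assign_to_bin(marker_pattern, bin_patterns, max_mismatches=0):
    """Return the names of all bins compatible with a marker's genotype pattern.

    marker_pattern: string with one character per bin-set offspring.
    bin_patterns:   dict mapping bin name -> pattern string of the same length.
    Positions scored "-" (missing) in either pattern are ignored.
    """
    matches = []
    for name, pattern in bin_patterns.items():
        if len(pattern) != len(marker_pattern):
            raise ValueError(f"pattern length differs for bin {name}")
        # Count offspring whose genotype disagrees between marker and bin.
        mismatches = sum(
            1
            for m, b in zip(marker_pattern, pattern)
            if m != "-" and b != "-" and m != b
        )
        if mismatches <= max_mismatches:
            matches.append(name)
    return matches


# Toy bin set of six offspring (hypothetical data).
bins = {
    "LG1_bin1": "001101",
    "LG1_bin2": "011101",
    "LG2_bin1": "110010",
}

print(assign_to_bin("011101", bins))  # unambiguous assignment
print(assign_to_bin("0-1101", bins))  # missing genotype leaves two candidate bins
```

A marker with a missing genotype at the one offspring that separates two adjacent bins cannot be resolved between them, which is why a bin set is chosen for the number and position of its crossover sites.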

Michigan Technological University

Wageningen University & Research Publications

Oskar Bordeaux

Data Mining for Simple Sequence Repeats in Oil Palm Expressed Sequence Tags

Author: Aikkal Riju
Vadivel Arunachalam
Publication date: 08/08/2009

Expressed Sequence Tags (ESTs) are short DNA sequences generated by sequencing one or both ends of an expressed gene. ESTs provide researchers with a quick and inexpensive route for discovering new genes, for obtaining data on gene expression and regulation, and for constructing genome maps. Oil palm EST sequences available in the public domain were downloaded, then clustered and assembled into contigs using CAP3 and Phrap. Microsatellite repeats were located using five software tools (MISA, TRA, TROLL, SSRIT, SSR primer). Of the five, MISA was found to perform best, and it can also detect compound repeats. The frequency and total number (202) of SSRs were determined. Mononucleotide repeats, especially 'A/T' repeats, were the most abundant in oil palm. Flanking primers were designed using primer3 and SSR primer. The results of the study are provided as an online database, 'MEMCO', to help oil palm researchers.
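
To make the SSR-mining step concrete, here is a minimal sketch of perfect-microsatellite detection with regular expressions. It is not the algorithm of MISA or of any other tool named above; the function name and the minimum repeat counts are assumptions chosen for illustration, and compound repeats are not handled.

```python
import re

# Minimum number of repeat units per motif length.
# These thresholds are hypothetical, not taken from the study.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len in sorted(min_repeats):
        n = min_repeats[unit_len]
        # A motif of unit_len bases followed by at least n - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit, such as "AA" or "ATAT".
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return sorted(hits)


# Toy sequence containing a mono-, a di- and a trinucleotide repeat.
example = "GGC" + "A" * 12 + "CCGT" + "AG" * 7 + "TTC" + "ATG" * 5 + "C"
for start, motif, count in find_perfect_ssrs(example):
    print(start, motif, count)
```

Start positions are 0-based. Real tools differ in their default thresholds and in how they report compound or interrupted repeats, so counts from this sketch should not be expected to match theirs.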

Nature Precedings

Development and characterization of BAC-end sequence derived SSRs, and their incorporation into a new higher density genetic map for cultivated peanut (Arachis hypogaea L.)

Author: Bertioli David J
Cook Douglas R
Farmer Andrew D
Gao Jinliang
Gong Limin
Guo Baozhu
He Guohao
Isobe Sachiko
Penmetsa R Varma
Rosen Benjamin D
Varshney Rajeev K
Wang Hui
Yuan Mei
Zhao Yongli
Publication venue: BioMed Central
Publication date: 01/01/2012

Background: Cultivated peanut (Arachis hypogaea L.) is an important crop worldwide, valued for its edible oil and digestible protein. It has a very narrow genetic base that may well derive from a relatively recent single polyploidization event. Accordingly, molecular markers have low levels of polymorphism and the number of polymorphic molecular markers available for cultivated peanut is still limited. Results: Here, we report a large set of BAC-end sequences (BES), use them for developing SSR (BES-SSR) markers, and apply them in genetic linkage mapping. The majority of BESs had no detectable homology to known genes (49.5%), followed by sequences with similarity to known genes (44.3%) and miscellaneous sequences (6.2%) such as transposable element, retroelement, and organelle sequences. A total of 1,424 SSRs were identified from 36,435 BESs. Among these identified SSRs, dinucleotide (47.4%) and trinucleotide (37.1%) SSRs were predominant. The new set of 1,152 SSRs as well as about 4,000 published or unpublished SSRs were screened against two parents of a mapping population, generating 385 polymorphic loci. A genetic linkage map was constructed, consisting of 318 loci on 21 linkage groups and covering a total of 1,674.4 cM, with an average distance of 5.3 cM between adjacent loci. Two markers related to resistance gene homologs (RGH) were mapped to two different groups, thus anchoring 1 RGH-BAC contig and 1 singleton. Conclusions: The SSRs mined from BESs will be of use in further molecular analysis of the peanut genome, providing a novel set of markers, genetically anchoring BAC clones, and incorporating gene sequences into a linkage map. This will aid in the identification of markers linked to genes of interest and map-based cloning.

ICRISAT Open Access Repository

eScholarship - University of California

Research Repository

Development of Microsatellite Markers by Data Mining from DNA Sequences

Author: Dan Wang
Jingou Tong
Lei Cheng
Publication venue: IntechOpen
Publication date: 01/01/2009

IntechOpen

PSR: Polymorphic SSR retrieval

Author: Cantarella Concita
D'Agostino Nunzio
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

BACKGROUND: With the advent of high-throughput sequencing technologies, large-scale identification of microsatellites became affordable and was especially directed to non-model species. By contrast, few efforts have been published toward the automatic identification of polymorphic microsatellites by exploiting sequence redundancy. Few tools for genotyping microsatellite repeats have been implemented so far that are able to manage huge amounts of sequence data and handle the SAM/BAM file format. Most of them have been developed for and tested on human or model organisms with high-quality reference genomes. RESULTS: In this note we describe polymorphic SSR retrieval (PSR), a read counter and simple sequence repeat (SSR) length polymorphism detection tool. It is written in Perl and was developed to identify length polymorphisms in perfect microsatellites exploiting next generation sequencing (NGS) data. PSR has been developed bearing in mind plant non-model species, for which de novo transcriptome assembly is generally the first sequence resource available to be used for SSR mining. PSR is divided into two modules: the read-counting module (PSR_read_retrieval) identifies all the reads that cover the full length of perfect microsatellites; the comparative module (PSR_poly_finder) detects both heterozygous and homozygous alleles at each microsatellite locus across all genotypes under investigation. Two threshold values to call a length polymorphism and reduce the number of false positives can be defined by the user: the minimum number of reads overlapping the repetitive stretch and the minimum read depth. The first parameter determines whether the microsatellite-containing sequence must be processed, while the second is decisive for the identification of minor alleles. PSR was tested on two different case studies. The first study aims at the identification of polymorphic SSRs in a set of de novo assembled transcripts defined by RNA sequencing of two different plant genotypes. The second aims to investigate sequence variations within a collection of newly sequenced chloroplast genomes. In both cases PSR results are in agreement with those obtained by capillary gel separation. CONCLUSION: PSR has been specifically developed from the need to automate the gene-based and genome-wide identification of polymorphic microsatellites from NGS data. It overcomes the limits related to the existing and time-consuming efforts based on tools developed in the pre-NGS era.
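
The two user-defined thresholds described above can be sketched as follows. This is an illustration in Python, not PSR's Perl implementation: the function names, the default values, and the input format (a list of SSR lengths, one per read spanning the full repeat) are all hypothetical, and the reading of the thresholds is an interpretation of the abstract.

```python
from collections import Counter


def call_alleles(read_lengths, min_reads=10, min_depth=3):
    """Call SSR length alleles at one locus in one genotype.

    read_lengths: SSR lengths observed in reads covering the whole repeat.
    min_reads:    minimum number of spanning reads needed to process the locus.
    min_depth:    minimum number of reads supporting an allele, so that
                  rare lengths (likely errors) are not called as minor alleles.
    Returns a frozenset of allele lengths, or None if the locus is skipped.
    """
    if len(read_lengths) < min_reads:
        return None
    counts = Counter(read_lengths)
    return frozenset(length for length, n in counts.items() if n >= min_depth)


def is_polymorphic(calls_by_genotype):
    """True if more than one allele length is seen across the called genotypes."""
    called = [c for c in calls_by_genotype.values() if c]
    return len(set().union(*called)) > 1


# Toy data (hypothetical): genotype A is heterozygous 12/14, B is homozygous 12,
# and C has too few spanning reads to be called.
calls = {
    "A": call_alleles([12] * 8 + [14] * 5 + [13]),
    "B": call_alleles([12] * 15),
    "C": call_alleles([12] * 4),
}
print(calls)
print(is_polymorphic(calls))
```

In this sketch a locus counts as polymorphic if any genotype is heterozygous or if homozygous genotypes carry different lengths, which mirrors the abstract's statement that both heterozygous and homozygous alleles are compared across genotypes.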

Archivio della ricerca - Università degli studi di Napoli Federico II

Wageningen University & Research Publications

Identification and Validation of EST-Derived Molecular Markers, TRAP and VNTRs, for Banana Research

Author: Ferreira C.F.
Garcia S.A.L.
Kema G.H.J.
Paiva L.V.
Souza M.T.
Talebi R.
Vroh B.I.
Publication venue: International Society for Horticultural Science (ISHS)

The advent of high-throughput sequencing technology has generated abundant information on DNA sequences for the genomes of many plant species. Expressed Sequence Tags (ESTs), which are unique DNA sequences derived from a cDNA library and therefore represent genes transcribed in specific tissues or at some stage of development, are one type of DNA sequence highly available today for many important crop species. Molecular markers are used for bridging DNA sequence information with particular phenotypes and are useful tools for genotyping germplasm collections and also for tagging genes involved in desirable agronomic traits. In this sense, there is always a strong demand for suitable marker techniques to better utilise existing sequence information. A transcriptome database from banana (Musa spp.), DATAMusa, containing 42,724 ESTs from 11 different cDNA libraries and encompassing approximately 24 Mb of DNA sequence, was used in this study for the design of primers to PCR-amplify two types of EST-derived molecular markers, Variable Number Tandem Repeat (VNTR) and Target Region Amplification Polymorphism (TRAP). These primers were then validated against a panel of 14 diploid Musa genotypes and produced 32 (VNTR) and 119 (TRAP) alleles. Used separately or together, both types of markers were able to discriminate Musa genotypes with different genome backgrounds (A or B genomes). The TRAP alleles identified were derived from only one EST, while the VNTR alleles were derived from 12 unigenes. Based on the results of this study, EST-derived markers can be an important source of polymorphism to be used in genetic diversity and gene discovery studies in banana.

Novel SSR Markers from BAC-End Sequences, DArT Arrays and a Comprehensive Genetic Map with 1,291 Marker Loci for Chickpea (Cicer arietinum L.)

Author: Abhishek Bohra
Andrzej Kilian
Bengt Hansson
Christopher D. Town
Douglas R. Cook
Günter Kahl
Hari D. Upadhyaya
Mahendar Thudi
Nepolean Thirunavukkarasu
Nicy Varghese
Pawan L. Kulwal
Peter Winter
Polavarapu B. KaviKishor
Pooran M. Gaur
R. Varma Penmetsa
Rajeev K. Varshney
Spurthi N. Nayak
Srivani Gudipati
Trushar M. Shah
Publication venue: Public Library of Science
Publication date: 01/01/2011

Chickpea (Cicer arietinum L.) is the third most important cool season food legume, cultivated in arid and semi-arid regions of the world. The goal of this study was to develop novel molecular markers such as microsatellite or simple sequence repeat (SSR) markers from bacterial artificial chromosome (BAC)-end sequences (BESs) and diversity arrays technology (DArT) markers, and to construct a high-density genetic map based on the recombinant inbred line (RIL) population ICC 4958 (C. arietinum)×PI 489777 (C. reticulatum). A BAC library comprising 55,680 clones was constructed and 46,270 BESs were generated. Mining of these BESs provided 6,845 SSRs, and primer pairs were designed for 1,344 SSRs. In parallel, DArT arrays with ca. 15,000 clones were developed, and 5,397 clones were found to be polymorphic among 94 genotypes tested. Screening of newly developed BES-SSR markers and DArT arrays on the parental genotypes of the RIL mapping population showed polymorphism with 253 BES-SSR markers and 675 DArT markers. Segregation data obtained for these polymorphic markers, together with data for 494 markers compiled from published reports or collaborators, were used for constructing the genetic map. As a result, a comprehensive genetic map comprising 1,291 markers on eight linkage groups (LGs) spanning a total distance of 845.56 cM was developed (http://cmap.icrisat.ac.in/cmap/sm/cp/thudi/). The number of markers per linkage group ranged from 68 (LG 8) to 218 (LG 3), with an average inter-marker distance of 0.65 cM. While the developed resource of molecular markers will be useful for genetic diversity, genetic mapping and molecular breeding applications, the comprehensive genetic map with integrated BES-SSR markers will facilitate its anchoring to the physical map (under construction) to accelerate map-based cloning of genes in chickpea and comparative genome evolution studies in legumes.

Public Library of Science (PLOS)

ICRISAT Open Access Repository

Hochschulschriftenserver - Universität Frankfurt am Main

Research Repository

Genome-wide identification of microsatellites in white clover (Trifolium repens L.) using FIASCO and phpSSRMiner

Author: Bouton Joseph H
He Ji
Monteros Maria J
Zhang Yan
Zhao Patrick X
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Allotetraploid white clover (Trifolium repens L.) is an important forage legume widely cultivated in most temperate regions. Only a small number of microsatellite markers are publicly available and can be utilized in white clover breeding programs. The objectives of this study were to develop an integrated approach for microsatellite development and to evaluate the approach for the development of new SSR markers for white clover. Results: Genomic libraries containing simple sequence repeat (SSR) sequences were constructed using a modified Fast Isolation by AFLP of Sequences COntaining repeats (FIASCO) procedure, and phpSSRMiner was used to develop the microsatellite markers. SSR motifs were isolated using two biotin-labeled probes, (CA)17 and (ATG)12. The sequences of 6,816 clones were assembled into 1,698 contigs, 32% of which represented novel sequences based on BLASTN searches. Approximately 32%, 28%, and 16% of these SSRs contained hexa-, tri-, and di-nucleotide repeats, respectively. The most frequent motifs were the CA and ATG complementary repeats and the associated compound sequences. Primer pairs were designed for 859 SSR loci based on sequences from these genomic libraries and from GenBank white clover nucleotide sequences. A total of 191 SSR primers developed from the two libraries were tested for polymorphism in individual clones from the parental genotypes GA43 ('Durana'), 'SRVR' and six F1 progeny from a mapping population. Ninety-two percent produced amplicons and 66% of these were polymorphic. Conclusion: The combined approach of identifying SSR-enriched fragments by FIASCO coupled with the primer design and in silico amplification using phpSSRMiner represents an efficient and low-cost pipeline for the large-scale development of microsatellite markers in plants. The approach described here could be readily adapted and utilized in other unrelated species with no or limited genomic resources.

ICRISAT Open Access Repository

Analysis of BAC-end sequences (BESs) and development of BES-SSR markers for genetic mapping and hybrid purity assessment in pigeonpea (Cajanus spp.)

Author: Bohra Abhishek
Cook Douglas R
Dubey Anuja
Farmer Andrew D
Gothalwal Ragini
Kishor PB Kavi
Kumar Naresh
May Gregory D
Penmetsa R Varma
Poornima KN
Ramesh S
Saxena Kulbhushan
Saxena Rachit K
Singh Dhiraj
Singh Nagendra K
Srivani Gudipati
Town Christopher D
Upadhyaya Hari D
Varshney Rajeev K
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Pigeonpea [Cajanus cajan (L.) Millsp.] is an important legume crop of rainfed agriculture. Despite concerted research efforts directed at pigeonpea improvement, productivity has stagnated over the last several decades, which may be attributed to the prevalence of various biotic and abiotic constraints; the situation is exacerbated by the lack of adequate genomic resources for undertaking any molecular breeding programme for accelerated crop improvement. With the objective of enhancing genomic resources for pigeonpea, this study reports, for the first time, large-scale development of SSR markers from BAC-end sequences and their subsequent use for genetic mapping and hybridity testing in pigeonpea. Results: A set of 88,860 BAC (bacterial artificial chromosome)-end sequences (BESs) were generated after constructing two BAC libraries using HindIII (34,560 clones) and BamHI (34,560 clones) restriction enzymes. Clustering based on sequence identity of BESs yielded a set of >52K non-redundant sequences, comprising 35 Mbp or >4% of the pigeonpea genome. These sequences were analyzed to develop annotation lists and subdivide the BESs into genome fractions (e.g., genes, retroelements, transposons and non-annotated sequences). Parallel analysis of BESs for microsatellites or simple sequence repeats (SSRs) identified 18,149 SSRs, from which a set of 6,212 SSRs were selected for further analysis. A total of 3,072 novel SSR primer pairs were synthesized and tested for length polymorphism on a set of 22 parental genotypes of 13 mapping populations segregating for traits of interest. In total, we identified 842 polymorphic SSR markers that will have utility in pigeonpea improvement. Based on these markers, the first SSR-based genetic map comprising 239 loci was developed for this previously uncharacterized genome. The utility of the developed SSR markers was also demonstrated by identifying a set of 42 markers each for two hybrids (ICPH 2671 and ICPH 2438) for genetic purity assessment in a commercial hybrid breeding programme. Conclusion: In summary, while BAC libraries and BESs should be useful for genomics studies, BES-SSR markers and the genetic map should be very useful for linking the genetic map with a future physical map as well as for molecular breeding in pigeonpea.