Search CORE

76 research outputs found

Optimization of sequence alignment for simple sequence repeat regions

Author: Abdulqader Jighly
Aladdin Hamwieh
C Gow
D Tautz
DA Chistiakov
Francis C Ogbonnaya
GI Bell
H Ellegren
H Ellegren
J Weber
J Wiessenbach
JAL Armour
JC Whittaker
JP Jakupciak
K Tamura
M Brandström
M Sekar
MF Santibanez-Koref
R Peakall
R Sainudiin
RV Kantety
S Kruglyak
S Leclercq
W Powell
YD Kelkar
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. Conclusions The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Exploiting a wheat EST database to assess genetic diversity

Author: Ahu Altinkut Uncuoglu
Akfrat-Senturk F
Altnta S
Altschul SF
Asf M
Bai G
Bennett MD
Chen XM
Ercan S
Filiz Gurel
Gülbitti-Onarici S
Jaccard P
Kantety RV
Kovach WL
Leigh F
Li S
Lu G
Maheswaran M
McNeal FM
Nagaoka T
Ozge Karakas
Pakniyat H
Parkinson J
Pashley CH
Plaschke J
Rudd S
Schondelmaier J
Song W
Torre J
Wei YM
Weining S
Yee E
Yu JK
Zeybek A
Zhang W
Publication venue: Sociedade Brasileira de Genética
Publication date: 01/01/2010
Field of study

Expressed sequence tag (EST) markers have been used to assess variety and genetic diversity in wheat (Triticum aestivum). In this study, 1549 ESTs from wheat infested with yellow rust were used to examine the genetic diversity of six susceptible and resistant wheat cultivars. The aim of using these cultivars was to improve the competitiveness of public wheat breeding programs through the intensive use of modern, particularly marker-assisted, selection technologies. The F2 individuals derived from cultivar crosses were screened for resistance to yellow rust at the seedling stage in greenhouses and adult stage in the field to identify DNA markers genetically linked to resistance. Five hundred and sixty ESTs were assembled into 136 contigs and 989 singletons. BlastX search results showed that 39 (29%) contigs and 96 (10%) singletons were homologous to wheat genes. The database-matched contigs and singletons were assigned to eight functional groups related to protein synthesis, photosynthesis, metabolism and energy, stress proteins, transporter proteins, protein breakdown and recycling, cell growth and division and reactive oxygen scavengers. PCR analyses with primers based on the contigs and singletons showed that the most polymorphic functional categories were photosynthesis (contigs) and metabolism and energy (singletons). EST analysis revealed considerable genetic variability among the Turkish wheat cultivars resistant and susceptible to yellow rust disease and allowed calculation of the mean genetic distance between cultivars, with the greatest similarity (0.725) being between Harmankaya99 and Sönmez2001, and the lowest (0.622) between Aytin98 and Izgi01

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

PubMed Central

İstanbul Üniversitesi Açık Erişim Sistemi

A SNP and SSR Based Genetic Map of Asparagus Bean (Vigna. unguiculata ssp. sesquipedialis) and Comparison with the Broader Species

Author: A Fatmi
Baogen Wang
BB Singh
BJ Bassam
CA Fatokun
CD Li
CG Williams
DD Kosambi
Dehui Qin
E Jenczewski
Guojing Li
H Lu
HK Choi
JD Ehlers
JD Farisa
Jeffery D. Ehlers
JG Fang
K Arumuganathan
LR Chen
M Lorieux
M Lorieux
M Muchero
MP Timko
MS Roder
N Yamanaka
Ndeye-Ndack Diop
OK Han
P Xu
Pei Xu
Philip A. Roberts
PK Gupta
Roland G. Roberts
RV Kantety
S Paillard
S Sato
S Xue
SN Nayak
T Maguire
Timothy J. Close
Tingting Hu
TY Hwang
WD Beavis
X Qi
Xiaohua Wu
Yonghua Liu
Zhongfu Lu
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Asparagus bean (Vigna. unguiculata ssp. sesquipedialis) is a distinctive subspecies of cowpea [Vigna. unguiculata (L.) Walp.] that apparently originated in East Asia and is characterized by extremely long and thin pods and an aggressive climbing growth habit. The crop is widely cultivated throughout Asia for the production of immature pods known as ‘long beans’ or ‘asparagus beans’. While the genome of cowpea ssp. unguiculata has been characterized recently by high-density genetic mapping and partial sequencing, little is known about the genome of asparagus bean. We report here the first genetic map of asparagus bean based on SNP and SSR markers. The current map consists of 375 loci mapped onto 11 linkage groups (LGs), with 191 loci detected by SNP markers and 184 loci by SSR markers. The overall map length is 745 cM, with an average marker distance of 1.98 cM. There are four high marker-density blocks distributed on three LGs and three regions of segregation distortion (SDRs) identified on two other LGs, two of which co-locate in chromosomal regions syntenic to SDRs in soybean. Synteny between asparagus bean and the model legume Lotus. japonica was also established. This work provides the basis for mapping and functional analysis of genes/QTLs of particular interest in asparagus bean, as well as for comparative genomics study of cowpea at the subspecies level

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Tandem repeat distribution of gene transcripts in three plant families

Author: Antonio Costa de Oliveira
Asp T
Bell GI
Cardle L
Conne B
Cordeiro GM
Davis BM
Fernando Irajá Félix de Carvalho
Hong CP
Iyer RR
Jiang D
Jung S
Kantety RV
Kashi Y
Kumpatla SP
La Rota M
Lawson MJ
Li B
Li YC
Li YC
Luciano Carlos da Maia
Maia LC da
Mauricio Marini Kopp
McCouch SR
Morgante M
Morgante M
Nicot N
Palmieri DA
Parida SK
Peng JH
Philips AV
Subramanian S
Temnykh S
Thiel T
Thornton CA
Tóth G
Varshney RK
Varshney RK
Varshney RK
Velci Queiróz de Souza
Yu JK
Zhang L
Zhang L
Zhang L
Publication venue: Sociedade Brasileira de Genética
Publication date: 01/01/2009
Field of study

Tandem repeats (microsatellites or SSRs) are molecular markers with great potential for plant genetic studies. Modern strategies include the transfer of these markers among widely studied and orphan species. In silico analyses allow for studying distribution patterns of microsatellites and predicting which motifs would be more amenable to interspecies transfer. Transcribed sequences (Unigene) from ten species of three plant families were surveyed for the occurrence of micro and minisatellites. Transcripts from different species displayed different rates of tandem repeat occurrence, ranging from 1.47% to 11.28%. Both similar and different patterns were found within and among plant families. The results also indicate a lack of association between genome size and tandem repeat fractions in expressed regions. The conservation of motifs among species and its implication on genome evolution and dynamics are discussed

Repository Open Access to Scientific Information from Embrapa

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

PubMed Central

Developmenrt of EST-SSR and genomic-SSR markers to assess genetic diversity in Jatropha Curcas L.

Author: A Wunsch
BN Divakara
CG Vander Linden
Cheng Lu
CX Chen
D Fairless
D Metzgar
D Wang
DVN Sudheer Pamidimarri
ET Akintayo
FC Yeh
FJ Rohlf
GM Cordeiro
Haiyan Wang
HM Chen
HPS Makkar
JJ Doyle
K Openshaw
K Suwabe
KV Rajeev
Leela Tatikonda
LF Gao
LM Cano-Asseleih
LM Cano-Asseleih
LY Zhang
LZ Li
M Wink
Meiling Zou
Mingfu Wen
MS Roder
N Sunil
O Tahan
P Sourdille
PK Gupta
QB Sun
REC Mba
RF Vieira
RK Varshney
RPS Katwal
RV Kantety
S Bory
S Ganesh Ram
SD Basha
SD Basha
T Thiel
Wenquan Wang
WJ Ou
YB Xu
YH Wang
Zhiqiang Xia
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background <it>Jatropha curcas L. </it>has attracted a great deal of attention worldwide, regarding its potential as a new biodiesel crop. However, the understanding of this crop remains very limited and little genomic research has been done. We used simple sequence repeat (SSR) markers that could be transferred from <it>Manihot esculenta </it>(cassava) to analyze the genetic relationships among 45 accessions of <it>J. curcas </it>from our germplasm collection. Results In total, 187 out of 419 expressed sequence tag (EST)-SSR and 54 out of 182 genomic (G)-SSR markers from cassava were polymorphic among the <it>J. curcas </it>accessions. The EST-SSR markers comprised 26.20% dinucleotide repeats, 57.75% trinucleotide repeats, 7.49% tetranucleotide repeats, and 8.56% pentanucleotide repeats, whereas the majority of the G-SSR markers were dinucleotide repeats (62.96%). The 187 EST-SSRs resided in genes that are involved mainly in biological and metabolic processes. Thirty-six EST-SSRs and 20 G-SSRs were chosen to analyze the genetic diversity among 45 <it>J. curcas </it>accessions. A total of 183 polymorphic alleles were detected. On the basis of the distribution of these polymorphic alleles, the 45 accessions were classified into six groups, in which the genotype showed a correlation with geographic origin. The estimated mean genetic diversity index was 0.5572, which suggests that our <it>J. curcas </it>germplasm collection has a high level of genetic diversity. This should facilitate subsequent studies on genetic mapping and molecular breeding. Conclusion We identified 241 novel EST-SSR and G-SSR markers in <it>J. curcas</it>, which should be useful for genetic mapping and quantitative trait loci analysis of important agronomic traits. By using these markers, we found that the intergroup gene diversity of <it>J. curcas </it>was greater than the intragroup diversity, and that the domestication of the species probably occurred partly in America and partly in Hainan, China.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Population Genetic Structure of the Grasshopper Eyprepocnemis plorans in the South and East of the Iberian Peninsula

Author: AJ Muñoz-Pajares
AK Hundsdoerfer
AR Pradeep
CB Phillips
CD Soulsbury
D Falush
DJ Spiegelhalter
DJ Tester
E Bonnet
E Muñoz
E Zietkiewicz
F Hernandez
F Perfectti
Francisco Perfectti
G Evanno
J Cabrero
J Cabrero
J Cano
J Felsenstein
JA Herrera
JH De Leon
JPM Camacho
Juan Pedro Martínez Camacho
K Vijayan
KE Holsinger
L Excoffier
LR Dice
María Dolores López-León
María Inmaculada Manrique-Poyato
MB Ratnaparkhe
MC Pardo
MD López-León
ME Fernandez
N Saitou
Nicolas Salamin
O Roux
P Taberlet
PK Kar
PM Schlüter
Ricardo Gómez
RN Jones
RV Kantety
S Sesarini
S Wright
SL Datwyler
T Kojima
T Nagaoka
V Pinedo-Cancino
VM Dirsh
W Wu
X Vekemans
XP Zhang
Y Tsumura
Z Lu
Z Ren
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

The grasshopper Eyprepocnemis plorans subsp. plorans harbors a very widespread polymorphism for supernumerary (B) chromosomes which appear to have arisen recently. These chromosomes behave as genomic parasites because they are harmful for the individuals carrying them and show meiotic drive in the initial stages of population invasion. The rapid increase in B chromosome frequency at intrapopulation level is thus granted by meiotic drive, but its spread among populations most likely depends on interpopulation gene flow. We analyze here the population genetic structure in 10 natural populations from two regions (in the south and east) of the Iberian Peninsula. The southern populations were coastal whereas the eastern ones were inland populations located at 260–655 m altitude. The analysis of 97 ISSR markers revealed significant genetic differentiation among populations (average GST = 0.129), and the Structure software and AMOVA indicated a significant genetic differentiation between southern and eastern populations. There was also significant isolation by distance (IBD) between populations. Remarkably, these results were roughly similar to those found when only the markers showing low or no dropout were included, suggesting that allelic dropout had negligible effects on population genetic analysis. We conclude that high gene flow helped this parasitic B chromosome to spread through most of the geographical range of the subspecies E. plorans plorans.This study was supported by a grant from the Spanish Ministerio de Ciencia e Innovación (CGL2009-11917), and was partially performed by FEDER funds. MIMP was supported by a fellowship (FPU) from the Spanish Ministerio de Ciencia e Innovación

Public Library of Science (PLOS)

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Repositorio Institucional Universidad de Granada

PubMed Central

Digital.CSIC

idUS. Depósito de Investigación Universidad de Sevilla

FigShare

Exploring the Switchgrass Transcriptome Using Second-Generation Sequencing Technology

Background: Switchgrass (Panicum virgatum L.) is a C4 perennial grass and widely popular as an important bioenergy crop. To accelerate the pace of developing high yielding switchgrass cultivars adapted to diverse environmental niches, the generation of genomic resources for this plant is necessary. The large genome size and polyploid nature of switchgrass makes whole genome sequencing a daunting task even with current technologies. Exploring the transcriptional landscape using next generation sequencing technologies provides a viable alternative to whole genome sequencing in switchgrass. Principal Findings: Switchgrass cDNA libraries from germinating seedlings, emerging tillers, flowers, and dormant seeds were sequenced using Roche 454 GS-FLX Titanium technology, generating 980,000 reads with an average read length of 367 bp. De novo assembly generated 243,600 contigs with an average length of 535 bp. Using the foxtail millet genome as a reference greatly improved the assembly and annotation of switchgrass ESTs. Comparative analysis of the 454-derived switchgrass EST reads with other sequenced monocots including Brachypodium, sorghum, rice and maize indicated a 70– 80 % overlap. RPKM analysis demonstrated unique transcriptional signatures of the four tissues analyzed in this study. More than 24,000 ESTs were identified in the dormant seed library. In silico analysis indicated that there are more than 2000 EST-SSRs in this collection. Expression of several orphan ESTs was confirmed by RT-PCR. Significance: We estimate that about 90 % of the switchgrass gene space has been covered in this analysis. This study nearl

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers

Abstract Background Sesame is an important oil crop, but limited transcriptomic and genomic data are currently available. This information is essential to clarify the fatty acid and lignan biosynthesis molecular mechanism. In addition, a shortage of sesame molecular markers limits the efficiency and accuracy of genetic breeding. High-throughput transcriptomic sequencing is essential to generate a large transcriptome sequence dataset for gene discovery and molecular marker development. Results Sesame transcriptomes from five tissues were sequenced using Illumina paired-end sequencing technology. The cleaned raw reads were assembled into a total of 86,222 unigenes with an average length of 629 bp. Of the unigenes, 46,584 (54.03%) had significant similarity with proteins in the NCBI nonredundant protein database and Swiss-Prot database (E-value < 10-5). Of these annotated unigenes, 10,805 and 27,588 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In total, 22,003 (25.52%) unigenes were mapped onto 119 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Furthermore, 44,750 unigenes showed homology to 15,460 <it>Arabidopsis </it>genes based on BLASTx analysis against The Arabidopsis Information Resource (TAIR, Version 10) and revealed relatively high gene coverage. In total, 7,702 unigenes were converted into SSR markers (EST-SSR). Dinucleotide SSRs were the dominant repeat motif (67.07%, 5,166), followed by trinucleotide (24.89%, 1,917), tetranucleotide (4.31%, 332), hexanucleotide (2.62%, 202), and pentanucleotide (1.10%, 85) SSRs. AG/CT (46.29%) was the dominant repeat motif, followed by AC/GT (16.07%), AT/AT (10.53%), AAG/CTT (6.23%), and AGG/CCT (3.39%). Fifty EST-SSRs were randomly selected to validate amplification and to determine the degree of polymorphism in the genomic DNA pools. Forty primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among 24 sesame accessions. Conclusions This study demonstrates that Illumina paired-end sequencing is a fast and cost-effective approach to gene discovery and molecular marker development in non-model organisms. Our results provide a comprehensive sequence resource for sesame research.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

Author: A Boualem
A Martin
A Mascarell-Creus
A Omid
Abdelhafid Bendahmane
Adnane Boualem
AH Paterson
Albert Mascarell-Creus
Ana I Caño-Delgado
AP Chan
Arabidopsis Genome Initiative
B Ewing
C Clepet
C Iseli
C Jeffrey
C Jeffrey
C Perin
Christian Clepet
D Gonzalez-Ibeas
Delphine Jublot
DJ Stekel
DM Shattuck-Eidens
E Haritatos
EI Boyle
F Dahmani-Mardas
G Gomez
GA Tuskan
GT Marth
H Ezura
H Korpelainen
H van Leeuwen
HG Nunez-Palenius
HH Chou
I Fernandez-Silva
I Gonda
International Brachypodium Initiative
International Rice Genome Sequencing Project
J Blanca
J Heslop-Harrison
J Jia
J Schmutz
James J Giovannoni
JJ Giovannoni
JJ Giovannoni
Jordi Garcia-Mas
JT Cuperus
K Aoki
K Arumuganathan
K Yamada
L Li
M Morales
M Tanurdzic
Maria Elena Hernandez-Gonzalez
Miguel A Aranda
Mingyun Huang
N Dai
Nurit Katzir
O Jaillon
P Rice
PD Karp
PS Schnable
R Apweiler
R Harel-Beja
R Lister
R Ming
R Velasco
Ramon Dolcet-Sanjuan
RD Finn
RV Kantety
S Guo
S Huang
S Rudd
SF Yang
SG Ralph
T Umezawa
Tarek Joobeur
V Portnoy
V Shulaev
Veronica Truniger
Vitaly Portnoy
VM Gonzalez
VM Gonzalez
W Deleu
W Schwab
X Argout
Y Benjamini
Yi Zheng
Zhangjun Fei
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences. Conclusion The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns.This work was supported by Research Grant Award No. IS-4223-09C from BARD, the United States-Israel Binational Agricultural Research and Development Fund, and by SNC Laboratoire ASL, de Ruiter Seeds B.V., Enza Zaden B.V., Gautier Semences S.A., Nunhems B.V., Rijk Zwaan B.V., Sakata Seed Inc, Semillas Fitó S.A., Seminis Vegetable Seeds Inc, Syngenta Seeds B.V., Takii and Company Ltd, Vilmorin and Cie S.A. and Zeraim Gedera Ltd (all of them as part of the support to ICuGI). CC was supported by CNRS ERL 8196.Peer Reviewe

HAL Evry

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

PubMed Central

Digital.CSIC

Diposit Digital de Documents de la UAB

ProdInra