105 research outputs found

    Tablet—next generation sequence assembly visualization

    Summary: Tablet is a lightweight, high-performance graphical viewer for next-generation sequence assemblies and alignments. Supporting a range of input assembly formats, Tablet provides high-quality visualizations showing data in packed or stacked views, allowing instant access and navigation to any region of interest, as well as whole-contig overviews and data summaries. Tablet is both multi-core aware and memory efficient, allowing it to handle assemblies containing millions of reads, even on a 32-bit desktop machine.

    Tandem repeat distribution of gene transcripts in three plant families

    Tandem repeats (microsatellites or SSRs) are molecular markers with great potential for plant genetic studies. Modern strategies include the transfer of these markers between widely studied and orphan species. In silico analyses make it possible to study the distribution patterns of microsatellites and to predict which motifs would be more amenable to interspecies transfer. Transcribed sequences (Unigene) from ten species of three plant families were surveyed for the occurrence of micro- and minisatellites. Transcripts from different species displayed different rates of tandem repeat occurrence, ranging from 1.47% to 11.28%. Both similar and different patterns were found within and among plant families. The results also indicate a lack of association between genome size and tandem repeat fractions in expressed regions. The conservation of motifs among species and its implications for genome evolution and dynamics are discussed.

    Mining and validating grape (Vitis L.) ESTs to develop EST-SSR markers for genotyping and mapping

    Grape expressed sequence tags (ESTs) are a new resource for developing simple sequence repeat (SSR) functional markers for genotyping and genetic mapping. An integrated pipeline including several computational tools for SSR identification and functional annotation was developed to identify 6,447 EST-SSR sequences from a total collection of 215,609 grape ESTs retrieved from NCBI. The 6,447 EST-SSRs were further reduced to 1,701 non-redundant sequences via clustering analysis, and primer pairs flanking the SSR motifs were successfully designed for 1,037 of them. From these, 150 primer pairs were randomly selected for PCR amplification, polymorphism and heterozygosity analysis in V. vinifera cvs. Riesling and Cabernet Sauvignon, and V. rotundifolia (muscadine grape) cvs. Summit and Noble; 145 of these primer pairs yielded PCR products. Pairwise comparisons of loci between the parents Riesling and Cabernet Sauvignon showed that 72 were homozygous in both cultivars, while 70 loci were heterozygous in at least one of the two cultivars. The muscadine parents Noble and Summit had 90 SSR loci that were homozygous in both parents and 50 loci that were heterozygous in at least one of the two. These EST-SSR functional markers are a useful addition for grape genotyping and genome mapping.

    Genome-wide BAC-end sequencing of Cucumis melo using two BAC libraries

    Background: Although melon (Cucumis melo L.) is an economically important fruit crop, no genome-wide sequence information is openly available at the current time. We therefore sequenced BAC-ends representing a total of 33,024 clones, half of them from a previously described melon BAC library generated with restriction endonucleases and the remainder from a new random-shear BAC library. Results: We generated a total of 47,140 high-quality BAC-end sequences (BES), 91.7% of which were paired-BES. Both libraries were assembled independently and then cross-assembled to obtain a final set of 33,372 non-redundant, high-quality sequences. These were grouped into 6,411 contigs (4.5 Mb) and 26,961 non-assembled BES (14.4 Mb), representing ~4.2% of the melon genome. The sequences were used to screen genomic databases, identifying 7,198 simple sequence repeats (corresponding to one microsatellite every 2.6 kb) and 2,484 additional repeats of which 95.9% represented transposable elements. The sequences were also used to screen expressed sequence tag (EST) databases, revealing 11,372 BES that were homologous to ESTs. This suggests that ~30% of the melon genome consists of coding DNA. We observed regions of microsynteny between melon paired-BES and six other dicotyledonous plant genomes. Conclusion: The analysis of nearly 50,000 BES from two complementary genomic libraries covered ~4.2% of the melon genome, providing insight into properties such as microsatellite and transposable element distribution, and the percentage of coding DNA. The observed synteny between melon paired-BES and six other plant genomes showed that useful comparative genomic data can be derived through large scale BAC-end sequencing by anchoring a small proportion of the melon genome to other sequenced genomes.

    Microsatellites for the genus Cucurbita and an SSR-based genetic linkage map of Cucurbita pepo L.

    Until recently, only a few microsatellites were available for Cucurbita, so their development is highly desirable. The Austrian oil-pumpkin variety Gleisdorfer Ölkürbis (C. pepo subsp. pepo) and the C. moschata cultivar Soler (Puerto Rico) were used for SSR development. SSR-enriched partial genomic libraries were established and 2,400 clones were sequenced. Of these, 1,058 (44%) contained an SSR at least four repeats long. Primers were designed for 532 SSRs; 500 primer pairs produced fragments of the expected size. Of these, 405 (81%) amplified polymorphic fragments in a set of 12 genotypes: three C. moschata, one C. ecuadorensis, and eight C. pepo representing all eight cultivar groups. On average, C. pepo and C. moschata produced 3.3 alleles per primer pair, showing high inter-species transferability. There were 187 SSR markers detecting polymorphism between the USA oil-pumpkin variety “Lady Godiva” (O5) and the Italian crookneck variety “Bianco Friulano” (CN), which are the parents of our previous F2 mapping population. That population had been used to construct the first published C. pepo map, containing mainly RAPD and AFLP markers. The updated map now comprises 178 SSRs, 244 AFLPs, 230 RAPDs, five SCARs, and two morphological traits (h and B). It contains 20 linkage groups with a map density of 2.9 cM. The observed genome coverage (Co) is 86.8%.

    Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)

    Background: Cucumber, Cucumis sativus L., is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes have shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced, including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers on a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber. Results: A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found an unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The in silico PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly, were made publicly available. Conclusions: The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.
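
The survey of perfect microsatellites described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `find_perfect_ssrs`, the motif length range (2 to 6 bp) and the minimum of four repeat units are assumptions chosen only for illustration, and the abstract does not state which thresholds or tools were used.

```python
import re


def find_perfect_ssrs(seq, min_unit=2, max_unit=6, min_repeats=4):
    """Return sorted (start, motif, repeat_count) tuples for perfect tandem repeats.

    All thresholds are illustrative assumptions, not values from the study.
    """
    seq = seq.upper()
    hits = []
    for unit in range(min_unit, max_unit + 1):
        # A motif of `unit` bases followed by at least (min_repeats - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT", "AA"),
            # so each repeat is reported once under its shortest motif.
            if any(motif == motif[:k] * (unit // k)
                   for k in range(1, unit) if unit % k == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 6 + "CCT" + "AAG" * 5 + "TT"
    for start, motif, count in find_perfect_ssrs(example):
        print(start, motif, count)
```

A density in SSRs/Mbp, as reported in the abstract, would then be the number of hits divided by the sequence length in megabases.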

    An eQTL Analysis of Partial Resistance to Puccinia hordei in Barley

    Background - Genetic resistance to barley leaf rust caused by Puccinia hordei involves both R genes and quantitative trait loci. The R genes provide higher but less durable resistance than the quantitative trait loci. Consequently, exploring quantitative or partial resistance has become a favorable alternative for controlling disease. Four quantitative trait loci for partial resistance to leaf rust have been identified in the doubled haploid Steptoe (St)/Morex (Mx) mapping population. Further investigations are required to study the molecular mechanisms underpinning partial resistance and ultimately identify the causal genes. Methodology/Principal Findings - We explored partial resistance to barley leaf rust using a genetical genomics approach. We recorded RNA transcript abundance corresponding to each probe on a 15K Agilent custom barley microarray in seedlings from St and Mx and 144 doubled haploid lines of the St/Mx population. A total of 1154 and 1037 genes were, respectively, identified as being P. hordei-responsive among the St and Mx and differentially expressed between P. hordei-infected St and Mx. Normalized ratios from 72 distant-pair hybridisations were used to map the genetic determinants of variation in transcript abundance by expression quantitative trait locus (eQTL) mapping, generating 15685 eQTL from 9557 genes. Correlation analysis identified 128 genes that were correlated with resistance, of which 89 had eQTL co-locating with the phenotypic quantitative trait loci (pQTL). Transcript abundance in the parents and conservation of synteny with rice allowed us to prioritise six genes as candidates for Rphq11, the pQTL of largest effect, and highlight one, a phospholipid hydroperoxide glutathione peroxidase (HvPHGPx), for detailed analysis. Conclusions/Significance - The eQTL approach yielded information that led to the identification of strong candidate genes underlying pQTL for resistance to leaf rust in barley and on the general pathogen response pathway. The dataset will facilitate a systems appraisal of this host-pathogen interaction and, potentially, for other traits measured in this population.

    Integrated physical, genetic and genome map of chickpea (Cicer arietinum L.)

    A physical map of chickpea was developed for the reference chickpea genotype (ICC 4958) using bacterial artificial chromosome (BAC) libraries targeting 71,094 clones (~12× coverage). High information content fingerprinting (HICF) of these clones gave high-quality fingerprinting data for 67,483 clones, and 1,174 contigs comprising 46,112 clones and 3,256 singletons were defined. In brief, 574 Mb of genome was assembled in 1,174 contigs with an average of 0.49 Mb per contig, and the 3,256 singletons represent 407 Mb of genome. The physical map was linked with two genetic maps with the help of 245 BAC-end sequence (BES)-derived simple sequence repeat (SSR) markers. This allowed locating some of the BACs in the vicinity of some important quantitative trait loci (QTLs) for drought tolerance and resistance to Fusarium wilt and Ascochyta blight. In addition, the fingerprinted contig (FPC) assembly was also integrated with the draft genome sequence of chickpea. As a result, ~965 BACs including 163 minimum tiling path (MTP) clones could be mapped on eight pseudo-molecules of chickpea, forming 491 hypothetical contigs representing 54,013,992 bp (~54 Mb) of the draft genome. Comprehensive analysis of markers in abiotic and biotic stress tolerance QTL regions led to identification of 654, 306 and 23 genes in the drought tolerance “QTL-hotspot” region, the Ascochyta blight resistance QTL region and the Fusarium wilt resistance QTL region, respectively. The integrated physical, genetic and genome map should provide a foundation for cloning and isolation of QTLs/genes for molecular dissection of traits as well as markers for molecular breeding for chickpea improvement.

    Development and characterization of polymorphic microsatellite markers in taro (Colocasia esculenta)

    Microsatellite-containing sequences were isolated from enriched genomic libraries of taro (Colocasia esculenta (L.) Schott). The sequencing of 269 clones yielded 77 inserts containing repeat motifs. The majority of these (81.7%) were dinucleotide or trinucleotide repeats. The GT/CA repeat motif was the most common, accounting for 42% of all repeat types. From a total of 43 primer pairs designed, 41 produced markers within the expected size range. Sixteen (39%) were polymorphic when screened against a restricted set of taro genotypes from Southeast Asia and Oceania, with an average of 3.2 alleles detected at each locus. These markers represent a useful resource for taro germplasm management, genome mapping, and marker-assisted selection.