136 research outputs found
Statistical Analysis of Microarray Data with Replicated Spots: A Case Study with Synechococcus WH8102
Until recently microarray experiments often involved relatively few arrays with only a single representation of each gene on each array. A complete genome microarray with multiple spots per gene (spread out spatially across the array) was developed in order to compare the gene expression of a marine cyanobacterium and a knockout mutant strain in a defined artificial seawater medium. Statistical methods were developed for analysis in the special situation of this case study where there is gene replication within an array and where relatively few arrays are used, which can be the case with current array technology. Due in part to the replication within an array, it was possible to detect very small changes in the levels of expression between the wild type and mutant strains. One interesting biological outcome of this experiment is the indication of the extent to which the phosphorus regulatory system of this cyanobacterium affects the expression of multiple genes beyond those strictly involved in phosphorus acquisition
NCBI GEO: archive for high-throughput functional genomic data
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as ‘Minimum Information About a Microarray Experiment’ (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/
A contiguous de novo genome assembly of sugar beet EL10 (Beta vulgaris L.)
A contiguous assembly of the inbred ‘EL10’ sugar beet (Beta vulgaris ssp. vulgaris) genome was constructed using PacBio long-read sequencing, BioNano optical mapping, Hi-C scaffolding, and Illumina short-read error correction. The EL10.1 assembly was 540 Mb, of which 96.2% was contained in nine chromosome-sized pseudomolecules with lengths from 52 to 65 Mb, and 31 contigs with a median size of 282 kb that remained unassembled. Gene annotation incorporating RNA-seq data and curated sequences via the MAKER annotation pipeline generated 24,255 gene models. Results indicated that the EL10.1 genome assembly is a contiguous genome assembly highly congruent with the published sugar beet reference genome. Gross duplicate gene analyses of EL10.1 revealed little large-scale intra-genome duplication. Reduced gene copy number for well-annotated gene families relative to other core eudicots was observed, especially for transcription factors. Variation in genome size in B. vulgaris was investigated by flow cytometry among 50 individuals producing estimates from 633 to 875 Mb/1C. Read-depth mapping with short-read whole-genome sequences from other sugar beet germplasm suggested that relatively few regions of the sugar beet genome appeared associated with high-copy number variation
Interactive metagenomic visualization in a Web browser
<p>Abstract</p> <p>Background</p> <p>A critical output of metagenomic studies is the estimation of abundances of taxonomical or functional groups. The inherent uncertainty in assignments to these groups makes it important to consider both their hierarchical contexts and their prediction confidence. The current tools for visualizing metagenomic data, however, omit or distort quantitative hierarchical relationships and lack the facility for displaying secondary variables.</p> <p>Results</p> <p>Here we present Krona, a new visualization tool that allows intuitive exploration of relative abundances and confidences within the complex hierarchies of metagenomic classifications. Krona combines a variant of radial, space-filling displays with parametric coloring and interactive polar-coordinate zooming. The HTML5 and JavaScript implementation enables fully interactive charts that can be explored with any modern Web browser, without the need for installed software or plug-ins. This Web-based architecture also allows each chart to be an independent document, making them easy to share via e-mail or post to a standard Web server. To illustrate Krona's utility, we describe its application to various metagenomic data sets and its compatibility with popular metagenomic analysis tools.</p> <p>Conclusions</p> <p>Krona is both a powerful metagenomic visualization tool and a demonstration of the potential of HTML5 for highly accessible bioinformatic visualizations. Its rich and interactive displays facilitate more informed interpretations of metagenomic analyses, while its implementation as a browser-based application makes it extremely portable and easily adopted into existing analysis packages. Both the Krona rendering code and conversion tools are freely available under a BSD open-source license, and available from: <url>http://krona.sourceforge.net</url>.</p
NCBI GEO: archive for functional genomics data sets—10 years on
A decade ago, the Gene Expression Omnibus (GEO) database was established at the National Center for Biotechnology Information (NCBI). The original objective of GEO was to serve as a public repository for high-throughput gene expression data generated mostly by microarray technology. However, the research community quickly applied microarrays to non-gene-expression studies, including examination of genome copy number variation and genome-wide profiling of DNA-binding proteins. Because the GEO database was designed with a flexible structure, it was possible to quickly adapt the repository to store these data types. More recently, as the microarray community switches to next-generation sequencing technologies, GEO has again adapted to host these data sets. Today, GEO stores over 20 000 microarray- and sequence-based functional genomics studies, and continues to handle the majority of direct high-throughput data submissions from the research community. Multiple mechanisms are provided to help users effectively search, browse, download and visualize the data at the level of individual genes or entire studies. This paper describes recent database enhancements, including new search and data representation tools, as well as a brief review of how the community uses GEO data. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/
Genetic variation for tuber mineral concentrations in accessions of the Commonwealth Potato Collection
The variation in tuber mineral concentrations amongst accessions of wild tuber-bearing Solanum species in the Commonwealth Potato Collection (CPC) was evaluated under greenhouse conditions. Selected CPC accessions, representing the eco-geographical distribution of wild potatoes, were grown to maturity in peat-based compost under controlled conditions. Tubers from five plants of each accession were harvested, bulked and their mineral composition analysed. Among the germplasm investigated, there was a greater range in tuber concentrations of some elements of nutritional significance to both plants and animals, such as (Ca, Fe and Zn; 6.7, 3.6, and 4.5-fold respectively) than others, such as (K, P and S; all <3-fold). Significant positive correlations were found between mean altitude of the species' range and tuber P, K, Cu and Mg concentrations. The amount of diversity observed in the CPC collection indicates the existence of wide differences in tuber mineral accumulation among different potato accessions. This might be useful in breeding for nutritional improvement of potato tubers
An improved pig reference genome sequence to enable pig genetics and genomics research.
BACKGROUND: The domestic pig (Sus scrofa) is important both as a food source and as a biomedical model given its similarity in size, anatomy, physiology, metabolism, pathology, and pharmacology to humans. The draft reference genome (Sscrofa10.2) of a purebred Duroc female pig established using older clone-based sequencing methods was incomplete, and unresolved redundancies, short-range order and orientation errors, and associated misassembled genes limited its utility. RESULTS: We present 2 annotated highly contiguous chromosome-level genome assemblies created with more recent long-read technologies and a whole-genome shotgun strategy, 1 for the same Duroc female (Sscrofa11.1) and 1 for an outbred, composite-breed male (USMARCv1.0). Both assemblies are of substantially higher (>90-fold) continuity and accuracy than Sscrofa10.2. CONCLUSIONS: These highly contiguous assemblies plus annotation of a further 11 short-read assemblies provide an unprecedented view of the genetic make-up of this important agricultural and biomedical model species. We propose that the improved Duroc assembly (Sscrofa11.1) become the reference genome for genomic research in pigs
Evolutionary and Experimental Assessment of Novel Markers for Detection of Xanthomonas euvesicatoria in Plant Samples
BACKGROUND: Bacterial spot-causing xanthomonads (BSX) are quarantine phytopathogenic bacteria responsible for heavy losses in tomato and pepper production. Despite the research on improved plant spraying methods and resistant cultivars, the use of healthy plant material is still considered as the most effective bacterial spot control measure. Therefore, rapid and efficient detection methods are crucial for an early detection of these phytopathogens. METHODOLOGY: In this work, we selected and validated novel DNA markers for reliable detection of the BSX Xanthomonas euvesicatoria (Xeu). Xeu-specific DNA regions were selected using two online applications, CUPID and Insignia. Furthermore, to facilitate the selection of putative DNA markers, a customized C program was designed to retrieve the regions outputted by both databases. The in silico validation was further extended in order to provide an insight on the origin of these Xeu-specific regions by assessing chromosomal location, GC content, codon usage and synteny analyses. Primer-pairs were designed for amplification of those regions and the PCR validation assays showed that most primers allowed for positive amplification with different Xeu strains. The obtained amplicons were labeled and used as probes in dot blot assays, which allowed testing the probes against a collection of 12 non-BSX Xanthomonas and 23 other phytopathogenic bacteria. These assays confirmed the specificity of the selected DNA markers. Finally, we designed and tested a duplex PCR assay and an inverted dot blot platform for culture-independent detection of Xeu in infected plants. SIGNIFICANCE: This study details a selection strategy able to provide a large number of Xeu-specific DNA markers. As demonstrated, the selected markers can detect Xeu in infected plants both by PCR and by hybridization-based assays coupled with automatic data analysis. Furthermore, this work is a contribution to implement more efficient DNA-based methods of bacterial diagnostics
- …