1,208 research outputs found

    Probe set algorithms: is there a rational best bet?

    Get PDF
    Affymetrix microarrays have become a standard experimental platform for studies of mRNA expression profiling. Their success is due, in part, to the multiple oligonucleotide features (probes) against each transcript (probe set). This multiple testing allows for more robust background assessments and gene expression measures, and has permitted the development of many computational methods to translate image data into a single normalized "signal" for mRNA transcript abundance. There are now many probe set algorithms that have been developed, with a gradual movement away from chip-by-chip methods (MAS5), to project-based model-fitting methods (dCHIP, RMA, others). Data interpretation is often profoundly changed by choice of algorithm, with disoriented biologists questioning what the "accurate" interpretation of their experiment is. Here, we summarize the debate concerning probe set algorithms. We provide examples of how changes in mismatch weight, normalizations, and construction of expression ratios each dramatically change data interpretation. All interpretations can be considered as computationally appropriate, but with varying biological credibility. We also illustrate the performance of two new hybrid algorithms (PLIER, GC-RMA) relative to more traditional algorithms (dCHIP, MAS5, Probe Profiler PCA, RMA) using an interactive power analysis tool. PLIER appears superior to other algorithms in avoiding false positives with poorly performing probe sets. Based on our interpretation of the literature, and examples presented here, we suggest that the variability in performance of probe set algorithms is more dependent upon assumptions regarding "background", than on calculations of "signal". We argue that "background" is an enormously complex variable that can only be vaguely quantified, and thus the "best" probe set algorithm will vary from project to project

    Modelling the spatial distribution of DEM Error

    Get PDF
    Assessment of a DEM’s quality is usually undertaken by deriving a measure of DEM accuracy – how close the DEM’s elevation values are to the true elevation. Measures such as Root Mean Squared Error and standard deviation of the error are frequently used. These measures summarise elevation errors in a DEM as a single value. A more detailed description of DEM accuracy would allow better understanding of DEM quality and the consequent uncertainty associated with using DEMs in analytical applications. The research presented addresses the limitations of using a single root mean squared error (RMSE) value to represent the uncertainty associated with a DEM by developing a new technique for creating a spatially distributed model of DEM quality – an accuracy surface. The technique is based on the hypothesis that the distribution and scale of elevation error within a DEM are at least partly related to morphometric characteristics of the terrain. The technique involves generating a set of terrain parameters to characterise terrain morphometry and developing regression models to define the relationship between DEM error and morphometric character. The regression models form the basis for creating standard deviation surfaces to represent DEM accuracy. The hypothesis is shown to be true and reliable accuracy surfaces are successfully created. These accuracy surfaces provide more detailed information about DEM accuracy than a single global estimate of RMSE

    Canopy nitrogen, carbon assimilation, and albedo in temperate and boreal forests: Functional relations and potential climate feedbacks

    Get PDF
    The availability of nitrogen represents a key constraint on carbon cycling in terrestrial ecosystems, and it is largely in this capacity that the role of N in the Earth\u27s climate system has been considered. Despite this, few studies have included continuous variation in plant N status as a driver of broad-scale carbon cycle analyses. This is partly because of uncertainties in how leaf-level physiological relationships scale to whole ecosystems and because methods for regional to continental detection of plant N concentrations have yet to be developed. Here, we show that ecosystem CO2 uptake capacity in temperate and boreal forests scales directly with whole-canopy N concentrations, mirroring a leaf-level trend that has been observed for woody plants worldwide. We further show that both CO2 uptake capacity and canopy N concentration are strongly and positively correlated with shortwave surface albedo. These results suggest that N plays an additional, and overlooked, role in the climate system via its influence on vegetation reflectivity and shortwave surface energy exchange. We also demonstrate that much of the spatial variation in canopy N can be detected by using broad-band satellite sensors, offering a means through which these findings can be applied toward improved application of coupled carbon cycle–climate models

    Physico-chemical foundations underpinning microarray and next-generation sequencing experiments

    Get PDF
    Hybridization of nucleic acids on solid surfaces is a key process involved in high-throughput technologies such as microarrays and, in some cases, next-generation sequencing (NGS). A physical understanding of the hybridization process helps to determine the accuracy of these technologies. The goal of a widespread research program is to develop reliable transformations between the raw signals reported by the technologies and individual molecular concentrations from an ensemble of nucleic acids. This research has inputs from many areas, from bioinformatics and biostatistics, to theoretical and experimental biochemistry and biophysics, to computer simulations. A group of leading researchers met in Ploen Germany in 2011 to discuss present knowledge and limitations of our physico-chemical understanding of high-throughput nucleic acid technologies. This meeting inspired us to write this summary, which provides an overview of the state-of-the-art approaches based on physico-chemical foundation to modeling of the nucleic acids hybridization process on solid surfaces. In addition, practical application of current knowledge is emphasized

    Definition of the σW regulon of Bacillus subtilis in the absence of stress

    Get PDF
    Bacteria employ extracytoplasmic function (ECF) sigma factors for their responses to environmental stresses. Despite intensive research, the molecular dissection of ECF sigma factor regulons has remained a major challenge due to overlaps in the ECF sigma factor-regulated genes and the stimuli that activate the different ECF sigma factors. Here we have employed tiling arrays to single out the ECF σW regulon of the Gram-positive bacterium Bacillus subtilis from the overlapping ECF σX, σY, and σM regulons. For this purpose, we profiled the transcriptome of a B. subtilis sigW mutant under non-stress conditions to select candidate genes that are strictly σW-regulated. Under these conditions, σW exhibits a basal level of activity. Subsequently, we verified the σW-dependency of candidate genes by comparing their transcript profiles to transcriptome data obtained with the parental B. subtilis strain 168 grown under 104 different conditions, including relevant stress conditions, such as salt shock. In addition, we investigated the transcriptomes of rasP or prsW mutant strains that lack the proteases involved in the degradation of the σW anti-sigma factor RsiW and subsequent activation of the σW-regulon. Taken together, our studies identify 89 genes as being strictly σW-regulated, including several genes for non-coding RNAs. The effects of rasP or prsW mutations on the expression of σW-dependent genes were relatively mild, which implies that σW-dependent transcription under non-stress conditions is not strictly related to RasP and PrsW. Lastly, we show that the pleiotropic phenotype of rasP mutant cells, which have defects in competence development, protein secretion and membrane protein production, is not mirrored in the transcript profile of these cells. This implies that RasP is not only important for transcriptional regulation via σW, but that this membrane protease also exerts other important post-transcriptional regulatory functions

    Examining smoking-induced differential gene expression changes in buccal mucosa

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene expression changes resulting from conditions such as disease, environmental stimuli, and drug use, can be monitored in the blood. However, a less invasive method of sample collection is of interest because of the discomfort and specialized personnel necessary for blood sampling especially if multiple samples are being collected. Buccal mucosa cells are easily collected and may be an alternative sample material for biomarker testing. A limited number of studies, primarily in the smoker/oral cancer literature, address this tissue's efficacy as an RNA source for expression analysis. The current study was undertaken to determine if total RNA isolated from buccal mucosa could be used as an alternative tissue source to assay relative gene expression.</p> <p>Methods</p> <p>Total RNA was isolated from swabs, reverse transcribed and amplified. The amplified cDNA was used in RT-qPCR and microarray analyses to evaluate gene expression in buccal cells. Initially, RT-qPCR was used to assess relative transcript levels of four genes from whole blood and buccal cells collected from the same seven individuals, concurrently. Second, buccal cell RNA was used for microarray-based differential gene expression studies by comparing gene expression between a group of female smokers and nonsmokers.</p> <p>Results</p> <p>An amplification protocol allowed use of less buccal cell total RNA (50 ng) than had been reported previously with human microarrays. Total RNA isolated from buccal cells was degraded but was of sufficient quality to be used with RT-qPCR to detect expression of specific genes. We report here the finding of a small number of statistically significant differentially expressed genes between smokers and nonsmokers, using buccal cells as starting material. Gene Set Enrichment Analysis confirmed that these genes had a similar expression pattern to results from another study.</p> <p>Conclusions</p> <p>Our results suggest that despite a high degree of degradation, RNA from buccal cells from cheek mucosa could be used to detect differential gene expression between smokers and nonsmokers. However the RNA degradation, increase in sample variability and microarray failure rate show that buccal samples should be used with caution as source material in expression studies.</p

    MicroRNAs targeting oncogenes are down-regulated in pancreatic malignant transformation from benign tumors

    Get PDF
    BACKGROUND MicroRNA (miRNA) expression profiles have been described in pancreatic ductal adenocarcinoma (PDAC), but these have not been compared with pre-malignant pancreatic tumors. We wished to compare the miRNA expression signatures in pancreatic benign cystic tumors (BCT) of low and high malignant potential with PDAC, in order to identify miRNAs deregulated during PDAC development. The mechanistic consequences of miRNA dysregulation were further evaluated. METHODS Tissue samples were obtained at a tertiary pancreatic unit from individuals with BCT and PDAC. MiRNA profiling was performed using a custom microarray and results were validated using RT-qPCR prior to evaluation of miRNA targets. RESULTS Widespread miRNA down-regulation was observed in PDAC compared to low malignant potential BCT. We show that amongst those miRNAs down-regulated, miR-16, miR-126 and let-7d regulate known PDAC oncogenes (targeting BCL2, CRK and KRAS respectively). Notably, miR-126 also directly targets the KRAS transcript at a "seedless" binding site within its 3'UTR. In clinical specimens, miR-126 was strongly down-regulated in PDAC tissues, with an associated elevation in KRAS and CRK proteins. Furthermore, miR-21, a known oncogenic miRNA in pancreatic and other cancers, was not elevated in PDAC compared to serous microcystic adenoma (SMCA), but in both groups it was up-regulated compared to normal pancreas, implicating early up-regulation during malignant change. CONCLUSIONS Expression profiling revealed 21 miRNAs down-regulated in PDAC compared to SMCA, the most benign lesion that rarely progresses to invasive carcinoma. It appears that miR-21 up-regulation is an early event in the transformation from normal pancreatic tissue. MiRNA expression has the potential to distinguish PDAC from normal pancreas and BCT. Mechanistically the down-regulation of miR-16, miR-126 and let-7d promotes PDAC transformation by post-transcriptional up-regulation of crucial PDAC oncogenes. We show that miR-126 is able to directly target KRAS; re-expression has the potential as a therapeutic strategy against PDAC and other KRAS-driven cancers

    Mechanisms controlling anaemia in Trypanosoma congolense infected mice.

    Get PDF
    Trypanosoma congolense are extracellular protozoan parasites of the blood stream of artiodactyls and are one of the main constraints on cattle production in Africa. In cattle, anaemia is the key feature of disease and persists after parasitaemia has declined to low or undetectable levels, but treatment to clear the parasites usually resolves the anaemia. The progress of anaemia after Trypanosoma congolense infection was followed in three mouse strains. Anaemia developed rapidly in all three strains until the peak of the first wave of parasitaemia. This was followed by a second phase, characterized by slower progress to severe anaemia in C57BL/6, by slow recovery in surviving A/J and a rapid recovery in BALB/c. There was no association between parasitaemia and severity of anaemia. Furthermore, functional T lymphocytes are not required for the induction of anaemia, since suppression of T cell activity with Cyclosporin A had neither an effect on the course of infection nor on anaemia. Expression of genes involved in erythropoiesis and iron metabolism was followed in spleen, liver and kidney tissues in the three strains of mice using microarrays. There was no evidence for a response to erythropoietin, consistent with anaemia of chronic disease, which is erythropoietin insensitive. However, the expression of transcription factors and genes involved in erythropoiesis and haemolysis did correlate with the expression of the inflammatory cytokines Il6 and Ifng. The innate immune response appears to be the major contributor to the inflammation associated with anaemia since suppression of T cells with CsA had no observable effect. Several transcription factors regulating haematopoiesis, Tal1, Gata1, Zfpm1 and Klf1 were expressed at consistently lower levels in C57BL/6 mice suggesting that these mice have a lower haematopoietic capacity and therefore less ability to recover from haemolysis induced anaemia after infection

    Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed.</p> <p>Results</p> <p>Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments.</p> <p>Conclusion</p> <p>The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.</p
    corecore