239 research outputs found

    Empirical Bayes analysis of single nucleotide polymorphisms

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>An important goal of whole-genome studies concerned with single nucleotide polymorphisms (SNPs) is the identification of SNPs associated with a covariate of interest such as the case-control status or the type of cancer. Since these studies often comprise the genotypes of hundreds of thousands of SNPs, methods are required that can cope with the corresponding multiple testing problem. For the analysis of gene expression data, approaches such as the empirical Bayes analysis of microarrays have been developed particularly for the detection of genes associated with the response. However, the empirical Bayes analysis of microarrays has only been suggested for binary responses when considering expression values, i.e. continuous predictors.</p> <p>Results</p> <p>In this paper, we propose a modification of this empirical Bayes analysis that can be used to analyze high-dimensional categorical SNP data. This approach along with a generalized version of the original empirical Bayes method are available in the R package siggenes version 1.10.0 and later that can be downloaded from <url>http://www.bioconductor.org</url>.</p> <p>Conclusion</p> <p>As applications to two subsets of the HapMap data show, the empirical Bayes analysis of microarrays cannot only be used to analyze continuous gene expression data, but also be applied to categorical SNP data, where the response is not restricted to be binary. In association studies in which typically several ten to a few hundred SNPs are considered, our approach can furthermore be employed to test interactions of SNPs. Moreover, the posterior probabilities resulting from the empirical Bayes analysis of (prespecified) interactions/genotypes can also be used to quantify the importance of these interactions.</p

    Application of Volcano Plots in Analyses of mRNA Differential Expressions with Microarrays

    Full text link
    Volcano plot displays unstandardized signal (e.g. log-fold-change) against noise-adjusted/standardized signal (e.g. t-statistic or -log10(p-value) from the t test). We review the basic and an interactive use of the volcano plot, and its crucial role in understanding the regularized t-statistic. The joint filtering gene selection criterion based on regularized statistics has a curved discriminant line in the volcano plot, as compared to the two perpendicular lines for the "double filtering" criterion. This review attempts to provide an unifying framework for discussions on alternative measures of differential expression, improved methods for estimating variance, and visual display of a microarray analysis result. We also discuss the possibility to apply volcano plots to other fields beyond microarray.Comment: 8 figure

    Transcript-Specific Expression Profiles Derived from Sequence-Based Analysis of Standard Microarrays

    Get PDF
    Background: Alternative mRNA processing mechanisms lead to multiple transcripts (i.e. splice isoforms) of a given gene which may have distinct biological functions. Microarrays like Affymetrix GeneChips measure mRNA expression of genes using sets of nucleotide probes. Until recently probe sets were not designed for transcript specificity. Nevertheless, the reanalysis of established microarray data using newly defined transcript-specific probe sets may provide information about expression levels of specific transcripts. Methodology/Principal Findings: In the present study alignment of probe sequences of the Affymetrix microarray HGU133A with Ensembl transcript sequences was performed to define transcript-specific probe sets. Out of a total of 247,965 perfect match probes, 95,008 were designated ‘‘transcript-specific’’, i.e. showing complete sequence alignment, no crosshybridization, and transcript-, not only gene-specificity. These probes were grouped into 7,941 transcript-specific probe sets and 15,619 gene-specific probe sets, respectively. The former were used to differentiate 445 alternative transcripts of 215 genes. For selected transcripts, predicted by this analysis to be differentially expressed in the human kidney, confirmatory real-time RT-PCR experiments were performed. First, the expression of two specific transcripts of the genes PPM1A (PP2CA_HUMAN and P35813) and PLG (PLMN_HUMAN and Q5TEH5) in human kidneys was determined by the transcriptspecific array analysis and confirmed by real-time RT-PCR. Secondly, disease-specific differential expression of single transcripts of PLG and ABCA1 (ABCA1_HUMAN and Q5VYS0_HUMAN) was computed from the available array data sets and confirmed by transcript-specific real-time RT-PCR. Conclusions: Transcript-specific analysis of microarray experiments can be employed to study gene-regulation on the transcript level using conventional microarray data. In this study, predictions based on sufficient probe set size and foldchange are confirmed by independent mean

    Deep EST profiling of developing fenugreek endosperm to investigate galactomannan biosynthesis and its regulation

    Get PDF
    Galactomannans are hemicellulosic polysaccharides composed of a (1 → 4)-linked β-D-mannan backbone substituted with single-unit (1 → 6)-α-linked D-galactosyl residues. Developing fenugreek (Trigonella foenum-graecum) seeds are known to accumulate large quantities of galactomannans in the endosperm, and were thus used here as a model system to better understand galactomannan biosynthesis and its regulation. We first verified the specific deposition of galactomannans in developing endosperms and determined that active accumulation occurred from 25 to 38 days post anthesis (DPA) under our growth conditions. We then examined the expression levels during seed development of ManS and GMGT, two genes encoding backbone and side chain synthetic enzymes. Based on transcript accumulation dynamics for ManS and GMGT, cDNA libraries were constructed using RNA isolated from endosperms at four ages corresponding to before, at the beginning of, and during active galactomannan deposition. DNA from these libraries was sequenced using the 454 sequencing technology to yield a total of 1.5 million expressed sequence tags (ESTs). Through analysis of the EST profiling data, we identified genes known to be involved in galactomannan biosynthesis, as well as new genes that may be involved in this process, and proposed a model for the flow of carbon from sucrose to galactomannans. Measurement of in vitro ManS and GMGT activities and analysis of sugar phosphate and nucleotide sugar levels in the endosperms of developing fenugreek seeds provided data consistent with this model. In vitro enzymatic assays also revealed that the ManS enzyme from fenugreek endosperm preferentially used GDP-mannose as the substrate for the backbone synthesis

    Evidence of gene-environment interaction for two genes on chromosome 4 and environmental tobacco smoke in controlling the risk of nonsyndromic cleft palate

    Get PDF
    Nonsyndromic cleft palate (CP) is one of the most common human birth defects and both genetic and environmental risk factors contribute to its etiology. We conducted a genome-wide association study (GWAS) using 550 CP case-parent trios ascertained in an international consortium. Stratified analysis among trios with different ancestries was performed to test for GxE interactions with common maternal exposures using conditional logistic regression models. While no single nucleotide polymorphism (SNP) achieved genome-wide significance when considered alone, markers in SLC2A9 and the neighboring WDR1 on chromosome 4p16.1 gave suggestive evidence of gene-environment interaction with environmental tobacco smoke (ETS) among 259 Asian trios when the models included a term for GxE interaction. Multiple SNPs in these two genes were associated with increased risk of nonsyndromic CP if the mother was exposed to ETS during the peri-conceptual period (3 months prior to conception through the first trimester). When maternal ETS was considered, fifteen of 135 SNPs mapping to SLC2A9 and 9 of 59 SNPs in WDR1 gave P values approaching genome-wide significance (10-6<P<10-4) in a test for GxETS interaction. SNPs rs3733585 and rs12508991 in SLC2A9 yielded P = 2.26×10-7 in a test for GxETS interaction. SNPs rs6820756 and rs7699512 in WDR1 also yielded P = 1.79×10-7 and P = 1.98×10-7 in a 1 df test for GxE interaction. Although further replication studies are critical to confirming these findings, these results illustrate how genetic associations for nonsyndromic CP can be missed if potential GxE interaction is not taken into account, and this study suggest SLC2A9 and WDR1 should be considered as candidate genes for CP. © 2014 Wu et al

    Absence of a specific radiation signature in post-Chernobyl thyroid cancers

    Get PDF
    Thyroid cancers have been the main medical consequence of the Chernobyl accident. On the basis of their pathological features and of the fact that a large proportion of them demonstrate RET-PTC translocations, these cancers are considered as similar to classical sporadic papillary carcinomas, although molecular alterations differ between both tumours. We analysed gene expression in post-Chernobyl cancers, sporadic papillary carcinomas and compared to autonomous adenomas used as controls. Unsupervised clustering of these data did not distinguish between the cancers, but separates both cancers from adenomas. No gene signature separating sporadic from post-Chernobyl PTC (chPTC) could be found using supervised and unsupervised classification methods although such a signature is demonstrated for cancers and adenomas. Furthermore, we demonstrate that pooled RNA from sporadic and chPTC are as strongly correlated as two independent sporadic PTC pools, one from Europe, one from the US involving patients not exposed to Chernobyl radiations. This result relies on cDNA and Affymetrix microarrays. Thus, platform-specific artifacts are controlled for. Our findings suggest the absence of a radiation fingerprint in the chPTC and support the concept that post-Chernobyl cancer data, for which the cancer-causing event and its date are known, are a unique source of information to study naturally occurring papillary carcinomas
    corecore