71 research outputs found

    Relative effects of mutability and selection on single nucleotide polymorphisms in transcribed regions of the human genome

    Get PDF
    <p>Abstract</p> <p>Motivation</p> <p>Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation in humans. However, the factors that affect SNP density are poorly understood. The goal of this study was to estimate the relative effects of mutability and selection on SNP density in transcribed regions of human genes. It is important for prediction of the regions that harbor functional polymorphisms.</p> <p>Results</p> <p>We used frequency-validated SNPs resulting from single-nucleotide substitutions. SNPs were subdivided into five functional categories: (i) 5' untranslated region (UTR) SNPs, (ii) 3' UTR SNPs, (iii) synonymous SNPs, (iv) SNPs producing conservative missense mutations, and (v) SNPs producing radical missense mutations. Each of these categories was further subdivided into nine mutational categories on the basis of the single-nucleotide substitution type. Thus, 45 functional/mutational categories were analyzed. The relative mutation rate in each mutational category was estimated on the basis of published data. The proportion of segregating sites (PSSs) for each functional/mutational category was estimated by dividing the observed number of SNPs by the number of potential sites in the genome for a given functional/mutational category. By analyzing each functional group separately, we found significant positive correlations between PSSs and relative mutation rates (Spearman's correlation coefficient, at least r = 0.96, df = 9, <it>P </it>< 0.001). We adjusted the PSSs for the mutation rate and found that the functional category had a significant effect on SNP density (F = 5.9, df = 4, <it>P </it>= 0.001), suggesting that selection affects SNP density in transcribed regions of the genome. We used analyses of variance and covariance to estimate the relative effects of selection (functional category) and mutability (relative mutation rate) on the PSSs and found that approximately 87% of variation in PSS was due to variation in the mutation rate and approximately 13% was due to selection, suggesting that the probability that a site located in a transcribed region of a gene is polymorphic mostly depends on the mutability of the site.</p

    Building a Statistical Model for Predicting Cancer Genes

    Get PDF
    More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to build a binary logistic regression model identifying PCa genes. Internal and external validation of the model was conducted using a validation set (also from KnowledgeNet), permutations, and external data on genes with recurrent prostate tumor mutations. We evaluated a set of 33 gene characteristics as predictors. Sixteen of the original 33 predictors were significant in the model. We found that a typical PCa gene is a prostate-specific transcription factor, kinase, or phosphatase with high interindividual variance of the expression level in adjacent normal prostate tissue and differential expression between normal prostate tissue and primary tumor. PCa genes are likely to have an antiapoptotic effect and to play a role in cell proliferation, angiogenesis, and cell adhesion. Their proteins are likely to be ubiquitinated or sumoylated but not acetylated. A number of novel PCa candidates have been proposed. Functional annotations of novel candidates identified antiapoptosis, regulation of cell proliferation, positive regulation of kinase activity, positive regulation of transferase activity, angiogenesis, positive regulation of cell division, and cell adhesion as top functions. We provide the list of the top 200 predicted PCa genes, which can be used as candidates for experimental validation. The model may be modified to predict genes for other cancer sites

    How to Get the Most from Microarray Data: Advice from Reverse Genomics

    Get PDF
    Whole-genome profiling of gene expression is a powerful tool for identifying cancer-associated genes. Genes differentially expressed between normal and tumorous tissues are usually considered to be cancer associated. We recently demonstrated that the analysis of interindividual variation in gene expression can be useful for identifying cancer associated genes. The goal of this study was to identify the best microarray data–derived predictor of known cancer associated genes. We found that the traditional approach of identifying cancer genes—identifying differentially expressed genes—is not very efficient. The analysis of interindividual variation of gene expression in tumor samples identifies cancer-associated genes more effectively. The results were consistent across 4 major types of cancer: breast, colorectal, lung, and prostate. We used recently reported cancer-associated genes (2011–2012) for validation and found that novel cancer-associated genes can be best identified by elevated variance of the gene expression in tumor samples

    Modified Logistic Regression Models Using Gene Coexpression and Clinical Features to Predict Prostate Cancer Progression

    Get PDF
    Predicting disease progression is one of the most challenging problems in prostate cancer research. Adding gene expression data to prediction models that are based on clinical features has been proposed to improve accuracy. In the current study, we applied a logistic regression (LR) model combining clinical features and gene co-expression data to improve the accuracy of the prediction of prostate cancer progression. The top-scoring pair (TSP) method was used to select genes for the model. The proposed models not only preserved the basic properties of the TSP algorithm but also incorporated the clinical features into the prognostic models. Based on the statistical inference with the iterative cross validation, we demonstrated that prediction LR models that included genes selected by the TSP method provided better predictions of prostate cancer progression than those using clinical variables only and/or those that included genes selected by the one-gene-at-a-time approach. Thus, we conclude that TSP selection is a useful tool for feature (and/or gene) selection to use in prognostic models and our model also provides an alternative for predicting prostate cancer progression

    Prediction of the Gene Expression in Normal Lung Tissue by the Gene Expression in Blood

    Get PDF
    Background: Comparative analysis of gene expression in human tissues is important for understanding the molecular mechanisms underlying tissue-specific control of gene expression. It can also open an avenue for using gene expression in blood (which is the most easily accessible human tissue) to predict gene expression in other (less accessible) tissues, which would facilitate the development of novel gene expression based models for assessing disease risk and progression. Until recently, direct comparative analysis across different tissues was not possible due to the scarcity of paired tissue samples from the same individuals. Methods: In this study we used paired whole blood/lung gene expression data from the Genotype-Tissue Expression (GTEx) project. We built a generalized linear regression model for each gene using gene expression in lung as the outcome and gene expression in blood, age and gender as predictors. Results: For ~18 % of the genes, gene expression in blood was a significant predictor of gene expression in lung. We found that the number of single nucleotide polymorphisms (SNPs) influencing expression of a given gene in either blood or lung, also known as the number of quantitative trait loci (eQTLs), was positively associated with efficacy of blood-based prediction of that gene’s expression in lung. This association was strongest for shared eQTLs: those influencing gene expression in both blood and lung. Conclusions: In conclusion, for a considerable number of human genes, their expression levels in lung can be predicted using observable gene expression in blood. An abundance of shared eQTLs may explain the strong blood/lung correlations in the gene expression

    Variants at IRF5-TNPO3, 17q12-21 and MMEL1 are associated with primary biliary cirrhosis

    Get PDF
    We genotyped individuals with primary biliary cirrhosis and unaffected controls for suggestive risk loci (genome-wide association P \u3c 1 × 10−4) identified in a previous genome-wide association study. Combined analysis of the genome-wide association and replication datasets identified IRF5-TNPO3 (combined P = 8.66 × 10−13), 7q12-21 (combined P = 3.50 × 10−13) and MMEL1 (combined P = 3.15 × 10−8) as new primary biliary cirrhosis susceptibility loci. Fine-mapping studies showed that a single variant accounts for the IRF5-TNPO3 association. As these loci are implicated in other autoimmune conditions, these findings confirm genetic overlap among such diseases

    INPP4B suppresses prostate cancer cell invasion

    Get PDF
    Background INPP4B and PTEN dual specificity phosphatases are frequently lost during progression of prostate cancer to metastatic disease. We and others have previously shown that loss of INPP4B expression correlates with poor prognosis in multiple malignancies and with metastatic spread in prostate cancer. Results We demonstrate that de novo expression of INPP4B in highly invasive human prostate carcinoma PC-3 cells suppresses their invasion both in vitro and in vivo. Using global gene expression analysis, we found that INPP4B regulates a number of genes associated with cell adhesion, the extracellular matrix, and the cytoskeleton. Importantly, de novo expressed INPP4B suppressed the proinflammatory chemokine IL-8 and induced PAK6. These genes were regulated in a reciprocal manner following downregulation of INPP4B in the independently derived INPP4B-positive LNCaP prostate cancer cell line. Inhibition of PI3K/Akt pathway, which is highly active in both PC-3 and LNCaP cells, did not reproduce INPP4B mediated suppression of IL-8 mRNA expression in either cell type. In contrast, inhibition of PKC signaling phenocopied INPP4B-mediated inhibitory effect on IL-8 in either prostate cancer cell line. In PC-3 cells, INPP4B overexpression caused a decline in the level of metastases associated BIRC5 protein, phosphorylation of PKC, and expression of the common PKC and IL-8 downstream target, COX-2. Reciprocally, COX-2 expression was increased in LNCaP cells following depletion of endogenous INPP4B. Conclusion Taken together, we discovered that INPP4B is a novel suppressor of oncogenic PKC signaling, further emphasizing the role of INPP4B in maintaining normal physiology of the prostate epithelium and suppressing metastatic potential of prostate tumors

    GWAS Meets Microarray: Are the Results of Genome-Wide Association Studies and Gene-Expression Profiling Consistent? Prostate Cancer as an Example

    Get PDF
    Genome-wide association studies (GWASs) and global profiling of gene expression (microarrays) are two major technological breakthroughs that allow hypothesis-free identification of candidate genes associated with tumorigenesis. It is not obvious whether there is a consistency between the candidate genes identified by GWAS (GWAS genes) and those identified by profiling gene expression (microarray genes).We used the Cancer Genetic Markers Susceptibility database to retrieve single nucleotide polymorphisms from candidate genes for prostate cancer. In addition, we conducted a large meta-analysis of gene expression data in normal prostate and prostate tumor tissue. We identified 13,905 genes that were interrogated by both GWASs and microarrays. On the basis of P values from GWASs, we selected 1,649 most significantly associated genes for functional annotation by the Database for Annotation, Visualization and Integrated Discovery. We also conducted functional annotation analysis using same number of the top genes identified in the meta-analysis of the gene expression data. We found that genes involved in cell adhesion were overrepresented among both the GWAS and microarray genes.We conclude that the results of these analyses suggest that combining GWAS and microarray data would be a more effective approach than analyzing individual datasets and can help to refine the identification of candidate genes and functions associated with tumor development

    Tissue Effects in a Randomized Controlled Trial of Short-term Finasteride in Early Prostate Cancer.

    Get PDF
    BackgroundIn the Prostate Cancer Prevention Trial, finasteride selectively suppressed low-grade prostate cancer and significantly reduced the incidence of prostate cancer in men treated with finasteride compared with placebo. However, an apparent increase in high-grade disease was also observed among men randomized to finasteride. We aimed to determine why and hypothesized that there is a grade-dependent response to finasteride.MethodsFrom 2007 to 2012, we randomized dynamically by intranet-accessible software 183 men with localized prostate cancer to receive 5mg finasteride or placebo daily in a double-blind study during the 4-6weeks preceding prostatectomy. As the primary end point, the expression of a predefined molecular signature (ERβ, UBE2C, SRD5A2, and VEGF) differentiating high- and low-grade tumors in Gleason grade (GG) 3 areas of finasteride-exposed tumors from those in GG3 areas of placebo-exposed tumors, adjusted for Gleason score (GS) at prostatectomy, was compared. We also determined androgen receptor (AR) levels, Ki-67, and cleaved caspase 3 to evaluate the effects of finasteride on the expression of its downstream target, cell proliferation, and apoptosis, respectively. The expression of these markers was also compared across grades between and within treatment groups. Logistic regression was used to assess the expression of markers.FindingsWe found that the predetermined molecular signature did not distinguish GG3 from GG4 areas in the placebo group. However, AR expression was significantly lower in the GG4 areas of the finasteride group than in those of the placebo group. Within the finasteride group, AR expression was also lower in GG4 than in GG3 areas, but not significantly. Expression of cleaved caspase 3 was significantly increased in both GG3 and GG4 areas in the finasteride group compared to the placebo group, although it was lower in GG4 than in GG3 areas in both groups.InterpretationWe showed that finasteride's effect on apoptosis and AR expression is tumor grade dependent after short-term intervention. This may explain finasteride's selective suppression of low-grade tumors observed in the PCPT

    Prioritizing genes associated with prostate cancer development

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The genetic control of prostate cancer development is poorly understood. Large numbers of gene-expression datasets on different aspects of prostate tumorigenesis are available. We used these data to identify and prioritize candidate genes associated with the development of prostate cancer and bone metastases. Our working hypothesis was that combining meta-analyses on different but overlapping steps of prostate tumorigenesis will improve identification of genes associated with prostate cancer development.</p> <p>Methods</p> <p>A <it>Z </it>score-based meta-analysis of gene-expression data was used to identify candidate genes associated with prostate cancer development. To put together different datasets, we conducted a meta-analysis on 3 levels that follow the natural history of prostate cancer development. For experimental verification of candidates, we used in silico validation as well as in-house gene-expression data.</p> <p>Results</p> <p>Genes with experimental evidence of an association with prostate cancer development were overrepresented among our top candidates. The meta-analysis also identified a considerable number of novel candidate genes with no published evidence of a role in prostate cancer development. Functional annotation identified cytoskeleton, cell adhesion, extracellular matrix, and cell motility as the top functions associated with prostate cancer development. We identified 10 genes--<it>CDC2, CCNA2, IGF1, EGR1, SRF, CTGF, CCL2, CAV1, SMAD4</it>, and <it>AURKA</it>--that form hubs of the interaction network and therefore are likely to be primary drivers of prostate cancer development.</p> <p>Conclusions</p> <p>By using this large 3-level meta-analysis of the gene-expression data to identify candidate genes associated with prostate cancer development, we have generated a list of candidate genes that may be a useful resource for researchers studying the molecular mechanisms underlying prostate cancer development.</p
    • …
    corecore