62 research outputs found

    Modeling genetic heterogeneity of drug response and resistance in cancer

    Get PDF
    Heterogeneity in tumors is recognized as a key contributor to drug resistance and spread of advanced disease, but deep characterization of genetic variation within tumors has only recently been quantifiable with the advancement of next generation sequencing and single cell technologies. These data have been essential in developing molecular models of how tumors develop, evolve, and respond to environmental changes, such as therapeutic intervention. A deeper understanding of tumor evolution has subsequently opened up new research efforts to develop mathematical models that account for evolutionary dynamics with the goal of predicting drug response and resistance in cancer. This study describes recent advances and limitations of how models of tumor evolution can impact treatment strategies for cancer patients.</p

    Extent, impact, and mitigation of batch effects in tumor biomarker studies using tissue microarrays

    Get PDF
    Tissue microarrays (TMAs) have been used in thousands of cancer biomarker studies. To what extent batch effects, measurement error in biomarker levels between slides, affects TMA-based studies has not been assessed systematically. We evaluated 20 protein biomarkers on 14 TMAs with prospectively collected tumor tissue from 1,448 primary prostate cancers. In half of the biomarkers, more than 10% of biomarker variance was attributable to between-TMA differences (range, 1–48%). We implemented different methods to mitigate batch effects (R package batchtma), tested in plasmode simulation. Biomarker levels were more similar between mitigation approaches compared to uncorrected values. For some biomarkers, associations with clinical features changed substantially after addressing batch effects. Batch effects and resulting bias are not an error of an individual study but an inherent feature of TMA-based protein biomarker studies. They always need to be considered during study design and addressed analytically in studies using more than one TMA

    The role of mutation rate variation and genetic diversity in the architecture of human disease

    Get PDF
    Background We have investigated the role that the mutation rate and the structure of genetic variation at a locus play in determining whether a gene is involved in disease. We predict that the mutation rate and its genetic diversity should be higher in genes associated with disease, unless all genes that could cause disease have already been identified. Results Consistent with our predictions we find that genes associated with Mendelian and complex disease are substantially longer than non-disease genes. However, we find that both Mendelian and complex disease genes are found in regions of the genome with relatively low mutation rates, as inferred from intron divergence between humans and chimpanzees, and they are predicted to have similar rates of non-synonymous mutation as other genes. Finally, we find that disease genes are in regions of significantly elevated genetic diversity, even when variation in the rate of mutation is controlled for. The effect is small nevertheless. Conclusions Our results suggest that gene length contributes to whether a gene is associated with disease. However, the mutation rate and the genetic architecture of the locus appear to play only a minor role in determining whether a gene is associated with disease

    A codon substitution model that incorporates the effect of the GC contents, the gene density and the density of CpG islands of human chromosomes

    Get PDF
    Abstract Background Developing a model for codon substitutions is essential for the analyses of protein sequences. Recent studies on the mutation rates in the non-coding regions have shown that CpG mutation rates in the human genome are negatively correlated to the local GC content and to the densities of functional elements. This study aimed at understanding the effect of genomic features, namely, GC content, gene density, and frequency of CpG islands, on the rates of codon substitution in human chromosomes. Results Codon substitution rates of CpG to TpG mutations, TpG to CpG mutations, and non-CpG transitions and transversions in humans were estimated by comparing the coding regions of thousands of human and chimpanzee genes and inferring their ancestral sequences by using macaque genes as the outgroup. Since the genomic features are depending on each other, partial regression coefficients of these features were obtained. Conclusion The substitution rates of codons depend on gene densities of the chromosomes. Transcription-associated mutation is one such pressure. On the basis of these results, a model of codon substitutions that incorporates the effect of genomic features on codon substitution in human chromosomes was developed.</p

    Establishing the baseline level of repetitive element expression in the human cortex

    Get PDF
    Background: Although nearly half of the human genome is comprised of repetitive sequences, the expression profile of these elements remains largely uncharacterized. Recently developed high throughput sequencing technologies provide us with a powerful new set of tools to study repeat elements. Hence, we performed whole transcriptome sequencing to investigate the expression of repetitive elements in human frontal cortex using postmortem tissue obtained from the Stanley Medical Research Institute. Results: We found a significant amount of reads from the human frontal cortex originate from repeat elements. We also noticed that Alu elements were expressed at levels higher than expected by random or background transcription. In contrast, L1 elements were expressed at lower than expected amounts. Conclusions: Repetitive elements are expressed abundantly in the human brain. This expression pattern appears to be element specific and can not be explained by random or background transcription. These results demonstrate that our knowledge about repetitive elements is far from complete. Further characterization is required to determine the mechanism, the control, and the effects of repeat element expression

    Inhibition of de novo lipogenesis targets androgen receptor signaling in castration-resistant prostate cancer

    Get PDF
    A hallmark of prostate cancer progression is dysregulation of lipid metabolism via overexpression of fatty acid synthase (FASN), a key enzyme in de novo fatty acid synthesis. Metastatic castration-resistant prostate cancer (mCRPC) develops resistance to inhibitors of androgen receptor (AR) signaling through a variety of mechanisms, including the emergence of the constitutively active AR variant V7 (AR-V7). Here, we developed an FASN inhibitor (IPI-9119) and demonstrated that selective FASN inhibition antagonizes CRPC growth through metabolic reprogramming and results in reduced protein expression and transcriptional activity of both full-length AR (AR-FL) and AR-V7. Activation of the reticulum endoplasmic stress response resulting in reduced protein synthesis was involved in IPI-9119-mediated inhibition of the AR pathway. In vivo, IPI-9119 reduced growth of AR-V7-driven CRPC xenografts and human mCRPC-derived organoids and enhanced the efficacy of enzalutamide in CRPC cells. In human mCRPC, both FASN and AR-FL were detected in 87% of metastases. AR-V7 was found in 39% of bone metastases and consistently coexpressed with FASN. In patients treated with enzalutamide and/or abiraterone FASN/AR-V7 double-positive metastases were found in 77% of cases. These findings provide a compelling rationale for the use of FASN inhibitors in mCRPCs, including those overexpressing AR-V7.Giorgia Zadra, Caroline F. Ribeiro, Paolo Chetta, Yeung Ho, Stefano Cacciatore ... Lisa M. Butler ... et al

    Relationship between amino acid composition and gene expression in the mouse genome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Codon bias is a phenomenon that refers to the differences in the frequencies of synonymous codons among different genes. In many organisms, natural selection is considered to be a cause of codon bias because codon usage in highly expressed genes is biased toward optimal codons. Methods have previously been developed to predict the expression level of genes from their nucleotide sequences, which is based on the observation that synonymous codon usage shows an overall bias toward a few codons called major codons. However, the relationship between codon bias and gene expression level, as proposed by the translation-selection model, is less evident in mammals.</p> <p>Findings</p> <p>We investigated the correlations between the expression levels of 1,182 mouse genes and amino acid composition, as well as between gene expression and codon preference. We found that a weak but significant correlation exists between gene expression levels and amino acid composition in mouse. In total, less than 10% of variation of expression levels is explained by amino acid components. We found the effect of codon preference on gene expression was weaker than the effect of amino acid composition, because no significant correlations were observed with respect to codon preference.</p> <p>Conclusion</p> <p>These results suggest that it is difficult to predict expression level from amino acid components or from codon bias in mouse.</p

    Late Replicating Domains Are Highly Recombining in Females but Have Low Male Recombination Rates: Implications for Isochore Evolution

    Get PDF
    In mammals sequences that are either late replicating or highly recombining have high rates of evolution at putatively neutral sites. As early replicating domains and highly recombining domains both tend to be GC rich we a priori expect these two variables to covary. If so, the relative contribution of either of these variables to the local neutral substitution rate might have been wrongly estimated owing to covariance with the other. Against our expectations, we find that sex-averaged recombination rates show little or no correlation with replication timing, suggesting that they are independent determinants of substitution rates. However, this result masks significant sex-specific complexity: late replicating domains tend to have high recombination rates in females but low recombination rates in males. That these trends are antagonistic explains why sex-averaged recombination is not correlated with replication timing. This unexpected result has several important implications. First, although both male and female recombination rates covary significantly with intronic substitution rates, the magnitude of this correlation is moderately underestimated for male recombination and slightly overestimated for female recombination, owing to covariance with replicating timing. Second, the result could explain why male recombination is strongly correlated with GC content but female recombination is not. If to explain the correlation between GC content and replication timing we suppose that late replication forces reduced GC content, then GC promotion by biased gene conversion during female recombination is partly countered by the antagonistic effect of later replicating sequence tending increase AT content. Indeed, the strength of the correlation between female recombination rate and local GC content is more than doubled by control for replication timing. Our results underpin the need to consider sex-specific recombination rates and potential covariates in analysis of GC content and rates of evolution

    Large scale variation in the rate of germ-line de novo mutation, base composition, divergence and diversity in humans

    Get PDF
    It has long been suspected that the rate of mutation varies across the human genome at a large scale based on the divergence between humans and other species. However, it is now possible to directly investigate this question using the large number of de novo mutations (DNMs) that have been discovered in humans through the sequencing of trios. We investi- gate a number of questions pertaining to the distribution of mutations using more than 130,000 DNMs from three large datasets. We demonstrate that the amount and pattern of variation differs between datasets at the 1MB and 100KB scales probably as a consequence of differences in sequencing technology and processing. In particular, datasets show differ- ent patterns of correlation to genomic variables such as replication time. Never-the-less there are many commonalities between datasets, which likely represent true patterns. We show that there is variation in the mutation rate at the 100KB, 1MB and 10MB scale that can- not be explained by variation at smaller scales, however the level of this variation is modest at large scales–at the 1MB scale we infer that ~90% of regions have a mutation rate within 50% of the mean. Different types of mutation show similar levels of variation and appear to vary in concert which suggests the pattern of mutation is relatively constant across the genome. We demonstrate that variation in the mutation rate does not generate large-scale variation in GC-content, and hence that mutation bias does not maintain the isochore struc- ture of the human genome. We find that genomic features explain less than 40% of the explainable variance in the rate of DNM. As expected the rate of divergence between spe- cies is correlated to the rate of DNM. However, the correlations are weaker than expected if all the variation in divergence was due to variation in the mutation rate. We provide evidence that this is due the effect of biased gene conversion on the probability that a mutation will become fixed. In contrast to divergence, we find that most of the variation in diversity can be explained by variation in the mutation rate. Finally, we show that the correlation between divergence and DNM density declines as increasingly divergent species are considered

    Erratum: Corrigendum: Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution

    Get PDF
    International Chicken Genome Sequencing Consortium. The Original Article was published on 09 December 2004. Nature432, 695–716 (2004). In Table 5 of this Article, the last four values listed in the ‘Copy number’ column were incorrect. These should be: LTR elements, 30,000; DNA transposons, 20,000; simple repeats, 140,000; and satellites, 4,000. These errors do not affect any of the conclusions in our paper. Additional information. The online version of the original article can be found at 10.1038/nature0315
    corecore