138 research outputs found

    How can I explore high-dimensional data?

    Full text link

    Expression profiling to predict outcome in breast cancer: the influence of sample selection

    Get PDF
    Gene expression profiling of tumors using DNA microarrays is a promising method for predicting prognosis and treatment response in cancer patients. It was recently reported that expression profiles of sporadic breast cancers could be used to predict disease recurrence better than currently available clinical and histopathological prognostic factors. Having observed an overlap in those data between the genes that predict outcome and those that predict estrogen receptor-α status, we examined their predictive power in an independent data set. We conclude that it may be important to define prognostic expression profiles separately for estrogen receptor-α-positive and estrogen receptor-α-negative tumors

    Environmentally induced DNA methylation is inherited across generations in an aquatic keystone species

    Get PDF
    Transgenerational inheritance of environmentally induced epigenetic marks can have significant impacts on eco-evolutionary dynamics, but the phenomenon remains controversial in ecological model systems. We used whole-genome bisulfite sequencing of individual water fleas (Daphnia magna) to assess whether environmentally induced DNA methylation is transgenerationally inherited. Genetically identical females were exposed to one of three natural stressors, or a de-methylating drug, and their offspring were propagated clonally for four generations under control conditions. We identified between 70 and 225 differentially methylated CpG positions (DMPs) in F1 individuals whose mothers were exposed to a natural stressor. Roughly half of these environmentally induced DMPs persisted until generation F4. In contrast, treatment with the drug demonstrated that pervasive hypomethylation upon exposure is reset almost completely after one generation. These results suggest that environmentally induced DNA methylation is non-random and stably inherited across generations in Daphnia, making epigenetic inheritance a putative factor in the eco-evolutionary dynamics of freshwater communities

    Role of conservative mutations in protein multi-property adaptation

    Get PDF
    Protein physicochemical properties must undergo complex changes during evolution, as a response to modifications in the organism environment, the result of the proteins taking up new roles or because of the need to cope with the evolution of molecular interacting partners. Recent work has emphasized the role of stability and stability–function trade-offs in these protein adaptation processes. In the present study, on the other hand, we report that combinations of a few conservative, high-frequency-of-fixation mutations in the thioredoxin molecule lead to largely independent changes in both stability and the diversity of catalytic mechanisms, as revealed by single-molecule atomic force spectroscopy. Furthermore, the changes found are evolutionarily significant, as they combine typically hyperthermophilic stability enhancements with modulations in function that span the ranges defined by the quite different catalytic patterns of thioredoxins from bacterial and eukaryotic origin. These results suggest that evolutionary protein adaptation may use, in some cases at least, the potential of conservative mutations to originate a multiplicity of evolutionarily allowed mutational paths leading to a variety of protein modulation patterns. In addition the results support the feasibility of using evolutionary information to achieve protein multi-feature optimization, an important biotechnological goal

    CD44 isoforms are heterogeneously expressed in breast cancer and correlate with tumor subtypes and cancer stem cell markers

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The CD44 cell adhesion molecule is aberrantly expressed in many breast tumors and has been implicated in the metastatic process as well as in the putative cancer stem cell (CSC) compartment. We aimed to investigate potential associations between alternatively spliced isoforms of CD44 and CSCs as well as to various breast cancer biomarkers and molecular subtypes.</p> <p>Methods</p> <p>We used q-RT-PCR and exon-exon spanning assays to analyze the expression of four alternatively spliced CD44 isoforms as well as the total expression of CD44 in 187 breast tumors and 13 cell lines. ALDH1 protein expression was determined by IHC on TMA.</p> <p>Results</p> <p>Breast cancer cell lines showed a heterogeneous expression pattern of the CD44 isoforms, which shifted considerably when cells were grown as mammospheres. Tumors characterized as positive for the CD44<sup>+</sup>/CD24<it><sup>- </sup></it>phenotype by immunohistochemistry were associated to all isoforms except the CD44 standard (CD44S) isoform, which lacks all variant exons. Conversely, tumors with strong expression of the CSC marker ALDH1 had elevated expression of CD44S. A high expression of the CD44v2-v10 isoform, which retain all variant exons, was correlated to positive steroid receptor status, low proliferation and luminal A subtype. The CD44v3-v10 isoform showed similar correlations, while high expression of CD44v8-v10 was correlated to positive EGFR, negative/low HER2 status and basal-like subtype. High expression of CD44S was associated with strong HER2 staining and also a subgroup of basal-like tumors. Unsupervised hierarchical cluster analysis of CD44 isoform expression data divided tumors into four main clusters, which showed significant correlations to molecular subtypes and differences in 10-year overall survival.</p> <p>Conclusions</p> <p>We demonstrate that individual CD44 isoforms can be associated to different breast cancer subtypes and clinical markers such as HER2, ER and PgR, which suggests involvement of CD44 splice variants in specific oncogenic signaling pathways. Efforts to link CD44 to CSCs and tumor progression should consider the expression of various CD44 isoforms.</p

    Molecular subtypes of breast cancer are associated with characteristic DNA methylation patterns

    Get PDF
    Introduction: Five different molecular subtypes of breast cancer have been identified through gene expression profiling. Each subtype has a characteristic expression pattern suggested to partly depend on cellular origin. We aimed to investigate whether the molecular subtypes also display distinct methylation profiles. Methods: We analysed methylation status of 807 cancer-related genes in 189 fresh frozen primary breast tumours and four normal breast tissue samples using an array-based methylation assay. Results: Unsupervised analysis revealed three groups of breast cancer with characteristic methylation patterns. The three groups were associated with the luminal A, luminal B and basal-like molecular subtypes of breast cancer, respectively, whereas cancers of the HER2-enriched and normal-like subtypes were distributed among the three groups. The methylation frequencies were significantly different between subtypes, with luminal B and basal-like tumours being most and least frequently methylated, respectively. Moreover, targets of the polycomb repressor complex in breast cancer and embryonic stem cells were more methylated in luminal B tumours than in other tumours. BRCA2-mutated tumours had a particularly high degree of methylation. Finally, by utilizing gene expression data, we observed that a large fraction of genes reported as having subtype-specific expression patterns might be regulated through methylation. Conclusions: We have found that breast cancers of the basal-like, luminal A and luminal B molecular subtypes harbour specific methylation profiles. Our results suggest that methylation may play an important role in the development of breast cancers

    Knowledge-based matrix factorization temporally resolves the cellular responses to IL-6 stimulation

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>External stimulations of cells by hormones, cytokines or growth factors activate signal transduction pathways that subsequently induce a re-arrangement of cellular gene expression. The analysis of such changes is complicated, as they consist of multi-layered temporal responses. While classical analyses based on clustering or gene set enrichment only partly reveal this information, matrix factorization techniques are well suited for a detailed temporal analysis. In signal processing, factorization techniques incorporating data properties like spatial and temporal correlation structure have shown to be robust and computationally efficient. However, such correlation-based methods have so far not be applied in bioinformatics, because large scale biological data rarely imply a natural order that allows the definition of a delayed correlation function.</p> <p>Results</p> <p>We therefore develop the concept of graph-decorrelation. We encode prior knowledge like transcriptional regulation, protein interactions or metabolic pathways in a weighted directed graph. By linking features along this underlying graph, we introduce a partial ordering of the features (e.g. genes) and are thus able to define a graph-delayed correlation function. Using this framework as constraint to the matrix factorization task allows us to set up the fast and robust graph-decorrelation algorithm (GraDe). To analyze alterations in the gene response in <it>IL-6 </it>stimulated primary mouse hepatocytes, we performed a time-course microarray experiment and applied GraDe. In contrast to standard techniques, the extracted time-resolved gene expression profiles showed that <it>IL-6 </it>activates genes involved in cell cycle progression and cell division. Genes linked to metabolic and apoptotic processes are down-regulated indicating that <it>IL-6 </it>mediated priming renders hepatocytes more responsive towards cell proliferation and reduces expenditures for the energy metabolism.</p> <p>Conclusions</p> <p>GraDe provides a novel framework for the decomposition of large-scale 'omics' data. We were able to show that including prior knowledge into the separation task leads to a much more structured and detailed separation of the time-dependent responses upon <it>IL-6 </it>stimulation compared to standard methods. A Matlab implementation of the GraDe algorithm is freely available at <url>http://cmb.helmholtz-muenchen.de/grade</url>.</p

    Landscape of somatic allelic imbalances and copy number alterations in HER2-amplified breast cancer

    Get PDF
    Introduction: Human epidermal growth factor receptor 2 (HER2)-amplified breast cancer represents a clinically well-defined subgroup due to availability of targeted treatment. However, HER2-amplified tumors have been shown to be heterogeneous at the genomic level by genome-wide microarray analyses, pointing towards a need of further investigations for identification of recurrent copy number alterations and delineation of patterns of allelic imbalance. Methods: High-density whole genome array-based comparative genomic hybridization (aCGH) and single nucleotide polymorphism (SNP) array data from 260 HER2-amplified breast tumors or cell lines, and 346 HER2-negative breast cancers with molecular subtype information were assembled from different repositories. Copy number alteration (CNA), loss-of-heterozygosity (LOH), copy number neutral allelic imbalance (CNN-AI), subclonal CNA and patterns of tumor DNA ploidy were analyzed using bioinformatical methods such as genomic identification of significant targets in cancer (GISTIC) and genome alteration print (GAP). The patterns of tumor ploidy were confirmed in 338 unrelated breast cancers analyzed by DNA flow cytometry with concurrent BAC aCGH and gene expression data. Results: A core set of 36 genomic regions commonly affected by copy number gain or loss was identified by integrating results with a previous study, together comprising > 400 HER2-amplified tumors. While CNN-AI frequency appeared evenly distributed over chromosomes in HER2-amplified tumors, not targeting specific regions and often < 20% in frequency, the occurrence of LOH was strongly associated with regions of copy number loss. HER2-amplified and HER2-negative tumors stratified by molecular subtypes displayed different patterns of LOH and CNN-AI, with basal-like tumors showing highest frequencies followed by HER2-amplified and luminal B cases. Tumor aneuploidy was strongly associated with increasing levels of LOH, CNN-AI, CNAs and occurrence of subclonal copy number events, irrespective of subtype. Finally, SNP data from individual tumors indicated that genomic amplification in general appears as monoallelic, that is, it preferentially targets one parental chromosome in HER2-amplified tumors. Conclusions: We have delineated the genomic landscape of CNAs, amplifications, LOH, and CNN-AI in HER2-amplified breast cancer, but also demonstrated a strong association between different types of genomic aberrations and tumor aneuploidy irrespective of molecular subtype
    • 

    corecore