78 research outputs found

    Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

    We introduce a novel method to screen the promoters of a set of genes with shared biological function against a precompiled library of motifs, and to find those motifs that are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold independently for every location window (measured with respect to the TSS), taking into account the location-dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high-throughput analysis, searching the promoters (from 200 bp downstream to 1000 bp upstream of the TSS) of more than 8000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location-specific search method with conservation between human and mouse, we are able to reliably identify functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast. Comment: 31 pages, including Supplementary Information and figure

    Identification of Candidate Growth Promoting Genes in Ovarian Cancer through Integrated Copy Number and Expression Analysis

    Ovarian cancer is a disease characterised by complex genomic rearrangements, but the majority of the genes that are the target of these alterations remain unidentified. Cataloguing these target genes will provide useful insights into the disease etiology and may provide an opportunity to develop novel diagnostic and therapeutic interventions. High-resolution genome-wide copy number and matching expression data from 68 primary epithelial ovarian carcinomas of various histotypes were integrated to identify genes in regions of most frequent amplification with the strongest correlation between expression and copy number. Regions on chromosomes 3, 7, 8, and 20 were most frequently increased in copy number (>40% of samples). Within these regions, 703/1370 (51%) unique gene expression probesets were differentially expressed when samples with gain were compared to samples without gain. 30% of these differentially expressed probesets also showed a strong positive correlation (r≥0.6) between expression and copy number. We also identified 21 regions of high-amplitude copy number gain, in which 32 known protein-coding genes showed a strong positive correlation between expression and copy number. Overall, our data validate previously known ovarian cancer genes, such as ERBB2, and also identify novel potential drivers such as MYNN, PUF60 and TPX2.

    High-resolution analysis of copy number alterations and associated expression changes in ovarian tumors

    Background: DNA copy number alterations are frequently observed in ovarian cancer, but it remains a challenge to identify the most relevant alterations and the specific causal genes in those regions. Methods: We obtained high-resolution 500K SNP array data for 52 ovarian tumors and identified the most statistically significant minimal genomic regions with the most prevalent and highest-level copy number alterations (recurrent CNAs). Within a region of recurrent CNA, comparison of expression levels in tumors with a given CNA to tumors lacking that CNA and to whole normal ovary samples was used to select genes with CNA-specific expression patterns. A public expression array data set of laser capture micro-dissected (LCM) non-malignant fallopian tube epithelia and LCM ovarian serous adenocarcinoma was used to evaluate the effect of cell-type mixture biases. Results: Fourteen recurrent deletions were detected on chromosomes 4, 6, 9, 12, 13, 15, 16, 17, 18, 22 and most prevalently on X and 8. Copy number and expression data suggest several apoptosis mediators as candidate drivers of the 8p deletions. Sixteen recurrent gains were identified on chromosomes 1, 2, 3, 5, 8, 10, 12, 15, 17, 19, and 20, with the most prevalent gains localized to 8q and 3q. Within the 8q amplicon, PVT1, but not MYC, was strongly over-expressed relative to tumors lacking this CNA and showed over-expression relative to normal ovary. Likewise, the cell polarity regulators PRKCI and ECT2 were identified as putative drivers of two distinct amplicons on 3q. Co-occurrence analyses suggested potential synergistic or antagonistic relationships between recurrent CNAs. Genes within regions of recurrent CNA showed an enrichment of Cancer Census genes, particularly when filtered for CNA-specific expression. Conclusion: These analyses provide detailed views of ovarian cancer genomic changes and highlight the benefits of using multiple reference sample types for the evaluation of CNA-specific expression changes.

    Identifying significant genetic regulatory networks in the prostate cancer from microarray data based on transcription factor analysis and conditional independency

    Background: Prostate cancer is a leading cancer worldwide and is characterized by aggressive metastasis. Reflecting its clinical heterogeneity, prostate cancer displays different stages and grades related to aggressive metastatic disease. Although numerous studies have used microarray analysis and traditional clustering methods to identify individual genes involved in the disease process, the important gene regulations remain unclear. We present a computational method for automatically inferring genetic regulatory networks from microarray data, using transcription factor analysis and conditional independence testing, to explore the potentially significant gene regulatory networks that are correlated with cancer, tumor grade and stage in prostate cancer. Results: To deal with missing values in microarray data, we used a K-nearest-neighbors (KNN) algorithm to estimate the expression values. We applied web services technology to wrap the bioinformatics toolkits and databases, automatically extracted the promoter regions of DNA sequences, and predicted the transcription factors that regulate gene expression. We adopted a microarray dataset consisting of 62 primary tumors and 41 normal prostate tissues from the Stanford Microarray Database (SMD) as the target dataset to evaluate our method. The predicted results revealed possible biomarker genes related to cancer and indicated that androgen functions and processes may be involved in the development of prostate cancer and promote cell death in the cell cycle. Our predicted results showed that sub-networks of the genes SREBF1, STAT6 and PBX1 are related to a high extent, while the ETS transcription factors ELK1, JUN and EGR2 are related to a low extent. The gene SLC22A3 may explain clinically the differentiation associated with high-grade cancer compared with low-grade cancer. Enhancer of Zeste Homolog 2 (EZH2), regulated by RUNX1 and STAT3, is correlated with the pathological stage. Conclusions: We provide a computational framework to reconstruct the genetic regulatory network from microarray data using biological knowledge and constraint-based inferences. Our method is helpful in verifying possible interaction relations in gene regulatory networks and filtering out incorrect relations inferred by imperfect methods. We not only predicted individual genes related to cancer but also discovered significant gene regulation networks. Our method is also validated in several enriched published papers and databases, and the significant gene regulatory networks perform critical biological functions and processes including cell adhesion molecules, androgen and estrogen metabolism, smooth muscle contraction, and GO-annotated processes. These significant gene regulations and the critical concept of tumor progression are useful for understanding cancer biology and disease treatment.

    Amplicon-Dependent CCNE1 Expression Is Critical for Clonogenic Survival after Cisplatin Treatment and Is Correlated with 20q11 Gain in Ovarian Cancer

    Genomic amplification of 19q12 occurs in several cancer types including ovarian cancer, where it is associated with primary treatment failure. We systematically attenuated expression of genes within the minimally defined 19q12 region in ovarian cell lines using short-interfering RNAs (siRNA) to identify driver oncogene(s) within the amplicon. Knockdown of CCNE1 resulted in G1/S phase arrest, reduced cell viability and apoptosis only in amplification-carrying cells. Although CCNE1 knockdown increased cisplatin resistance in short-term assays, clonogenic survival was inhibited after treatment. Gain of 20q11 was highly correlated with 19q12 amplification and spanned a 2.5 Mb region including TPX2, a centromeric protein required for mitotic spindle function. Expression of TPX2 was highly correlated with gene amplification and with CCNE1 expression in primary tumors. siRNA inhibition of TPX2 reduced cell viability, but this effect was not amplicon-dependent. These findings demonstrate that CCNE1 is a key driver in the 19q12 amplicon required for survival and clonogenicity in cells with locus amplification. Co-amplification at 19q12 and 20q11 implies the presence of a cooperative mutational network. These observations have implications for the application of targeted therapies in CCNE1-dependent ovarian cancers.

    Landscape of somatic allelic imbalances and copy number alterations in HER2-amplified breast cancer

    Introduction: Human epidermal growth factor receptor 2 (HER2)-amplified breast cancer represents a clinically well-defined subgroup due to the availability of targeted treatment. However, HER2-amplified tumors have been shown to be heterogeneous at the genomic level by genome-wide microarray analyses, pointing towards a need for further investigation to identify recurrent copy number alterations and delineate patterns of allelic imbalance. Methods: High-density whole genome array-based comparative genomic hybridization (aCGH) and single nucleotide polymorphism (SNP) array data from 260 HER2-amplified breast tumors or cell lines, and 346 HER2-negative breast cancers with molecular subtype information, were assembled from different repositories. Copy number alteration (CNA), loss-of-heterozygosity (LOH), copy number neutral allelic imbalance (CNN-AI), subclonal CNA and patterns of tumor DNA ploidy were analyzed using bioinformatic methods such as genomic identification of significant targets in cancer (GISTIC) and genome alteration print (GAP). The patterns of tumor ploidy were confirmed in 338 unrelated breast cancers analyzed by DNA flow cytometry with concurrent BAC aCGH and gene expression data. Results: A core set of 36 genomic regions commonly affected by copy number gain or loss was identified by integrating results with a previous study, together comprising > 400 HER2-amplified tumors. While CNN-AI frequency appeared evenly distributed over chromosomes in HER2-amplified tumors, not targeting specific regions and often < 20% in frequency, the occurrence of LOH was strongly associated with regions of copy number loss. HER2-amplified and HER2-negative tumors stratified by molecular subtypes displayed different patterns of LOH and CNN-AI, with basal-like tumors showing highest frequencies followed by HER2-amplified and luminal B cases. Tumor aneuploidy was strongly associated with increasing levels of LOH, CNN-AI, CNAs and occurrence of subclonal copy number events, irrespective of subtype. Finally, SNP data from individual tumors indicated that genomic amplification in general appears as monoallelic, that is, it preferentially targets one parental chromosome in HER2-amplified tumors. Conclusions: We have delineated the genomic landscape of CNAs, amplifications, LOH, and CNN-AI in HER2-amplified breast cancer, but also demonstrated a strong association between different types of genomic aberrations and tumor aneuploidy irrespective of molecular subtype.

    The Stress Response Factors Yap6, Cin5, Phd1, and Skn7 Direct Targeting of the Conserved Co-Repressor Tup1-Ssn6 in S. cerevisiae

    Maintaining the proper expression of the transcriptome during development or in response to a changing environment requires a delicate balance between transcriptional regulators with activating and repressing functions. The budding yeast transcriptional co-repressor Tup1-Ssn6 is a model for studying similar repressor complexes in multicellular eukaryotes. Tup1-Ssn6 does not bind DNA directly, but is directed to individual promoters by one or more DNA-binding proteins, referred to as Tup1 recruiters. This functional architecture allows Tup1-Ssn6 to modulate the expression of genes required for the response to a variety of cellular stresses. To understand the targeting of the Tup1-Ssn6 complex, we determined the genomic distribution of Tup1 and Ssn6 by ChIP-chip. We found that most loci bound by Tup1-Ssn6 could not be explained by co-occupancy with a known recruiting cofactor and that deletion of individual known Tup1 recruiters did not significantly alter the Tup1 binding profile. These observations suggest that new Tup1 recruiting proteins remain to be discovered and that Tup1 recruitment typically depends on multiple recruiting cofactors. To identify new recruiting proteins, we computationally screened for factors with binding patterns similar to the observed Tup1-Ssn6 genomic distribution. Four top candidates, Cin5, Skn7, Phd1, and Yap6, all known to be associated with stress response gene regulation, were experimentally confirmed to physically interact with Tup1 and/or Ssn6. Incorporating these new recruitment cofactors with previously characterized cofactors now explains the majority of Tup1 targeting across the genome, and expands our understanding of the mechanism by which Tup1-Ssn6 is directed to its targets.