74 research outputs found
Quantized correlation coefficient for measuring reproducibility of ChIP-chip data
<p>Abstract</p> <p>Background</p> <p>Chromatin immunoprecipitation followed by microarray hybridization (ChIP-chip) is used to study protein-DNA interactions and histone modifications on a genome-scale. To ensure data quality, these experiments are usually performed in replicates, and a correlation coefficient between replicates is used often to assess reproducibility. However, the correlation coefficient can be misleading because it is affected not only by the reproducibility of the signal but also by the amount of binding signal present in the data.</p> <p>Results</p> <p>We develop the Quantized correlation coefficient (QCC) that is much less dependent on the amount of signal. This involves discretization of data into set of quantiles (quantization), a merging procedure to group the background probes, and recalculation of the Pearson correlation coefficient. This procedure reduces the influence of the background noise on the statistic, which then properly focuses more on the reproducibility of the signal. The performance of this procedure is tested in both simulated and real ChIP-chip data. For replicates with different levels of enrichment over background and coverage, we find that QCC reflects reproducibility more accurately and is more robust than the standard Pearson or Spearman correlation coefficients. The quantization and the merging procedure can also suggest a proper quantile threshold for separating signal from background for further analysis.</p> <p>Conclusions</p> <p>To measure reproducibility of ChIP-chip data correctly, a correlation coefficient that is robust to the amount of signal present should be used. QCC is one such measure. The QCC statistic can also be applied in a variety of other contexts for measuring reproducibility, including analysis of array CGH data for DNA copy number and gene expression data.</p
Somatic Mutational Landscape of Splicing Factor Genes and Their Functional Consequences across 33 Cancer Types
Hotspot mutations in splicing factor genes have been recently reported at high frequency in hematological malignancies, suggesting the importance of RNA splicing in cancer. We analyzed whole-exome sequencing data across 33 tumor types in The Cancer Genome Atlas (TCGA), and we identified 119 splicing factor genes with significant non-silent mutation patterns, including mutation over-representation, recurrent loss of function (tumor suppressor-like), or hotspot mutation profile (oncogene-like). Furthermore, RNA sequencing analysis revealed altered splicing events associated with selected splicing factor mutations. In addition, we were able to identify common gene pathway profiles associated with the presence of these mutations. Our analysis suggests that somatic alteration of genes involved in the RNA-splicing process is common in cancer and may represent an underappreciated hallmark of tumorigenesis
Recommended from our members
Identification of Chromatin-Associated Regulators of MSL Complex Targeting in Drosophila Dosage Compensation
Sex chromosome dosage compensation in Drosophila provides a model for understanding how chromatin organization can modulate coordinate gene regulation. Male Drosophila increase the transcript levels of genes on the single male X approximately two-fold to equal the gene expression in females, which have two X-chromosomes. Dosage compensation is mediated by the Male-Specific Lethal (MSL) histone acetyltransferase complex. Five core components of the MSL complex were identified by genetic screens for genes that are specifically required for male viability and are dispensable for females. However, because dosage compensation must interface with the general transcriptional machinery, it is likely that identifying additional regulators that are not strictly male-specific will be key to understanding the process at a mechanistic level. Such regulators would not have been recovered from previous male-specific lethal screening strategies. Therefore, we have performed a cell culture-based, genome-wide RNAi screen to search for factors required for MSL targeting or function. Here we focus on the discovery of proteins that function to promote MSL complex recruitment to “chromatin entry sites,” which are proposed to be the initial sites of MSL targeting. We find that components of the NSL (Non-specific lethal) complex, and a previously unstudied zinc-finger protein, facilitate MSL targeting and display a striking enrichment at MSL entry sites. Identification of these factors provides new insight into how MSL complex establishes the specialized hyperactive chromatin required for dosage compensation in Drosophila
Analyses of clinicopathological, molecular, and prognostic associations of KRAS codon 61 and codon 146 mutations in colorectal cancer: cohort study and literature review
Background: KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods: We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurse’s Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results: KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type, overall, KRAS-mutated cancers more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants. Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions: Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted
Ambaixada a Brussel·les.
Hotspot mutations in splicing factor genes have been recently reported at high frequency in hematological malignancies, suggesting the importance of RNA splicing in cancer. We analyzed whole-exome sequencing data across 33 tumor types in The Cancer Genome Atlas (TCGA), and we identified 119 splicing factor genes with significant non-silent mutation patterns, including mutation over-representation, recurrent loss of function (tumor suppressor-like), or hotspot mutation profile (oncogene-like). Furthermore, RNA sequencing analysis revealed altered splicing events associated with selected splicing factor mutations. In addition, we were able to identify common gene pathway profiles associated with the presence of these mutations. Our analysis suggests that somatic alteration of genes involved in the RNA-splicing process is common in cancer and may represent an underappreciated hallmark of tumorigenesis
Mutant N-RAS Protects Colorectal Cancer Cells from Stress-Induced Apoptosis and Contributes to Cancer Development and Progression
N-Ras is one member of a family of oncoproteins that are commonly mutated in cancer. Activating mutations in N-Ras occur in a subset of colorectal cancers, but little in known about how the mutant protein contributes to onset and progression of the disease. Using genetically engineered mice, we find that mutant N-Ras strongly promotes tumorigenesis in the context of inflammation. The pro-tumorigenic nature of mutant N-Ras is related to its anti-apoptotic function, which is mediated by activation of a non-canonical MAPK pathway that signals through Stat3. As a result, inhibition of MEK selectively induces apoptosis in autochthonous colonic tumors expressing mutant N-Ras. The translational significance of this finding is highlighted by our observation that NRAS mutation correlates with a less favorable clinical outcome for colorectal cancer patients. These data demonstrate for the first time the important role that N-Ras plays in colorectal cancer.
Recommended from our members
Exome and whole genome sequencing of esophageal adenocarcinoma identifies recurrent driver events and mutational complexity
The incidence of esophageal adenocarcinoma (EAC) has risen 600% over the last 30 years. With a five-year survival rate of 15%, identification of new therapeutic targets for EAC is greatly important. We analyze the mutation spectra from whole exome sequencing of 149 EAC tumors/normal pairs, 15 of which have also been subjected to whole genome sequencing. We identify a mutational signature defined by a high prevalence of A to C transversions at AA dinucleotides. Statistical analysis of exome data identified significantly mutated 26 genes. Of these genes, four (TP53, CDKN2A, SMAD4, and PIK3CA) have been previously implicated in EAC. The novel significantly mutated genes include chromatin modifying factors and candidate contributors: SPG20, TLR4, ELMO1, and DOCK2. Functional analyses of EAC-derived mutations in ELMO1 reveal increased cellular invasion. Therefore, we suggest a new hypothesis about the potential activation of the RAC1 pathway to be a contributor to EAC tumorigenesis
Normalization and experimental design for ChIP-chip data
<p>Abstract</p> <p>Background</p> <p>Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been widely used to investigate the DNA binding sites for a variety of proteins on a genome-wide scale. However, several issues in the processing and analysis of ChIP-chip data have not been resolved fully, including the effect of background (mock control) subtraction and normalization within and across arrays.</p> <p>Results</p> <p>The binding profiles of <it>Drosophila </it>male-specific lethal (MSL) complex on a tiling array provide a unique opportunity for investigating these topics, as it is known to bind on the X chromosome but not on the autosomes. These large bound and control regions on the same array allow clear evaluation of analytical methods.</p> <p>We introduce a novel normalization scheme specifically designed for ChIP-chip data from dual-channel arrays and demonstrate that this step is critical for correcting systematic dye-bias that may exist in the data. Subtraction of the mock (non-specific antibody or no antibody) control data is generally needed to eliminate the bias, but appropriate normalization obviates the need for mock experiments and increases the correlation among replicates. The idea underlying the normalization can be used subsequently to estimate the background noise level in each array for normalization across arrays. We demonstrate the effectiveness of the methods with the MSL complex binding data and other publicly available data.</p> <p>Conclusion</p> <p>Proper normalization is essential for ChIP-chip experiments. The proposed normalization technique can correct systematic errors and compensate for the lack of mock control data, thus reducing the experimental cost and producing more accurate results.</p
- …