8 research outputs found

    Assessing Matched Normal and Tumor Pairs in Next-Generation Sequencing Studies

    Get PDF
    Next generation sequencing technology has revolutionized the study of cancers. Through matched normal-tumor pairs, it is now possible to identify genome-wide germline and somatic mutations. The generation and analysis of the data requires rigorous quality checks and filtering, and the current analytical pipeline is constantly undergoing improvements. We noted however that in analyzing matched pairs, there is an implicit assumption that the sequenced data are matched, without any quality check such as those implemented in association studies. There are serious implications in this assumption as identification of germline and rare somatic variants depend on the normal sample being the matched pair. Using a genetics concept on measuring relatedness between individuals, we demonstrate that the matchedness of tumor pairs can be quantified and should be included as part of a quality protocol in analysis of sequenced data. Despite the mutation changes in cancer samples, matched tumor-normal pairs are still relatively similar in sequence compared to non-matched pairs. We demonstrate that the approach can be used to assess the mutation landscape between individuals

    An Integrated Approach to Identifying Cis-Regulatory Modules in the Human Genome

    Get PDF
    In eukaryotic genomes, it is challenging to accurately determine target sites of transcription factors (TFs) by only using sequence information. Previous efforts were made to tackle this task by considering the fact that TF binding sites tend to be more conserved than other functional sites and the binding sites of several TFs are often clustered. Recently, ChIP-chip and ChIP-sequencing experiments have been accumulated to identify TF binding sites as well as survey the chromatin modification patterns at the regulatory elements such as promoters and enhancers. We propose here a hidden Markov model (HMM) to incorporate sequence motif information, TF-DNA interaction data and chromatin modification patterns to precisely identify cis-regulatory modules (CRMs). We conducted ChIP-chip experiments on four TFs, CREB, E2F1, MAX, and YY1 in 1% of the human genome. We then trained a hidden Markov model (HMM) to identify the labels of the CRMs by incorporating the sequence motifs recognized by these TFs and the ChIP-chip ratio. Chromatin modification data was used to predict the functional sites and to further remove false positives. Cross-validation showed that our integrated HMM had a performance superior to other existing methods on predicting CRMs. Incorporating histone signature information successfully penalized false prediction and improved the whole performance. The dataset we used and the software are available at http://nash.ucsd.edu/CIS/

    A computational analysis of the dynamic roles of talin, Dok1, and PIPKI for integrin activation

    Get PDF
    Integrin signaling regulates cell migration and plays a pivotal role in developmental processes and cancer metastasis. Integrin signaling has been studied extensively and much data is available on pathway components and interactions. Yet the data is fragmented and an integrated model is missing. We use a rule-based modeling approach to integrate available data and test biological hypotheses regarding the role of talin, Dok1 and PIPKI in integrin activation. The detailed biochemical characterization of integrin signaling provides us with measured values for most of the kinetics parameters. However, measurements are not fully accurate and the cellular concentrations of signaling proteins are largely unknown and expected to vary substantially across different cellular conditions. By sampling model behaviors over the physiologically realistic parameter range we find that the model exhibits only two different qualitative behaviours and these depend mainly on the relative protein concentrations, which offers a powerful point of control to the cell. Our study highlights the necessity to characterize model behavior not for a single parameter optimum, but to identify parameter sets that characterize different signaling modes

    Antimicrobial Agents from Higher Plants

    No full text
    corecore