13 research outputs found
GAIA: a gram-based interaction analysis tool – an approach for identifying interacting domains in yeast
<p>Abstract</p> <p>Background</p> <p>Protein-Protein Interactions (PPIs) play important roles in many biological functions. Protein domains, which are defined as independently folding structural blocks of proteins, physically interact with each other to perform these biological functions. Therefore, the identification of Domain-Domain Interactions (DDIs) is of great biological interests because it is generally accepted that PPIs are mediated by DDIs. As a result, much effort has been put on the prediction of domain pair interactions based on computational methods. Many DDI prediction tools using PPIs network and domain evolution information have been reported. However, tools that combine the primary sequences, domain annotations, and structural annotations of proteins have not been evaluated before.</p> <p>Results</p> <p>In this study, we report a novel approach called Gram-bAsed Interaction Analysis (GAIA). GAIA extracts peptide segments that are composed of fixed length of continuous amino acids, called n-grams (where n is the number of amino acids), from the annotated domain and DDI data set in <it>Saccharomyces cerevisiae </it>(budding yeast) and identifies a list of n-grams that may contribute to DDIs and PPIs based on the frequencies of their appearance. GAIA also reports the coordinate position of gram pairs on each interacting domain pair. We demonstrate that our approach improves on other DDI prediction approaches when tested against a gold-standard data set and achieves a true positive rate of 82% and a false positive rate of 21%. We also identify a list of 4-gram pairs that are significantly over-represented in the DDI data set and may mediate PPIs.</p> <p>Conclusion</p> <p>GAIA represents a novel and reliable way to predict DDIs that mediate PPIs. Our results, which show the localizations of interacting grams/hotspots, provide testable hypotheses for experimental validation. Complemented with other prediction methods, this study will allow us to elucidate the interactome of cells.</p
Genomic basis for RNA alterations in cancer.
Transcript alterations often result from somatic changes in cancer genomes1. Various forms of RNA alterations have been described in cancer, including overexpression2, altered splicing3 and gene fusions4; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)5. Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed 'bridged' fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer
There are three columns – the name of the parameter, its current value and a check box to indicate if this parameter is enabled
<p><b>Copyright information:</b></p><p>Taken from "Pegasys: software for executing and integrating analyses of biological sequences"</p><p>BMC Bioinformatics 2004;5():40-40.</p><p>Published online 19 Apr 2004</p><p>PMCID:PMC406494.</p><p>Copyright © 2004 Shah et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</p> Disabled parameters will be excluded from the DAG XML, and consequently from the actual command that is executed on the server. All default values are set in the ProgramList.xml file that the server reads on startup
The International Cancer Genome Consortium's evolving data-protection policies
The International Cancer Genome Consortium (ICGC) is an international initiative launched in 2008 to coordinate large-scale cancer genome studies from 50 different cancer types and subtypes of global clinical and societal importance. As of May 2014, 74 project teams in 17 countries in Asia, Australia, Europe and North America are analyzing more than 25,000 tumor genomes (http://www.icgc.org/). In addition to its goal of generating comprehensive catalogs of genomic abnormalities to facilitate better understanding, treatment and prevention of cancer, ICGC is committed to making the data it generates available to the international research community as rapidly as possible, with minimal restrictions on appropriate data use. Here, we describe the experience of ICGC in addressing privacy and confidentiality issues raised by recent articles that expose re-identification threats for research participants