93 research outputs found
An evaluation of DNA-damage response and cell-cycle pathways for breast cancer classification
Accurate subtyping or classification of breast cancer is important for
ensuring proper treatment of patients and also for understanding the molecular
mechanisms driving this disease. While there have been several gene signatures
proposed in the literature to classify breast tumours, these signatures show
very low overlaps, different classification performance, and not much relevance
to the underlying biology of these tumours. Here we evaluate DNA-damage
response (DDR) and cell cycle pathways, which are critical pathways implicated
in a considerable proportion of breast tumours, for their usefulness and
ability in breast tumour subtyping. We think that subtyping breast tumours
based on these two pathways could lead to vital insights into molecular
mechanisms driving these tumours. Here, we performed a systematic evaluation of
DDR and cell-cycle pathways for subtyping of breast tumours into the five known
intrinsic subtypes. Homologous Recombination (HR) pathway showed the best
performance in subtyping breast tumours, indicating that HR genes are strongly
involved in all breast tumours. Comparisons of pathway based signatures and two
standard gene signatures supported the use of known pathways for breast tumour
subtyping. Further, the evaluation of these standard gene signatures showed
that breast tumour subtyping, prognosis and survival estimation are all closely
related. Finally, we constructed an all-inclusive super-signature by combining
(union of) all genes and performing a stringent feature selection, and found it
to be reasonably accurate and robust in classification as well as prognostic
value. Adopting DDR and cell cycle pathways for breast tumour subtyping
achieved robust and accurate breast tumour subtyping, and constructing a
super-signature which contains feature selected mix of genes from these
molecular pathways as well as clinical aspects is valuable in clinical
practice.Comment: 28 pages, 7 figures, 6 table
Inferring synthetic lethal interactions from mutual exclusivity of genetic events in cancer
Background: Synthetic lethality (SL) refers to the genetic interaction
between two or more genes where only their co-alteration (e.g. by mutations,
amplifications or deletions) results in cell death. In recent years, SL has
emerged as an attractive therapeutic strategy against cancer: by targeting the
SL partners of altered genes in cancer cells, these cells can be selectively
killed while sparing the normal cells. Consequently, a number of studies have
attempted prediction of SL interactions in human, a majority by extrapolating
SL interactions inferred through large-scale screens in model organisms.
However, these predicted SL interactions either do not hold in human cells or
do not include genes that are (frequently) altered in human cancers, and are
therefore not attractive in the context of cancer therapy.
Results: Here, we develop a computational approach to infer SL interactions
directly from frequently altered genes in human cancers. It is based on the
observation that pairs of genes that are altered in a (significantly) mutually
exclusive manner in cancers are likely to constitute lethal combinations. Using
genomic copy-number and gene-expression data from four cancers, breast,
prostate, ovarian and uterine (total 3980 samples) from The Cancer Genome
Atlas, we identify 718 genes that are frequently amplified or upregulated, and
are likely to be synthetic lethal with six key DNA-damage response (DDR) genes
in these cancers. By comparing with published data on gene essentiality (~16000
genes) from ten DDR-deficient cancer cell lines, we show that our identified
genes are enriched among the top quartile of essential genes in these cell
lines, implying that our inferred genes are highly likely to be (synthetic)
lethal upon knockdown in these cell lines.Comment: 35 pages, 7 figure
Examination of the relationship between essential genes in PPI network and hub proteins in reverse nearest neighbor topology
Abstract Background In many protein-protein interaction (PPI) networks, densely connected hub proteins are more likely to be essential proteins. This is referred to as the "centrality-lethality rule", which indicates that the topological placement of a protein in PPI network is connected with its biological essentiality. Though such connections are observed in many PPI networks, the underlying topological properties for these connections are not yet clearly understood. Some suggested putative connections are the involvement of essential proteins in the maintenance of overall network connections, or that they play a role in essential protein clusters. In this work, we have attempted to examine the placement of essential proteins and the network topology from a different perspective by determining the correlation of protein essentiality and reverse nearest neighbor topology (RNN). Results The RNN topology is a weighted directed graph derived from PPI network, and it is a natural representation of the topological dependences between proteins within the PPI network. Similar to the original PPI network, we have observed that essential proteins tend to be hub proteins in RNN topology. Additionally, essential genes are enriched in clusters containing many hub proteins in RNN topology (RNN protein clusters). Based on these two properties of essential genes in RNN topology, we have proposed a new measure; the RNN cluster centrality. Results from a variety of PPI networks demonstrate that RNN cluster centrality outperforms other centrality measures with regard to the proportion of selected proteins that are essential proteins. We also investigated the biological importance of RNN clusters. Conclusions This study reveals that RNN cluster centrality provides the best correlation of protein essentiality and placement of proteins in PPI network. Additionally, merged RNN clusters were found to be topologically important in that essential proteins are significantly enriched in RNN clusters, and biologically important because they play an important role in many Gene Ontology (GO) processes.http://deepblue.lib.umich.edu/bitstream/2027.42/78257/1/1471-2105-11-505.xmlhttp://deepblue.lib.umich.edu/bitstream/2027.42/78257/2/1471-2105-11-505-S1.DOChttp://deepblue.lib.umich.edu/bitstream/2027.42/78257/3/1471-2105-11-505.pdfPeer Reviewe
Complex-based analysis of dysregulated cellular processes in cancer
Background: Differential expression analysis of (individual) genes is often
used to study their roles in diseases. However, diseases such as cancer are a
result of the combined effect of multiple genes. Gene products such as proteins
seldom act in isolation, but instead constitute stable multi-protein complexes
performing dedicated functions. Therefore, complexes aggregate the effect of
individual genes (proteins) and can be used to gain a better understanding of
cancer mechanisms. Here, we observe that complexes show considerable changes in
their expression, in turn directed by the concerted action of transcription
factors (TFs), across cancer conditions. We seek to gain novel insights into
cancer mechanisms through a systematic analysis of complexes and their
transcriptional regulation.
Results: We integrated large-scale protein-interaction (PPI) and
gene-expression datasets to identify complexes that exhibit significant changes
in their expression across different conditions in cancer. We devised a
log-linear model to relate these changes to the differential regulation of
complexes by TFs. The application of our model on two case studies involving
pancreatic and familial breast tumour conditions revealed: (i) complexes in
core cellular processes, especially those responsible for maintaining genome
stability and cell proliferation (e.g. DNA damage repair and cell cycle) show
considerable changes in expression; (ii) these changes include decrease and
countering increase for different sets of complexes indicative of compensatory
mechanisms coming into play in tumours; and (iii) TFs work in cooperative and
counteractive ways to regulate these mechanisms. Such aberrant complexes and
their regulating TFs play vital roles in the initiation and progression of
cancer.Comment: 22 pages, BMC Systems Biolog
MCL-CAw: A refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure
Abstract Background The reconstruction of protein complexes from the physical interactome of organisms serves as a building block towards understanding the higher level organization of the cell. Over the past few years, several independent high-throughput experiments have helped to catalogue enormous amount of physical protein interaction data from organisms such as yeast. However, these individual datasets show lack of correlation with each other and also contain substantial number of false positives (noise). Over these years, several affinity scoring schemes have also been devised to improve the qualities of these datasets. Therefore, the challenge now is to detect meaningful as well as novel complexes from protein interaction (PPI) networks derived by combining datasets from multiple sources and by making use of these affinity scoring schemes. In the attempt towards tackling this challenge, the Markov Clustering algorithm (MCL) has proved to be a popular and reasonably successful method, mainly due to its scalability, robustness, and ability to work on scored (weighted) networks. However, MCL produces many noisy clusters, which either do not match known complexes or have additional proteins that reduce the accuracies of correctly predicted complexes. Results Inspired by recent experimental observations by Gavin and colleagues on the modularity structure in yeast complexes and the distinctive properties of "core" and "attachment" proteins, we develop a core-attachment based refinement method coupled to MCL for reconstruction of yeast complexes from scored (weighted) PPI networks. We combine physical interactions from two recent "pull-down" experiments to generate an unscored PPI network. We then score this network using available affinity scoring schemes to generate multiple scored PPI networks. The evaluation of our method (called MCL-CAw) on these networks shows that: (i) MCL-CAw derives larger number of yeast complexes and with better accuracies than MCL, particularly in the presence of natural noise; (ii) Affinity scoring can effectively reduce the impact of noise on MCL-CAw and thereby improve the quality (precision and recall) of its predicted complexes; (iii) MCL-CAw responds well to most available scoring schemes. We discuss several instances where MCL-CAw was successful in deriving meaningful complexes, and where it missed a few proteins or whole complexes due to affinity scoring of the networks. We compare MCL-CAw with several recent complex detection algorithms on unscored and scored networks, and assess the relative performance of the algorithms on these networks. Further, we study the impact of augmenting physical datasets with computationally inferred interactions for complex detection. Finally, we analyse the essentiality of proteins within predicted complexes to understand a possible correlation between protein essentiality and their ability to form complexes. Conclusions We demonstrate that core-attachment based refinement in MCL-CAw improves the predictions of MCL on yeast PPI networks. We show that affinity scoring improves the performance of MCL-CAw.http://deepblue.lib.umich.edu/bitstream/2027.42/78256/1/1471-2105-11-504.xmlhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/2/1471-2105-11-504-S1.PDFhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/3/1471-2105-11-504-S2.ZIPhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/4/1471-2105-11-504.pdfPeer Reviewe
- …
