4 research outputs found

    Predicting direct protein interactions from affinity purification mass spectrometry data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Affinity purification followed by mass spectrometry identification (AP-MS) is an increasingly popular approach to observe protein-protein interactions (PPI) <it>in vivo</it>. One drawback of AP-MS, however, is that it is prone to detecting indirect interactions mixed with direct physical interactions. Therefore, the ability to distinguish direct interactions from indirect ones is of much interest.</p> <p>Results</p> <p>We first propose a simple probabilistic model for the interactions captured by AP-MS experiments, under which the problem of separating direct interactions from indirect ones is formulated. Then, given idealized quantitative AP-MS data, we study the problem of identifying the most likely set of direct interactions that produced the observed data. We address this challenging graph theoretical problem by first characterizing signatures that can identify weakly connected nodes as well as dense regions of the network. The rest of the direct PPI network is then inferred using a genetic algorithm.</p> <p>Our algorithm shows good performance on both simulated and biological networks with very high sensitivity and specificity. Then the algorithm is used to predict direct interactions from a set of AP-MS PPI data from yeast, and its performance is measured against a high-quality interaction dataset.</p> <p>Conclusions</p> <p>As the sensitivity of AP-MS pipeline improves, the fraction of indirect interactions detected will also increase, thereby making the ability to distinguish them even more desirable. Despite the simplicity of our model for indirect interactions, our method provides a good performance on the test networks.</p

    An upper bound for the chromatic number of line graphs

    Get PDF
    It was conjectured by Reed [reed98conjecture] that for any graph GG, the graph's chromatic number χ(G)χ (G) is bounded above by Δ(G)+1+ω(G)/2\lceil Δ (G) +1 + ω (G) / 2\rceil , where Δ(G)Δ (G) and ω(G)ω (G) are the maximum degree and clique number of GG, respectively. In this paper we prove that this bound holds if GG is the line graph of a multigraph. The proof yields a polynomial time algorithm that takes a line graph GG and produces a colouring that achieves our bound

    Simultaneous Clustering of Multiple Gene Expression and Physical Interaction Datasets

    Get PDF
    Many genome-wide datasets are routinely generated to study different aspects of biological systems, but integrating them to obtain a coherent view of the underlying biology remains a challenge. We propose simultaneous clustering of multiple networks as a framework to integrate large-scale datasets on the interactions among and activities of cellular components. Specifically, we develop an algorithm JointCluster that finds sets of genes that cluster well in multiple networks of interest, such as coexpression networks summarizing correlations among the expression profiles of genes and physical networks describing protein-protein and protein-DNA interactions among genes or gene-products. Our algorithm provides an efficient solution to a well-defined problem of jointly clustering networks, using techniques that permit certain theoretical guarantees on the quality of the detected clustering relative to the optimal clustering. These guarantees coupled with an effective scaling heuristic and the flexibility to handle multiple heterogeneous networks make our method JointCluster an advance over earlier approaches. Simulation results showed JointCluster to be more robust than alternate methods in recovering clusters implanted in networks with high false positive rates. In systematic evaluation of JointCluster and some earlier approaches for combined analysis of the yeast physical network and two gene expression datasets under glucose and ethanol growth conditions, JointCluster discovers clusters that are more consistently enriched for various reference classes capturing different aspects of yeast biology or yield better coverage of the analysed genes. These robust clusters, which are supported across multiple genomic datasets and diverse reference classes, agree with known biology of yeast under these growth conditions, elucidate the genetic control of coordinated transcription, and enable functional predictions for a number of uncharacterized genes

    An upper bound for the chromatic number of line graphs

    Get PDF
    It was conjectured by Reed [reed98conjecture] that for any graph GG, the graph's chromatic number χ(G)χ (G) is bounded above by Δ(G)+1+ω(G)/2\lceil Δ (G) +1 + ω (G) / 2\rceil , where Δ(G)Δ (G) and ω(G)ω (G) are the maximum degree and clique number of GG, respectively. In this paper we prove that this bound holds if GG is the line graph of a multigraph. The proof yields a polynomial time algorithm that takes a line graph GG and produces a colouring that achieves our bound
    corecore