73 research outputs found

    Discovering bipartite substructure in directed networks

    Get PDF
    Bipartivity is an important network concept that can be applied to nodes, edges and communities. Here we focus on directed networks and look for subnetworks made up of two distinct groups of nodes, connected by “one-way” links. We show that a spectral approach can be used to find hidden substructure of this form. Theoretical support is given for the idealised case where there is limited overlap between subnetworks. Numerical experiments show that the approach is robust to spurious and missing edges. A key application of this work is in the analysis of high-throughput gene expression data, and we give an example where a biologically meaningful directed bipartite subnetwork is found from a cancer microarray dataset

    DNA meets the SVD

    Get PDF
    This paper introduces an important area of computational cell biology where complex, publicly available genomic data is being examined by linear algebra methods, with the aim of revealing biological and medical insights

    Multidimensional partitioning and bi-partitioning

    Get PDF
    Eigenvectors and, more generally, singular vectors, have proved to be useful tools for data mining and dimension reduction. Spectral clustering and reordering algorithms have been designed and implemented in many disciplines, and they can be motivated from several dierent standpoints. Here we give a general, unied, derivation from an applied linear algebra perspective. We use a variational approach that has the benet of (a) naturally introducing an appropriate scaling, (b) allowing for a solution in any desired dimension, and (c) dealing with both the clustering and bi-clustering issues in the same framework. The motivation and analysis is then backed up with examples involving two large data sets from modern, high-throughput, experimental cell biology. Here, the objects of interest are genes and tissue samples, and the experimental data represents gene activity. We show that looking beyond the dominant, or Fiedler, direction reveals important information

    Discretization Provides a Conceptually Simple Tool to Build Expression Networks

    Get PDF
    Biomarker identification, using network methods, depends on finding regular co-expression patterns; the overall connectivity is of greater importance than any single relationship. A second requirement is a simple algorithm for ranking patients on how relevant a gene-set is. For both of these requirements discretized data helps to first identify gene cliques, and then to stratify patients

    Erratum: “Searches for Gravitational Waves from Known Pulsars at Two Harmonics in 2015–2017 LIGO Data” (2019, ApJ, 879, 10)

    Get PDF
    Due to an error at the publisher, in the published article the number of pulsars presented in the paper is incorrect in multiple places throughout the text. Specifically, "222" pulsars should be "221." Additionally, the number of pulsars for which we have EM observations that fully overlap with O1 and O2 changes from "168" to "167." Elsewhere, in the machine-readable table of Table 1 and in Table 2, the row corresponding to pulsar J0952-0607 should be excised as well. Finally, in the caption for Table 2 the number of pulsars changes from "188" to "187.

    Spectral algorithms for heterogeneous biological networks

    Get PDF
    pectral methods, which use information relating to eigenvectors, singular vectors and generalized singular vectors, help us to visualize and summarize sets of pairwise interactions. In this work, we motivate and discuss the use of spectral methods by taking a matrix computation view and applying concepts from applied linear algebra. We show that this unified approach is sufficiently flexible to allow multiple sources of network information to be combined. We illustrate the methods on microarray data arising from a large population-based study in human adipose tissue, combined with related information concerning metabolic pathways
    corecore