73 research outputs found
Discovering bipartite substructure in directed networks
Bipartivity is an important network concept that can be applied to nodes, edges and communities. Here we focus on directed networks and look for subnetworks made up of two distinct groups of nodes, connected by “one-way” links. We show that a spectral approach can be used to find hidden substructure of this form. Theoretical support is given for the idealised case where there is limited overlap between subnetworks. Numerical experiments show that the approach is robust to spurious and missing edges. A key application of this work is in the analysis of high-throughput gene expression data, and we give an example where a biologically meaningful directed bipartite subnetwork is found from a cancer microarray dataset
DNA meets the SVD
This paper introduces an important area of computational cell biology where complex, publicly available genomic data is being examined by linear algebra methods, with the aim of revealing biological and medical insights
Multidimensional partitioning and bi-partitioning
Eigenvectors and, more generally, singular vectors, have proved to be useful tools for data mining and dimension reduction. Spectral clustering and reordering algorithms have been designed and implemented in many disciplines, and they can be motivated from several dierent standpoints. Here we give a general, unied, derivation from an applied linear algebra perspective. We use a variational approach that has the benet of (a) naturally introducing an appropriate scaling, (b) allowing for a solution in any desired dimension, and (c) dealing with both the clustering and bi-clustering issues in the same framework. The motivation and analysis is then backed up with examples involving two large data sets from modern, high-throughput, experimental cell biology. Here, the objects of interest are genes and tissue samples, and the experimental data represents gene activity. We show that looking beyond the dominant, or Fiedler, direction reveals important information
Discretization Provides a Conceptually Simple Tool to Build Expression Networks
Biomarker identification, using network methods, depends on finding regular co-expression patterns; the overall connectivity is of greater importance than any single relationship. A second requirement is a simple algorithm for ranking patients on how relevant a gene-set is. For both of these requirements discretized data helps to first identify gene cliques, and then to stratify patients
Erratum: “Searches for Gravitational Waves from Known Pulsars at Two Harmonics in 2015–2017 LIGO Data” (2019, ApJ, 879, 10)
Due to an error at the publisher, in the published article the number of pulsars presented in the paper is incorrect in multiple places throughout the text. Specifically, "222" pulsars should be "221." Additionally, the number of pulsars for which we have EM observations that fully overlap with O1 and O2 changes from "168" to "167." Elsewhere, in the machine-readable table of Table 1 and in Table 2, the row corresponding to pulsar J0952-0607 should be excised as well. Finally, in the caption for Table 2 the number of pulsars changes from "188" to "187.
Spectral algorithms for heterogeneous biological networks
pectral methods, which use information relating to eigenvectors, singular vectors and generalized singular vectors, help us to visualize and summarize sets of pairwise interactions. In this work, we motivate and discuss the use of spectral methods by taking a matrix computation view and applying concepts from applied linear algebra. We show that this unified approach is sufficiently flexible to allow multiple sources of network information to be combined. We illustrate the methods on microarray data arising from a large population-based study in human adipose tissue, combined with related information concerning metabolic pathways
- …