8 research outputs found
Visualizing Global Properties of Large Complex Networks
For complex biological networks, graphical representations are highly desired for understanding some design principles, but few drawing methods are available that capture topological features of a large and highly heterogeneous network, such as a protein interaction network. Here we propose the circular perspective drawing (CPD) method to visualize global structures of large complex networks. The presented CPD combines the quasi-continuous search (QCS) analogous to the steepest descent method with a random node swapping strategy for an enhanced calculation speed. The CPD depicts a network in an aesthetic manner by showing connection patterns between different parts of the network instead of detailed links between nodes. Global structural features of networks exhibited by CPD provide clues toward a comprehensive understanding of the network organizations. Availability: Software is freely available at http://www.cadlive.j
Hybridization interactions between probesets in short oligo microarrays lead to spurious correlations
BACKGROUND: Microarrays measure the binding of nucleotide sequences to a set of sequence specific probes. This information is combined with annotation specifying the relationship between probes and targets and used to make inferences about transcript- and, ultimately, gene expression. In some situations, a probe is capable of hybridizing to more than one transcript, in others, multiple probes can target a single sequence. These 'multiply targeted' probes can result in non-independence between measured expression levels. RESULTS: An analysis of these relationships for Affymetrix arrays considered both the extent and influence of exact matches between probe and transcript sequences. For the popular HGU133A array, approximately half of the probesets were found to interact in this way. Both real and simulated expression datasets were used to examine how these effects influenced the expression signal. It was found not only to lead to increased signal strength for the affected probesets, but the major effect is to significantly increase their correlation, even in situations when only a single probe from a probeset was involved. By building a network of probe-probeset-transcript relationships, it is possible to identify families of interacting probesets. More than 10% of the families contain members annotated to different genes or even different Unigene clusters. Within a family, a mixture of genuine biological and artefactual correlations can occur. CONCLUSION: Multiple targeting is not only prevalent, but also significant. The ability of probesets to hybridize to more than one gene product can lead to false positives when analysing gene expression. Comprehensive annotation describing multiple targeting is required when interpreting array data
Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies
The dramatic increase in heterogeneous types of biological data—in particular, the abundance of new protein sequences—requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity—GPCRs and kinases from humans, and the crotonase superfamily of enzymes—we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships
Towards a Processual Microbial Ontology
types: ArticleStandard microbial evolutionary ontology is organized according to a
nested hierarchy of entities at various levels of biological organization. It typically
detects and defines these entities in relation to the most stable aspects of evolutionary
processes, by identifying lineages evolving by a process of vertical inheritance
from an ancestral entity. However, recent advances in microbiology indicate
that such an ontology has important limitations. The various dynamics detected
within microbiological systems reveal that a focus on the most stable entities (or
features of entities) over time inevitably underestimates the extent and nature of
microbial diversity. These dynamics are not the outcome of the process of vertical
descent alone. Other processes, often involving causal interactions between entities
from distinct levels of biological organisation, or operating at different time scales,
are responsible not only for the destabilisation of pre-existing entities, but also for
the emergence and stabilisation of novel entities in the microbial world. In this
article we consider microbial entities as more or less stabilised functional wholes,
and sketch a network-based ontology that can represent a diverse set of processes
including, for example, as well as phylogenetic relations, interactions that stabilise
or destabilise the interacting entities, spatial relations, ecological connections, and
genetic exchanges. We use this pluralistic framework for evaluating (i) the existing
ontological assumptions in evolution (e.g. whether currently recognized entities are
adequate for understanding the causes of change and stabilisation in the microbial
world), and (ii) for identifying hidden ontological kinds, essentially invisible from
within a more limited perspective. We propose to recognize additional classes of
entities that provide new insights into the structure of the microbial world, namely ‘‘processually equivalent’’ entities, ‘‘processually versatile’’ entities, and ‘‘stabilized’’
entities.Economic and Social Research Council, U
Protein-protein interaction networks and biology - What's the connection?
Analysis of protein-protein interaction networks is an increasingly popular means to infer biological insight, but is close enough attention being paid to data handling protocols and the degree of bias in the data