80 research outputs found

    Markov clustering versus affinity propagation for the partitioning of protein interaction graphs

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genome scale data on protein interactions are generally represented as large networks, or graphs, where hundreds or thousands of proteins are linked to one another. Since proteins tend to function in groups, or complexes, an important goal has been to reliably identify protein complexes from these graphs. This task is commonly executed using clustering procedures, which aim at detecting densely connected regions within the interaction graphs. There exists a wealth of clustering algorithms, some of which have been applied to this problem. One of the most successful clustering procedures in this context has been the Markov Cluster algorithm (MCL), which was recently shown to outperform a number of other procedures, some of which were specifically designed for partitioning protein interactions graphs. A novel promising clustering procedure termed Affinity Propagation (AP) was recently shown to be particularly effective, and much faster than other methods for a variety of problems, but has not yet been applied to partition protein interaction graphs.</p> <p>Results</p> <p>In this work we compare the performance of the Affinity Propagation (AP) and Markov Clustering (MCL) procedures. To this end we derive an unweighted network of protein-protein interactions from a set of 408 protein complexes from <it>S. cervisiae </it>hand curated in-house, and evaluate the performance of the two clustering algorithms in recalling the annotated complexes. In doing so the parameter space of each algorithm is sampled in order to select optimal values for these parameters, and the robustness of the algorithms is assessed by quantifying the level of complex recall as interactions are randomly added or removed to the network to simulate noise. To evaluate the performance on a weighted protein interaction graph, we also apply the two algorithms to the consolidated protein interaction network of <it>S. cerevisiae</it>, derived from genome scale purification experiments and to versions of this network in which varying proportions of the links have been randomly shuffled.</p> <p>Conclusion</p> <p>Our analysis shows that the MCL procedure is significantly more tolerant to noise and behaves more robustly than the AP algorithm. The advantage of MCL over AP is dramatic for unweighted protein interaction graphs, as AP displays severe convergence problems on the majority of the unweighted graph versions that we tested, whereas MCL continues to identify meaningful clusters, albeit fewer of them, as the level of noise in the graph increases. MCL thus remains the method of choice for identifying protein complexes from binary interaction networks.</p

    Misty Mountain clustering: application to fast unsupervised flow cytometry gating

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>There are many important clustering questions in computational biology for which no satisfactory method exists. Automated clustering algorithms, when applied to large, multidimensional datasets, such as flow cytometry data, prove unsatisfactory in terms of speed, problems with local minima or cluster shape bias. Model-based approaches are restricted by the assumptions of the fitting functions. Furthermore, model based clustering requires serial clustering for all cluster numbers within a user defined interval. The final cluster number is then selected by various criteria. These supervised serial clustering methods are time consuming and frequently different criteria result in different optimal cluster numbers. Various unsupervised heuristic approaches that have been developed such as affinity propagation are too expensive to be applied to datasets on the order of 10<sup>6 </sup>points that are often generated by high throughput experiments.</p> <p>Results</p> <p>To circumvent these limitations, we developed a new, unsupervised density contour clustering algorithm, called Misty Mountain, that is based on percolation theory and that efficiently analyzes large data sets. The approach can be envisioned as a progressive top-down removal of clouds covering a data histogram relief map to identify clusters by the appearance of statistically distinct peaks and ridges. This is a parallel clustering method that finds every cluster after analyzing only once the cross sections of the histogram. The overall run time for the composite steps of the algorithm increases linearly by the number of data points. The clustering of 10<sup>6 </sup>data points in 2D data space takes place within about 15 seconds on a standard laptop PC. Comparison of the performance of this algorithm with other state of the art automated flow cytometry gating methods indicate that Misty Mountain provides substantial improvements in both run time and in the accuracy of cluster assignment.</p> <p>Conclusions</p> <p>Misty Mountain is fast, unbiased for cluster shape, identifies stable clusters and is robust to noise. It provides a useful, general solution for multidimensional clustering problems. We demonstrate its suitability for automated gating of flow cytometry data.</p

    Differential Development of Human Brain White Matter Tracts

    Get PDF
    Neuroscience is increasingly focusing on developmental factors related to human structural and functional connectivity. Unfortunately, to date, diffusion-based imaging approaches have only contributed modestly to these broad objectives, despite the promise of diffusion-based tractography. Here, we report a novel data-driven approach to detect similarities and differences among white matter tracts with respect to their developmental trajectories, using 64-direction diffusion tensor imaging. Specifically, using a cross-sectional sample comprising 144 healthy individuals (7 to 48 years old), we applied k-means cluster analysis to separate white matter voxels based on their age-related trajectories of fractional anisotropy. Optimal solutions included 5-, 9- and 14-clusters. Our results recapitulate well-established tracts (e.g., internal and external capsule, optic radiations, corpus callosum, cingulum bundle, cerebral peduncles) and subdivisions within tracts (e.g., corpus callosum, internal capsule). For all but one tract identified, age-related trajectories were curvilinear (i.e., inverted ‘U-shape’), with age-related increases during childhood and adolescence followed by decreases in middle adulthood. Identification of peaks in the trajectories suggests that age-related losses in fractional anisotropy occur as early as 23 years of age, with mean onset at 30 years of age. Our findings demonstrate that data-driven analytic techniques may be fruitfully applied to extant diffusion tensor imaging datasets in normative and neuropsychiatric samples

    Effects of Transcriptional Pausing on Gene Expression Dynamics

    Get PDF
    Stochasticity in gene expression affects many cellular processes and is a source of phenotypic diversity between genetically identical individuals. Events in elongation, particularly RNA polymerase pausing, are a source of this noise. Since the rate and duration of pausing are sequence-dependent, this regulatory mechanism of transcriptional dynamics is evolvable. The dependency of pause propensity on regulatory molecules makes pausing a response mechanism to external stress. Using a delayed stochastic model of bacterial transcription at the single nucleotide level that includes the promoter open complex formation, pausing, arrest, misincorporation and editing, pyrophosphorolysis, and premature termination, we investigate how RNA polymerase pausing affects a gene's transcriptional dynamics and gene networks. We show that pauses' duration and rate of occurrence affect the bursting in RNA production, transcriptional and translational noise, and the transient to reach mean RNA and protein levels. In a genetic repressilator, increasing the pausing rate and the duration of pausing events increases the period length but does not affect the robustness of the periodicity. We conclude that RNA polymerase pausing might be an important evolvable feature of genetic networks

    Pattern of neural responses to verbal fluency shows diagnostic specificity for schizophrenia and bipolar disorder

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Impairments in executive function and language processing are characteristic of both schizophrenia and bipolar disorder. Their functional neuroanatomy demonstrate features that are shared as well as specific to each disorder. Determining the distinct pattern of neural responses in schizophrenia and bipolar disorder may provide biomarkers for their diagnoses.</p> <p>Methods</p> <p>104 participants underwent functional magnetic resonance imaging (fMRI) scans while performing a phonological verbal fluency task. Subjects were 32 patients with schizophrenia in remission, 32 patients with bipolar disorder in an euthymic state, and 40 healthy volunteers. Neural responses to verbal fluency were examined in each group, and the diagnostic potential of the pattern of the neural responses was assessed with machine learning analysis.</p> <p>Results</p> <p>During the verbal fluency task, both patient groups showed increased activation in the anterior cingulate, left dorsolateral prefrontal cortex and right putamen as compared to healthy controls, as well as reduced deactivation of precuneus and posterior cingulate. The magnitude of activation was greatest in patients with schizophrenia, followed by patients with bipolar disorder and then healthy individuals. Additional recruitment in the right inferior frontal and right dorsolateral prefrontal cortices was observed in schizophrenia relative to both bipolar disorder and healthy subjects. The pattern of neural responses correctly identified individual patients with schizophrenia with an accuracy of 92%, and those with bipolar disorder with an accuracy of 79% in which mis-classification was typically of bipolar subjects as healthy controls.</p> <p>Conclusions</p> <p>In summary, both schizophrenia and bipolar disorder are associated with altered function in prefrontal, striatal and default mode networks, but the magnitude of this dysfunction is particularly marked in schizophrenia. The pattern of response to verbal fluency is highly diagnostic for schizophrenia and distinct from bipolar disorder. Pattern classification of functional MRI measurements of language processing is a potential diagnostic marker of schizophrenia.</p

    'Same, but different' : a mixed methods realist evaluation of a cluster-randomized controlled participatory organizational intervention

    Get PDF
    Participatory organizational interventions are a recommended approach to improve the psychosocial work environment. As interventions of this type are shaped by employees and managers, their implementation can vary considerably, making evaluation challenging. This study contributes to our understanding of interventions by focusing on how the intervention mechanisms and the organizational context interact. In a mixed-methods design, we use multi-group structural equation modelling of pre-and post-intervention survey data (N = 204) to test multiple mediational mechanisms in three different contexts. We then analyse interviews (N = 67) and field observations of workshops to identify the role of contextual factors. The findings suggest that participatory organizational interventions do not produce one-size-fits-all results; on the contrary, intervention results are better understood as products of multiple intervention mechanisms interacting with the specific organizational contexts

    Regulation of Heterochromatin Assembly on Unpaired Chromosomes during Caenorhabditis elegans Meiosis by Components of a Small RNA-Mediated Pathway

    Get PDF
    Many organisms have a mechanism for down regulating the expression of non-synapsed chromosomes and chromosomal regions during meiosis. This phenomenon is thought to function in genome defense. During early meiosis in Caenorhabditis elegans, unpaired chromosomes (e.g., the male X chromosome) become enriched for a modification associated with heterochromatin and transcriptional repression, dimethylation of histone H3 on lysine 9 (H3K9me2). This enrichment requires activity of the cellular RNA-directed RNA polymerase, EGO-1. Here we use genetic mutation, RNA interference, immunofluorescence microscopy, fluorescence in situ hybridization, and molecular cloning methods to identify and analyze three additional regulators of meiotic H3K9me2 distribution: CSR-1 (a Piwi/PAZ/Argonaute protein), EKL-1 (a Tudor domain protein), and DRH-3 (a DEAH/D-box helicase). In csr-1, ekl-1, and drh-3 mutant males, we observed a reduction in H3K9me2 accumulation on the unpaired X chromosome and an increase in H3K9me2 accumulation on paired autosomes relative to controls. We observed a similar shift in H3K9me2 pattern in hermaphrodites that carry unpaired chromosomes. Based on several assays, we conclude that ectopic H3K9me2 accumulates on paired and synapsed chromosomes in these mutants. We propose alternative models for how a small RNA-mediated pathway may regulate H3K9me2 accumulation during meiosis. We also describe the germline phenotypes of csr-1, ekl-1, and drh-3 mutants. Our genetic data suggest that these factors, together with EGO-1, participate in a regulatory network to promote diverse aspects of development

    Caenorhabditis briggsae Recombinant Inbred Line Genotypes Reveal Inter-Strain Incompatibility and the Evolution of Recombination

    Get PDF
    The nematode Caenorhabditis briggsae is an emerging model organism that allows evolutionary comparisons with C. elegans and exploration of its own unique biological attributes. To produce a high-resolution C. briggsae recombination map, recombinant inbred lines were generated from reciprocal crosses between two strains and genotyped at over 1,000 loci. A second set of recombinant inbred lines involving a third strain was also genotyped at lower resolution. The resulting recombination maps exhibit discrete domains of high and low recombination, as in C. elegans, indicating these are a general feature of Caenorhabditis species. The proportion of a chromosome's physical size occupied by the central, low-recombination domain is highly correlated between species. However, the C. briggsae intra-species comparison reveals striking variation in the distribution of recombination between domains. Hybrid lines made with the more divergent pair of strains also exhibit pervasive marker transmission ratio distortion, evidence of selection acting on hybrid genotypes. The strongest effect, on chromosome III, is explained by a developmental delay phenotype exhibited by some hybrid F2 animals. In addition, on chromosomes IV and V, cross direction-specific biases towards one parental genotype suggest the existence of cytonuclear epistatic interactions. These interactions are discussed in relation to surprising mitochondrial genome polymorphism in C. briggsae, evidence that the two strains diverged in allopatry, the potential for local adaptation, and the evolution of Dobzhansky-Muller incompatibilities. The genetic and genomic resources resulting from this work will support future efforts to understand inter-strain divergence as well as facilitate studies of gene function, natural variation, and the evolution of recombination in Caenorhabditis nematodes

    Using graph theory to analyze biological networks

    Get PDF
    Understanding complex systems often requires a bottom-up analysis towards a systems biology approach. The need to investigate a system, not only as individual components but as a whole, emerges. This can be done by examining the elementary constituents individually and then how these are connected. The myriad components of a system and their interactions are best characterized as networks and they are mainly represented as graphs where thousands of nodes are connected with thousands of vertices. In this article we demonstrate approaches, models and methods from the graph theory universe and we discuss ways in which they can be used to reveal hidden properties and features of a network. This network profiling combined with knowledge extraction will help us to better understand the biological significance of the system

    Mourning and melancholia revisited: correspondences between principles of Freudian metapsychology and empirical findings in neuropsychiatry

    Get PDF
    Freud began his career as a neurologist studying the anatomy and physiology of the nervous system, but it was his later work in psychology that would secure his place in history. This paper draws attention to consistencies between physiological processes identified by modern clinical research and psychological processes described by Freud, with a special emphasis on his famous paper on depression entitled 'Mourning and melancholia'. Inspired by neuroimaging findings in depression and deep brain stimulation for treatment resistant depression, some preliminary physiological correlates are proposed for a number of key psychoanalytic processes. Specifically, activation of the subgenual cingulate is discussed in relation to repression and the default mode network is discussed in relation to the ego. If these correlates are found to be reliable, this may have implications for the manner in which psychoanalysis is viewed by the wider psychological and psychiatric communities
    corecore