5 research outputs found

    Yeast gene CMR1/YDL156W is consistently co-expressed with genes participating in DNA-metabolic processes in a variety of stringent clustering experiments

    Get PDF
    © 2013 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/3.0/, which permits unrestricted use, provided the original author and source are credited.The binarization of consensus partition matrices (Bi-CoPaM) method has, among its unique features, the ability to perform ensemble clustering over the same set of genes from multiple microarray datasets by using various clustering methods in order to generate tunable tight clusters. Therefore, we have used the Bi-CoPaM method to the most synchronized 500 cell-cycle-regulated yeast genes from different microarray datasets to produce four tight, specific and exclusive clusters of co-expressed genes. We found 19 genes formed the tightest of the four clusters and this included the gene CMR1/YDL156W, which was an uncharacterized gene at the time of our investigations. Two very recent proteomic and biochemical studies have independently revealed many facets of CMR1 protein, although the precise functions of the protein remain to be elucidated. Our computational results complement these biological results and add more evidence to their recent findings of CMR1 as potentially participating in many of the DNA-metabolism processes such as replication, repair and transcription. Interestingly, our results demonstrate the close co-expressions of CMR1 and the replication protein A (RPA), the cohesion complex and the DNA polymerases α, δ and ɛ, as well as suggest functional relationships between CMR1 and the respective proteins. In addition, the analysis provides further substantial evidence that the expression of the CMR1 gene could be regulated by the MBF complex. In summary, the application of a novel analytic technique in large biological datasets has provided supporting evidence for a gene of previously unknown function, further hypotheses to test, and a more general demonstration of the value of sophisticated methods to explore new large datasets now so readily generated in biological experiments.National Institute for Health Researc

    Paradigm of tunable clustering using binarization of consensus partition matrices (Bi-CoPaM) for gene discovery

    Get PDF
    Copyright @ 2013 Abu-Jamous et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Clustering analysis has a growing role in the study of co-expressed genes for gene discovery. Conventional binary and fuzzy clustering do not embrace the biological reality that some genes may be irrelevant for a problem and not be assigned to a cluster, while other genes may participate in several biological functions and should simultaneously belong to multiple clusters. Also, these algorithms cannot generate tight clusters that focus on their cores or wide clusters that overlap and contain all possibly relevant genes. In this paper, a new clustering paradigm is proposed. In this paradigm, all three eventualities of a gene being exclusively assigned to a single cluster, being assigned to multiple clusters, and being not assigned to any cluster are possible. These possibilities are realised through the primary novelty of the introduction of tunable binarization techniques. Results from multiple clustering experiments are aggregated to generate one fuzzy consensus partition matrix (CoPaM), which is then binarized to obtain the final binary partitions. This is referred to as Binarization of Consensus Partition Matrices (Bi-CoPaM). The method has been tested with a set of synthetic datasets and a set of five real yeast cell-cycle datasets. The results demonstrate its validity in generating relevant tight, wide, and complementary clusters that can meet requirements of different gene discovery studies.National Institute for Health Researc

    Extensions of Independent Component Analysis Mixture Models for classification and prediction of EEG signals

    Full text link
    [EN] This paper presents two applications of Independent Component Analysis Mixture Modeling (ICAMM) for the classification and prediction of data. The first one of these extensions is Sequential ICAMM (SICAMM), an ICAMM structure that takes into account the sequential dependence in the feature record. This algorithm can be used to classify input observations in a given set of mutually-exclusive classes. The performance of SICAMM is tested with simulations and compared against that of the base ICAMM algorithm and of a Dynamic Bayesian Network (DBN). All three methods are also used to classify real electroencephalographic (EEG) signals to compute hypnograms, a clinical tool used to help in the diagnosis of sleep disorders. The second extension of ICAMM is PREDICAMM, an estimation algorithm that makes use of the ICAMM parameters in order to reconstruct missing samples from a set of data. This predictor is used to reconstruct real EEG data from a working memory experiment, and its performance is compared to that of a classical predictor for EEG signals: sphere splines. Prediction performance is measured with four error indicators: signal-to-interference ratio, KullbackLeibler divergence, correlation, and mean structural similarity index. Both extensions of the base ICAMM algorithm have achieved a higher performance than other methodsThis work has been supported by Universitat Politècnica de Valencia under grant 20130072, Generalitat Valenciana under grants PROMETEO/2010/040 and ISIC/2012/006; and Spanish Administration and European Union FEDER Programme under grant TEC2011-23403 01/01/2012. The PSG signals and annotated hypnograms were provided by the Electroencephalography Department of Hospital Universitario La Fe, Valencia, SpainSafont Armero, G.; Salazar Afanador, A.; Rodriguez Martinez, A.; Vergara Domínguez, L. (2013). Extensions of Independent Component Analysis Mixture Models for classification and prediction of EEG signals. WAVES. 5:59-68. http://hdl.handle.net/10251/52797S5968
    corecore