Joint Clustering and Registration of Functional Data

Author: Telesca Donatello
Zhang Yafeng
Publication date: 01/01/2014

Curve registration and clustering are fundamental tools in the analysis of functional data. While several methods have been developed and explored for either task individually, limited work has been done to infer functional clusters and register curves simultaneously. We propose a hierarchical model for joint curve clustering and registration. Our proposal combines a Dirichlet process mixture model for clustering of common shapes, with a reproducing kernel representation of phase variability for registration. We show how inference can be carried out applying standard posterior simulation algorithms and compare our method to several alternatives in both engineered data and a benchmark analysis of the Berkeley growth data. We conclude our investigation with an application to time course gene expression

arXiv.org e-Print Archive

CiteSeerX

eScholarship - University of California

Noise resistant generalized parametric validity index of clustering for gene expression data

Author: Fa R
Nandi AK
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/07/2014

This article has been made available through the Brunel Open Access Publishing Fund. Validity indices have been investigated for decades. However, since there is no study of noise-resistance performance of these indices in the literature, there is no guideline for determining the best clustering in noisy data sets, especially microarray data sets. In this paper, we propose a generalized parametric validity (GPV) index which employs two tunable parameters α and β to control the proportions of objects being considered to calculate the dissimilarities. The greatest advantage of the proposed GPV index is its noise-resistance ability, which results from the flexibility of tuning the parameters. Several rules are set to guide the selection of parameter values. To illustrate the noise-resistance performance of the proposed index, we evaluate the GPV index for assessing five clustering algorithms in two gene expression data simulation models with different noise levels and compare the ability of determining the number of clusters with eight existing indices. We also test the GPV in three groups of real gene expression data sets. The experimental results suggest that the proposed GPV index has superior noise-resistance ability and provides fairly accurate judgements

Crossref

Brunel University Research Archive

Predicting protein function with hierarchical phylogenetic profiles: The Gene3D phylo-tuner method applied to eukaryotic Genomes

Author: Grant A
Orengo CA
Ranea JAG
Yeats C
Publication venue: PUBLIC LIBRARY SCIENCE
Publication date: 30/11/2007

"Phylogenetic profiling" is based on the hypothesis that during evolution functionally or physically interacting genes are likely to be inherited or eliminated in a codependent manner. Creating presence-absence profiles of orthologous genes is now a common and powerful way of identifying functionally associated genes. In this approach, correctly determining orthology, as a means of identifying functional equivalence between two genes, is a critical and nontrivial step and largely explains why previous work in this area has mainly focused on using presence-absence profiles in prokaryotic species. Here, we demonstrate that eukaryotic genomes have a high proportion of multigene families whose phylogenetic profile distributions are poor in presence-absence information content. This feature makes them prone to orthology mis-assignment and unsuited to standard profile-based prediction methods. Using CATH structural domain assignments from the Gene3D database for 13 complete eukaryotic genomes, we have developed a novel modification of the phylogenetic profiling method that uses genome copy number of each domain superfamily to predict functional relationships. In our approach, superfamilies are subclustered at ten levels of sequence identity, from 30% to 100%, and phylogenetic profiles are built at each level. All the profiles are compared using normalised Euclidean distances to identify those with correlated changes in their domain copy number. We demonstrate that two protein families will "auto-tune" with strong co-evolutionary signals when their profiles are compared at the similarity levels that capture their functional relationship. Our method finds functional relationships that are not detectable by the conventional presence-absence profile comparisons, and it does not require a priori any fixed criteria to define orthologous genes

UCL Discovery
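
The Gene3D phylo-tuner abstract above compares domain copy-number profiles with normalised Euclidean distances. The abstract does not say which normalisation is used, so the sketch below assumes a z-score across genomes before taking the distance. The function names `zscore` and `normalised_euclidean` and the toy profiles are hypothetical and are not taken from the paper.

```python
import math


def zscore(profile):
    """Scale a copy-number profile to zero mean and unit variance.

    Assumption: z-scoring is one plausible reading of "normalised";
    the abstract does not specify the scheme.
    """
    n = len(profile)
    mean = sum(profile) / n
    var = sum((x - mean) ** 2 for x in profile) / n
    if var == 0:
        # A flat profile carries no co-variation signal.
        return [0.0] * n
    sd = math.sqrt(var)
    return [(x - mean) / sd for x in profile]


def normalised_euclidean(p, q):
    """Euclidean distance between two z-scored profiles of equal length."""
    if len(p) != len(q):
        raise ValueError("profiles must cover the same genomes")
    zp, zq = zscore(p), zscore(q)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(zp, zq)))


# Toy copy numbers for three hypothetical superfamilies across four genomes.
family_a = [1, 2, 3, 4]
family_b = [2, 4, 6, 8]   # scales with family_a
family_c = [4, 3, 2, 1]   # changes in the opposite direction
```

Under this assumed normalisation, profiles whose copy numbers rise and fall together across genomes give a distance near zero, while anti-correlated profiles give the largest distance possible for that number of genomes.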

Spatial gene expression quantification in changing morphologies

Author: Botman D.
Publication date: 01/01/2016

International Migration, Integration and Social Cohesion online publications

Dissecting the sharp response of a canonical developmental enhancer reveals multiple sources of cooperativity.

Author: Bragdon Meghan Dj
Cha Anna
DePace Angela H
Estrada Javier
Gunawardena Jeremy
Johnson Gemma
Park Jeehae
Ricci-Tam Chiara
Shulgina Yekaterina
Vincent Ben J
Wunderlich Zeba
Publication venue: eScholarship, University of California
Publication date: 01/06/2019

Developmental enhancers integrate graded concentrations of transcription factors (TFs) to create sharp gene expression boundaries. Here we examine the hunchback P2 (HbP2) enhancer, which drives a sharp expression pattern in the Drosophila blastoderm embryo in response to the transcriptional activator Bicoid (Bcd). We systematically interrogate cis and trans factors that influence the shape and position of expression driven by HbP2, and find that the prevailing model, based on pairwise cooperative binding of Bcd to HbP2, is not adequate. We demonstrate that other proteins, such as pioneer factors, Mediator, and histone modifiers, influence the shape and position of the HbP2 expression pattern. Comparing our results to theory reveals how higher-order cooperativity and energy expenditure impact boundary location and sharpness. Our results emphasize that the bacterial view of transcription regulation, where pairwise interactions between regulatory proteins dominate, must be reexamined in animals, where multiple molecular mechanisms collaborate to shape the gene regulatory function

eScholarship - University of California

A NOVEL COMPUTATIONAL FRAMEWORK FOR TRANSCRIPTOME ANALYSIS WITH RNA-SEQ DATA

Author: Hu Yin
Publication venue: UKnowledge
Publication date: 01/01/2013

Advances in high-throughput sequencing technologies and their application to mRNA transcriptome sequencing (RNA-seq) have enabled comprehensive and unbiased profiling of the landscape of transcription in a cell. To address current limitations in the accuracy and scalability of transcriptome analysis, a novel computational framework has been developed for large-scale RNA-seq datasets with no dependence on transcript annotations. Directly from raw reads, a probabilistic approach is first applied to infer the best transcript fragment alignments from paired-end reads. Empowered by the identification of alternative splicing modules, this framework then performs precise and efficient differential analysis at automatically detected alternative splicing variants, which circumvents the need for full transcript reconstruction and quantification. Beyond the scope of classical group-wise analysis, a clustering scheme is further described for mining prominent consistency among samples in transcription, breaking the restriction of presumed grouping. The performance of the framework has been demonstrated by a series of simulation studies and real datasets, including the Cancer Genome Atlas (TCGA) breast cancer analysis. The successful applications suggest an unprecedented opportunity to use differential transcription analysis to reveal variations in the mRNA transcriptome in response to cellular differentiation or effects of diseases

University of Kentucky

Development of network inference among LexA/RecA-dependent manner genes in the SOS response

Author: Sachiyo Aburatani
Publication venue: PAGEPress Publications
Publication date: 01/01/2010

Crossref

Nucleosome-mediated cooperativity between transcription factors

Author: L. A. Mirny
Publication date: 16/01/2009

Cooperative binding of transcription factors (TFs) to cis-regulatory regions (CRRs) is essential for precision of gene expression in development and other processes. The classical model of cooperativity requires direct interactions between TFs, thus constraining the arrangement of TF sites in a CRR. On the contrary, genomic and functional studies demonstrate a great deal of flexibility in such arrangements, with variable distances, numbers of sites, and identities of the involved TFs. Such flexibility is inconsistent with cooperativity by direct interactions between TFs. Here we demonstrate that strong cooperativity among non-interacting TFs can be achieved by their competition with nucleosomes. We find that the mechanism of nucleosome-mediated cooperativity is mathematically identical to the Monod-Wyman-Changeux (MWC) model of cooperativity in hemoglobin. This surprising parallel provides deep insights, with parallels between heterotropic regulation of hemoglobin (e.g. Bohr effect) and roles of nucleosome-positioning sequences and chromatin modifications in gene regulation. The characterized mechanism is consistent with numerous experimental results, allows substantial flexibility in and modularity of CRRs, and provides a rationale for a broad range of genomic and evolutionary observations. Striking parallels between cooperativity in hemoglobin and in transcription regulation point at a new design principle that may be used in a range of biological systems

arXiv.org e-Print Archive
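
The nucleosome-mediated cooperativity abstract above states that the mechanism is mathematically identical to the Monod-Wyman-Changeux (MWC) model. The sketch below evaluates the textbook MWC fractional-occupancy formula. Mapping the nucleosome-bound region to the low-affinity state and the nucleosome-free region to the high-affinity state is an illustrative assumption, and the function name `mwc_occupancy`, its parameter names, and the example values are hypothetical rather than the paper's notation.

```python
def mwc_occupancy(alpha, n, L, c):
    """Fractional site occupancy in the textbook MWC two-state model.

    alpha: ligand (here: TF) concentration divided by the dissociation
           constant of the high-affinity state.
    n:     number of binding sites in the cooperative unit.
    L:     equilibrium ratio of low-affinity to high-affinity state with
           no ligand bound (assumed here: nucleosome-bound vs. free DNA).
    c:     ratio of high-affinity to low-affinity dissociation constants;
           c < 1 means binding is weaker in the low-affinity state.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    numerator = (L * c * alpha * (1 + c * alpha) ** (n - 1)
                 + alpha * (1 + alpha) ** (n - 1))
    denominator = L * (1 + c * alpha) ** n + (1 + alpha) ** n
    return numerator / denominator


# Illustrative values only: with L = 0 the low-affinity state is never
# populated and the curve reduces to simple independent binding.
independent = [mwc_occupancy(a, 4, 0.0, 0.01) for a in (0.5, 1.0, 2.0)]
cooperative = [mwc_occupancy(a, 4, 1000.0, 0.01) for a in (0.5, 1.0, 2.0)]
```

With a large `L` and a small `c`, occupancy stays low at small `alpha` and rises steeply once enough sites are bound to favour the high-affinity state, which is the sigmoidal behaviour the MWC model is known for.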