1,588 research outputs found

    Shifting Patterns Discovery in Microarrays with Evolutionary Algorithms

    Get PDF
    In recent years, the interest in extracting useful knowledge from gene expression data has experimented an enormous increase with the development of microarray technique. Biclustering is a recent technique that aims at extracting a subset of genes that show a similar behaviour for a subset conditions. It is important, therefore, to measure the quality of a bicluster, and a way to do that would be checking if each data submatrix follows a specific trend, represented by a pattern. In this work, we present an evolutionary algorithm for finding significant shifting patterns which depict the general behaviour within each bicluster. The empirical results we have obtained confirm the quality of our proposal, obtaining very accurate solutions for the biclusters used.Comisión Interministerial de Ciencia y Tecnología (CICYT) TIN2004-00159Comisión Interministerial de Ciencia y Tecnología (CICYT) TIN2004-06689C030

    Analysis of circadian pattern reveals tissue-specific alternative transcription in leptin signaling pathway

    Get PDF
    *Background*
It has been previously reported that most mammalian genes display a circadian oscillation in their baseline expression. Consequently, the phase and amplitude of each component of a signal transduction cascade has downstream consequences. 

*Results*
We report our analysis of alternative transcripts in the leptin signaling pathway which is responsible for the systemic regulation of macronutrient storage and energy balance. We focused on the circadian expression pattern of a critical component of the leptin signaling system, suppressor of cytokine signaling 3 (SOCS3). On an Affymetrix GeneChip 430A2 microarray, this gene is represented by three probe sets targeting different regions within the 3’ end of the last exon. We demonstrate that in murine brown adipose tissue two downstream 3’ probe sets experience circadian baseline oscillation in counter-phase to the upstream probe set. Such differences in expression patterns are a telltale sign of alternative splicing within the last exon of SOCS3. In contrast, all three probe sets oscillated in a common phase in murine liver and white adipose tissue. This suggests that the regulation of SOCS3 expression in brown fat is tissue specific. Another component of the signaling pathway, Janus kinase (JAK), is directly regulated by SOCS and has alternative transcript probe sets oscillating in counter-phase in a white adipose tissue specific manner.
 
*Conclusion*
We hypothesize that differential oscillation of alternative transcripts may provide a mechanism to maintain steady levels of expression in spite of circadian baseline variation

    Configurable Pattern-based Evolutionary Biclustering of Gene Expression Data

    Get PDF
    BACKGROUND: Biclustering algorithms for microarray data aim at discovering functionally related gene sets under different subsets of experimental conditions. Due to the problem complexity and the characteristics of microarray datasets, heuristic searches are usually used instead of exhaustive algorithms. Also, the comparison among different techniques is still a challenge. The obtained results vary in relevant features such as the number of genes or conditions, which makes it difficult to carry out a fair comparison. Moreover, existing approaches do not allow the user to specify any preferences on these properties. RESULTS: Here, we present the first biclustering algorithm in which it is possible to particularize several biclusters features in terms of different objectives. This can be done by tuning the specified features in the algorithm or also by incorporating new objectives into the search. Furthermore, our approach bases the bicluster evaluation in the use of expression patterns, being able to recognize both shifting and scaling patterns either simultaneously or not. Evolutionary computation has been chosen as the search strategy, naming thus our proposal Evo-Bexpa (Evolutionary Biclustering based in Expression Patterns). CONCLUSIONS: We have conducted experiments on both synthetic and real datasets demonstrating Evo-Bexpa abilities to obtain meaningful biclusters. Synthetic experiments have been designed in order to compare Evo-Bexpa performance with other approaches when looking for perfect patterns. Experiments with four different real datasets also confirm the proper performing of our algorithm, whose results have been biologically validated through Gene Ontology

    Techniques for clustering gene expression data

    Get PDF
    Many clustering techniques have been proposed for the analysis of gene expression data obtained from microarray experiments. However, choice of suitable method(s) for a given experimental dataset is not straightforward. Common approaches do not translate well and fail to take account of the data profile. This review paper surveys state of the art applications which recognises these limitations and implements procedures to overcome them. It provides a framework for the evaluation of clustering in gene expression analyses. The nature of microarray data is discussed briefly. Selected examples are presented for the clustering methods considered

    Unsupervised Algorithms for Microarray Sample Stratification

    Get PDF
    The amount of data made available by microarrays gives researchers the opportunity to delve into the complexity of biological systems. However, the noisy and extremely high-dimensional nature of this kind of data poses significant challenges. Microarrays allow for the parallel measurement of thousands of molecular objects spanning different layers of interactions. In order to be able to discover hidden patterns, the most disparate analytical techniques have been proposed. Here, we describe the basic methodologies to approach the analysis of microarray datasets that focus on the task of (sub)group discovery.Peer reviewe

    Probabilities of spurious connections in gene networks: Application to expression time series

    Full text link
    Motivation: The reconstruction of gene networks from gene expression microarrays is gaining popularity as methods improve and as more data become available. The reliability of such networks could be judged by the probability that a connection between genes is spurious, resulting from chance fluctuations rather than from a true biological relationship. Results: Unlike the false discovery rate and positive false discovery rate, the decisive false discovery rate (dFDR) is exactly equal to a conditional probability without assuming independence or the randomness of hypothesis truth values. This property is useful not only in the common application to the detection of differential gene expression, but also in determining the probability of a spurious connection in a reconstructed gene network. Estimators of the dFDR can estimate each of three probabilities: 1. The probability that two genes that appear to be associated with each other lack such association. 2. The probability that a time ordering observed for two associated genes is misleading. 3. The probability that a time ordering observed for two genes is misleading, either because they are not associated or because they are associated without a lag in time. The first probability applies to both static and dynamic gene networks, and the other two only apply to dynamic gene networks. Availability: Cross-platform software for network reconstruction, probability estimation, and plotting is free from http://www.davidbickel.com as R functions and a Java application.Comment: Like q-bio.GN/0404032, this was rejected in March 2004 because it was submitted to the math archive. The only modification is a corrected reference to q-bio.GN/0404032, which was not modified at al

    MSL: A Measure to Evaluate Three-dimensional Patterns in Gene Expression Data

    Get PDF
    Microarray technology is highly used in biological research environments due to its ability to monitor the RNA concentration levels. The analysis of the data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are widely applied to create groups of genes that exhibit a similar behavior. Biclustering relaxes the constraints for grouping, allowing genes to be evaluated only under a subset of the conditions. Triclustering appears for the analysis of longitudinal experiments in which the genes are evaluated under certain conditions at several time points. These triclusters provide hidden information in the form of behavior patterns from temporal experiments with microarrays relating subsets of genes, experimental conditions, and time points. We present an evaluation measure for triclusters called Multi Slope Measure, based on the similarity among the angles of the slopes formed by each profile formed by the genes, conditions, and times of the triclusterMinisterio de Ciencia y Tecnología TIN2011-28956-C02-02Junta de Andalucía TIC-752
    corecore