research

Evaluation of diversity, specialization, and gene specificity in transcriptomes

Abstract

The transcriptome is a set of genes transcribed in a given tissue under specific conditions and can be characterized by a list of genes with their corresponding frequencies of transcription. Transcriptome changes can be measured by counting gene tags from mRNA libraries or by measuring light signals in DNA microarrays. Recently we proposed an approach to define and estimate the diversity and specialization of transcriptomes and gene specificity. This approach can be useful for the determination and measure of transcriptional networks. We defined transcriptome diversity as the Shannon entropy of its frequency distribution. Gene specificity is defined as the mutual information between the tissues and the corresponding transcript, allowing detection of either housekeeping or highly specific genes and clarifying the meaning of these concepts in the literature. Tissue specialization is measured by average gene specificity. Visualization of the positions of transcriptomes in a system of diversity and specialization coordinates makes it possible to understand at a glance their interrelations, summarizing in a powerful way which transcriptomes are richer in diversity of expressed genes, or which are relatively more specialized. This enlightens the relation among transcriptomes, allowing a better understanding of their changes through the development of the organism or in response to environmental stimuli. We present statistical tools based on resampling procedures to obtain confidence intervals for the parameters as well as perform statistical test. These approaches are illustrated with a human dataset

    Similar works