20 research outputs found

    Uncovering the expression patterns of chimeric transcripts using surveys of affymetrix GeneChips.

    Get PDF
    BACKGROUND: A chimeric transcript is a single RNA sequence which results from the transcription of two adjacent genes. Recent studies estimate that at least 4% of tandem human gene pairs may form chimeric transcripts. Affymetrix GeneChip data are used to study the expression patterns of tens of thousands of genes and the probe sequences used in these microarrays can potentially map to exotic RNA sequences such as chimeras. RESULTS: We have studied human chimeras and investigated their expression patterns using large surveys of Affymetrix microarray data obtained from the Gene Expression Omnibus. We show that for six probe sets, a unique probe mapping to a transcript produced by one of the adjacent genes can be used to identify the expression patterns of readthrough transcripts. Furthermore, unique probes mapping to an intergenic exon present only in the MASK-BP3 chimera can be used directly to study the expression levels of this transcript. CONCLUSIONS: We have attempted to implement a new method for identifying tandem chimerism. In this analysis unambiguous probes are needed to measure run-off transcription and probes that map to intergenic exons are particularly valuable for identifying the expression of chimeras

    Using surveys of Affymetrix GeneChips to study antisense expression.

    Get PDF
    We have used large surveys of Affymetrix GeneChip data in the public domain to conduct a study of antisense expression across diverse conditions. We derive correlations between groups of probes which map uniquely to the same exon in the antisense direction. When there are no probes assigned to an exon in the sense direction we find that many of the antisense groups fail to detect a coherent block of transcription. We find that only a minority of these groups contain coherent blocks of antisense expression suggesting transcription. We also derive correlations between groups of probes which map uniquely to the same exon in both sense and antisense direction. In some of these cases the locations of sense probes overlap with the antisense probes, and the sense and antisense probe intensities are correlated with each other. This configuration suggests the existence of a Natural Antisense Transcript (NAT) pair. We find the majority of such NAT pairs detected by GeneChips are formed by a transcript of an established gene and either an EST or an mRNA. In order to determine the exact antisense regulatory mechanism indicated by the correlation of sense probes with antisense probes, a further investigation is necessary for every particular case of interest. However, the analysis of microarray data has proved to be a good method to reconfirm known NATs, discover new ones, as well as to notice possible problems in the annotation of antisense transcripts

    Mammalian comparative genomics and epigenomics

    Get PDF
    Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2009.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student submitted PDF version of thesis.Includes bibliographical references.The human genome sequence can be thought of as an instruction manual for our species, written and rewritten over more than a billion of years of evolution. Taking a complete inventory of our genome, dissecting its genes and their functional components, and elucidating how these genes are selectively used to establish and maintain cell types with markedly different behaviors, are key challenges of modern biology. In this thesis we present contributions to our understanding of the structure, function and evolution of the human genome. We rely on two complementary approaches. First, we study signatures of evolutionary processes that have acted on the genome using comparative sequence analysis. We generate high quality draft genome sequences of the chimpanzee, the dog and the opossum. These species share a last common ancestor with humans approximately 6 million, 80 million and 140 million years ago, respectively, and therefore provide distinct perspectives on our evolutionary history. We apply computational methods to explore the functional organization of the genome and to identify genes that contribute to shared and species-specific traits. Second, we study how the genome is bound by proteins and packaged into chromatin in distinct cell types. We develop new methods to map protein-DNA interactions and DNA methylation using single-molecule based sequencing technology. We apply these methods to identify new functional sequence elements based on characteristic chromatin signatures, and to explore the relationship between DNA sequence, chromatin and cellular state.by Tarjei Sigurd Mikkelsen.Ph.D
    corecore