24 research outputs found

    Prediction of evolutionarily conserved interologs in Mus musculus

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Identification of protein-protein interactions is an important first step to understand living systems. High-throughput experimental approaches have accumulated large amount of information on protein-protein interactions in human and other model organisms. Such interaction information has been successfully transferred to other species, in which the experimental data are limited. However, the annotation transfer method could yield false positive interologs due to the lack of conservation of interactions when applied to phylogenetically distant organisms.</p> <p>Results</p> <p>To address this issue, we used phylogenetic profile method to filter false positives in interologs based on the notion that evolutionary conserved interactions show similar patterns of occurrence along the genomes. The approach was applied to <it>Mus musculus</it>, in which the experimentally identified interactions are limited. We first inferred the protein-protein interactions in <it>Mus musculus </it>by using two approaches: i) identifying mouse orthologs of interacting proteins (interologs) based on the experimental protein-protein interaction data from other organisms; and ii) analyzing frequency of mouse ortholog co-occurrence in predicted operons of bacteria. We then filtered possible false-positives in the predicted interactions using the phylogenetic profiles. We found that this filtering method significantly increased the frequency of interacting protein-pairs coexpressed in the same cells/tissues in gene expression omnibus (GEO) database as well as the frequency of interacting protein-pairs shared the similar Gene Ontology (GO) terms for biological processes and cellular localizations. The data supports the notion that phylogenetic profile helps to reduce the number of false positives in interologs.</p> <p>Conclusion</p> <p>We have developed protein-protein interaction database in mouse, which contains 41109 interologs. We have also developed a web interface to facilitate the use of database <url>http://lgsun.grc.nia.nih.gov/mppi/</url>.</p

    Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray

    Get PDF
    The ability to quantitatively measure the expression of all genes in a given tissue or cell with a single assay is an exciting promise of gene-expression profiling technology. An in situ-synthesized 60-mer oligonucleotide microarray designed to detect transcripts from all mouse genes was validated, as well as a set of exogenous RNA controls derived from the yeast genome (made freely available without restriction), which allow quantitative estimation of absolute endogenous transcript abundance

    Transcriptome Analysis of Mouse Stem Cells and Early Embryos

    Get PDF
    Understanding and harnessing cellular potency are fundamental in biology and are also critical to the future therapeutic use of stem cells. Transcriptome analysis of these pluripotent cells is a first step towards such goals. Starting with sources that include oocytes, blastocysts, and embryonic and adult stem cells, we obtained 249,200 high-quality EST sequences and clustered them with public sequences to produce an index of approximately 30,000 total mouse genes that includes 977 previously unidentified genes. Analysis of gene expression levels by EST frequency identifies genes that characterize preimplantation embryos, embryonic stem cells, and adult stem cells, thus providing potential markers as well as clues to the functional features of these cells. Principal component analysis identified a set of 88 genes whose average expression levels decrease from oocytes to blastocysts, stem cells, postimplantation embryos, and finally to newborn tissues. This can be a first step towards a possible definition of a molecular scale of cellular potency. The sequences and cDNA clones recovered in this work provide a comprehensive resource for genes functioning in early mouse embryos and stem cells. The nonrestricted community access to the resource can accelerate a wide range of research, particularly in reproductive and regenerative medicine

    BIOINFORMATICS APPLICATIONS NOTE

    No full text
    Vol. 21 no. 10 2005, pages 2548–2549 doi:10.1093/bioinformatics/bti343 A web-based tool for principal component and significance analysis of microarray dat

    CircInteractome: A web tool for exploring circular RNAs and their interacting proteins and microRNAs

    No full text
    <p>Circular RNAs (circRNAs) are widely expressed in animal cells, but their biogenesis and functions are poorly understood. CircRNAs have been shown to act as sponges for miRNAs and may also potentially sponge RNA-binding proteins (RBPs) and are thus predicted to function as robust posttranscriptional regulators of gene expression. The joint analysis of large-scale transcriptome data coupled with computational analyses represents a powerful approach to elucidate possible biological roles of ribonucleoprotein (RNP) complexes. Here, we present a new web tool, CircInteractome (circRNA interactome), for mapping RBP- and miRNA-binding sites on human circRNAs. CircInteractome searches public circRNA, miRNA, and RBP databases to provide bioinformatic analyses of binding sites on circRNAs and additionally analyzes miRNA and RBP sites on junction and junction-flanking sequences. CircInteractome also allows the user the ability to (1) identify potential circRNAs which can act as RBP sponges, (2) design junction-spanning primers for specific detection of circRNAs of interest, (3) design siRNAs for circRNA silencing, and (4) identify potential internal ribosomal entry sites (IRES). In sum, the web tool CircInteractome, freely accessible at http://circinteractome.nia.nih.gov, facilitates the analysis of circRNAs and circRNP biology.</p
    corecore