18 research outputs found

    Genome SEGE: A database for 'intronless' genes in eukaryotic genomes

    Get PDF
    BACKGROUND: A number of completely sequenced eukaryotic genome data are available in the public domain. Eukaryotic genes are either 'intron containing' or 'intronless'. Eukaryotic 'intronless' genes are interesting datasets for comparative genomics and evolutionary studies. The SEGE database containing a collection of eukaryotic single exon genes is available. However, SEGE is derived using GenBank. The redundant, incomplete and heterogeneous qualities of GenBank data are a bottleneck for biological investigation in comparative genomics and evolutionary studies. Such studies often require representative gene sets from each genome and this is possible only by deriving specific datasets from completely sequenced genome data. Thus Genome SEGE, a database for 'intronless' genes in completely sequenced eukaryotic genomes, has been constructed. Availability: DESCRIPTION: Eukaryotic 'intronless' genes are extracted from nine completely sequenced genomes (four of which are unicellular and five of which are multi-cellular). The complete dataset is available for download. Data subsets are also available for 'intronless' pseudo-genes. The database provides information on the distribution of 'intronless' genes in different genomes together with their length distributions in each genome. Additionally, the search tool provides pre-computed PROSITE motifs for each sequence in the database with appropriate hyperlinks to InterPro. A search facility is also available through the web server. CONCLUSIONS: The unique features that distinguish Genome SEGE from SEGE is the service providing representative 'intronless' datasets for completely sequenced genomes. 'Intronless' gene sets available in this database will be of use for subsequent bio-computational analysis in comparative genomics and evolutionary studies. Such analysis may help to revisit the original genome data for re-examination and re-annotation

    Bioinformatics models in drug abuse and Neuro-AIDS: Using and developing databases

    Get PDF
    The magnitude of the problems of drug abuse and Neuro-AIDS warrants the development of novel approaches for testing hypotheses in diagnosis and treatment ranging from cell culture models to developing databases. In this study, cultured neurons were treated with/without HIV-TAT, ENV, or cocaine in a 2x2x2 expression study design. RNA was purified, labeled, and expression data were produced and analyzed using ANOVA. Thus, we identified 35 genes that were significantly expressed across treatment conditions. A diagram is presented showing examples of molecular relationships involving a significantly expressed gene in the current study (SOX2). Also, we use this information to discuss examples of gene expression interactions as a means to portray significance and complexity of gene expression studies in Drug Abuse and Neuro-AIDS. Furthermore, we discuss here that critical interactions remain undetected, which may be unravelled by developing robust database systems containing large datasets and gleaned information from collaborating scientists . Hence, we are developing a public domain database we named The Agora database , that will served as a shared infrastructure to query, deposit, and review information related to drug abuse and dementias including Neuro-AIDS. A workflow of this database is also outlined in this paper
    corecore