69 research outputs found

    Personal Genomics, Bioinformatics, and Variomics.

    Get PDF
    In 2008 at least five complete genome sequences are available. It is known that there are over 15,000,000 genetic variants, called SNPs, in the dbSNP database. The cost of full genome sequencing in 2009 is claimed to be less than $5000 USD. The genomics era has arrived in 2008. This review introduces technologies, bioinformatics, genomics visions, and variomics projects. Variomics is the study of the total genetic variation in an individual and populations. Research on genetic variation is the most valuable among many genomics research branches. Genomics and variomics projects will change biology and the society so dramatically that biology will become an everyday technology like personal computers and the internet. 'BioRevolution' is the term that can adequately describe this changeclose

    Biological Object Downloader (BOD) Service for Easy Download and Management of Biological Databases.

    Get PDF
    BOD is an FTP service management tool on the Internet. It was developed for biological researchers in South Korea. It enables easier and faster access of bioinformation without having to go through foreign FTP sites. BOD includes an automatic downloader with a management and email alert service from which the user can easily select and schedule any biological database. Once listed in BOD, the user can check and modify the download status and data from an additional email alert service.Availability:http://ftp.kobic.kr, ftp://ftp.kobic.kr, and http://bioftp.orclose

    MitoVariome: a variome database of human mitochondrial DNA

    Get PDF
    Background: Mitochondrial sequence variation provides critical information for studying human evolution and variation. Mitochondrial DNA provides information on the origin of humans, and plays a substantial role in forensics, degenerative diseases, cancers, and aging process. Typically, human mitochondrial DNA has various features such as HVSI, HVSII, single-nucleotide polymorphism (SNP), restriction enzyme sites, and short tandem repeat (STR). Results: We present a variome database (MitoVariome) of human mitochondrial DNA sequences. Queries against MitoVariome can be made using accession numbers or haplogroup/continent. Query results are presented not only in text but also in HTML tables to report extensive mitochondrial sequence variation information. The variation information includes repeat pattern, restriction enzyme site polymorphism, short tandem repeat, disease information as well as single nucleotide polymorphism. It also provides a graphical interface as Gbrowse displaying all variations at a glance. The web interface also provides the tool for assigning haplogroup based on the haplogroup-diagnostic system with complete human mitochondrial SNP position list and for retrieving sequences that users query against by using accession numbers. Conclusion: MitoVariome is a freely accessible web application and database that enables human mitochondrial genome researchers to study genetic variation in mitochondrial genome with textual and graphical views accompanied by assignment function of haplogrouping if users submit their own data. Hence, the MitoVariome containing many kinds of variation features in the human mitochondrial genome will be useful for understanding mitochondrial variations of each individual, haplogroup, or geographical location to elucidate the history of human evolutionclose81

    SynechoNET: integrated protein-protein interaction database of a model cyanobacterium /Synechocystis/ sp. PCC 6803.

    Get PDF
    Background: Cyanobacteria are model organisms for studying photosynthesis, carbon and nitrogen assimilation, evolution of plant plastids, and adaptability to environmental stresses. Despite many studies on cyanobacteria, there is no web-based database of their regulatory and signaling protein-protein interaction networks to date. Description: We report a database and website SynechoNET that provides predicted protein-protein interactions. SynechoNET shows cyanobacterial domain-domain interactions as well as their protein-level interactions using the model cyanobacterium, Synechocystis sp. PCC 6803. It predicts the protein-protein interactions using public interaction databases that contain mutually complementary and redundant data. Furthermore, SynechoNET provides information on transmembrane topology, signal peptide, and domain structure in order to support the analysis of regulatory membrane proteins. Such biological information can be queried and visualized in user-friendly web interfaces that include the interactive network viewer and search pages by keyword and functional category. Conclusion: SynechoNET is an integrated protein-protein interaction database designed to analyze regulatory membrane proteins in cyanobacteria. It provides a platform for biologists to extend the genomic data of cyanobacteria by predicting interaction partners, membrane association, and membrane topology of Synechocystis proteins. SynechoNET is freely available at http://synechocystis.org/or directly at http://bioportal.kobic.kr/SynechoNET/close128

    PDbase: a database of Parkinson's Disease-related genes and genetic variation using substantia nigra ESTs

    Get PDF
    Background: Parkinson's disease (PD) is one of the most common neurodegenerative disorders, clinically characterized by impaired motor function. Since the etiology of PD is diverse and complex, many researchers have created PD-related research resources. However, resources for brain and PD studies are still lacking. Therefore, we have constructed a database of PD-related gene and genetic variations using the substantia nigra (SN) in PD and normal tissues. In addition, we integrated PD-related information from several resources. Results: We collected the 6,130 SN expressed sequenced tags (ESTs) from brain SN normal tissues and PD patients SN tissues using full-cDNA library and normalized cDNA library construction methods from our previous study. The SN ESTs were clustered in 2,951 unigene clusters and assigned in 2,678 genes. We then found up-regulated 57 genes and down-regulated 48 genes by comparing normal and PD SN ESTs frequencies with over 0.9 cut-off probability of differential expression based on the Audic and Claverie method. In addition, we integrated disease-related information from public resources. To examine the characteristics of these PD-related genes, we analyzed alternative splicing events, single nucleotide polymorphism (SNP) markers located in the gene regions, repeat elements, gene regulation elements, and pathways and protein-protein interaction networks. Conclusion: We constructed the PDbase database to capture the PD-related gene, genetic variation, and functional elements. This database contains 2,698 PD-related genes through ESTs discovered from human normal and PD patients SN tissues, and through integrating several public resources. PDbase provides the mitochondrion proteins, microRNA gene regulation elements, single nucleotide polymorphisms (SNPs) markers within PD-related gene structures, repeat elements, and pathways and networks with protein-protein interaction information. The PDbase information can aid in understanding the causation of PD. It is available at http://bioportal.kobic.re.kr/PDbase/. Supplementary data is available at http://bioportal.kobic.re.kr/PDbase/suppl.jsp. © 2009 Yang et al; licensee BioMed Central Ltdclose

    SNP@Promoter: A database of human SNPs (Single Nucleotide Polymorphisms) within putative promoter region.

    Get PDF
    Background: Analysis of single nucleotide polymorphism (SNP) is becoming a key research in genomics fields. Many functional analyses of SNPs have been carried out for coding regions and splicing sites that can alter proteins and mRNA splicing. However, SNPs in non-coding regulatory regions can also influence important biological regulation. Presently, there are few databases for SNPs in non-coding regulatory regions. Description: We identified 488,452 human SNPs in the putative promoter regions that extended from the +5000 bp to -500 bp region of the transcription start sites. Some SNPs occurring in transcription factor (TF) binding sites were also predicted (47,832 SNP; 9.8%). The result is stored in a database: SNP@promoter. Users can search the SNP@Promoter database using three entries: 1) by SNP identifier (rs number from dbSNP), 2) by gene (gene name, gene symbol, refSeq ID), and 3) by disease term. The SNP@Promoter database provides extensive genetic information and graphical views of queried terms. Conclusion: We present the SNP@Promoter database. It was created in order to predict functional SNPs in putative promoter regions and predicted transcription factor binding sites. SNP@Promoter will help researchers to identify functional SNPs in non-coding regionsclose353

    Welfare Genome Project: A Participatory Korean Personal Genome Project With Free Health Check-Up and Genetic Report Followed by Counseling.

    Get PDF
    The Welfare Genome Project (WGP) provided 1,000 healthy Korean volunteers with detailed genetic and health reports to test the social perception of integrating personal genetic and healthcare data at a large-scale. WGP was launched in 2016 in the Ulsan Metropolitan City as the first large-scale genome project with public participation in Korea. The project produced a set of genetic materials, genotype information, clinical data, and lifestyle survey answers from participants aged 20-96. As compensation, the participants received a free general health check-up on 110 clinical traits, accompanied by a genetic report of their genotypes followed by genetic counseling. In a follow-up survey, 91.0% of the participants indicated that their genetic reports motivated them to improve their health. Overall, WGP expanded not only the general awareness of genomics, DNA sequencing technologies, bioinformatics, and bioethics regulations among all the parties involved, but also the general public's understanding of how genome projects can indirectly benefit their health and lifestyle management. WGP established a data construction framework for not only scientific research but also the welfare of participants. In the future, the WGP framework can help lay the groundwork for a new personalized healthcare system that is seamlessly integrated with existing public medical infrastructure

    Whole genome sequence and analysis of the Marwari horse breed and its genetic origin

    Get PDF
    Background: The horse (Equus ferus caballus) is one of the earliest domesticated species and has played an important role in the development of human societies over the past 5,000 years. In this study, we characterized the genome of the Marwari horse, a rare breed with unique phenotypic characteristics, including inwardly turned ear tips. It is thought to have originated from the crossbreeding of local Indian ponies with Arabian horses beginning in the 12th century. Results: We generated 101 Gb (similar to 30 x coverage) of whole genome sequences from a Marwari horse using the Illumina HiSeq2000 sequencer. The sequences were mapped to the horse reference genome at a mapping rate of similar to 98% and with similar to 95% of the genome having at least 10 x coverage. A total of 5.9 million single nucleotide variations, 0.6 million small insertions or deletions, and 2,569 copy number variation blocks were identified. We confirmed a strong Arabian and Mongolian component in the Marwari genome. Novel variants from the Marwari sequences were annotated, and were found to be enriched in olfactory functions. Additionally, we suggest a potential functional genetic variant in the TSHZ1 gene (p.Ala344>Val) associated with the inward-turning ear tip shape of the Marwari horses. Conclusions: Here, we present an analysis of the Marwari horse genome. This is the first genomic data for an Asian breed, and is an invaluable resource for future studies of genetic variation associated with phenotypes and diseases in horses.open1

    Profiling age-related epigenetic markers of stomach adenocarcinoma in young and old subjects

    Get PDF
    The purpose of our study is to identify epigenetic markers that are differently expressed in the stomach adenocarcinoma (STAD) condition. Based on data from The Cancer Genome Atlas (TCGA), we were able to detect an age-related difference in methylation patterns and changes in gene and miRNA expression levels in young (n = 14) and old (n = 70) STAD subjects. Our analysis identified 323 upregulated and 653 downregulated genes in old STAD subjects. We also found 76 miRNAs with age-related expression patterns and 113 differentially methylated genes (DMGs), respectively. Our further analysis revealed that significant upregulated genes (n = 35) were assigned to the cell cycle, while the muscle system process (n = 27) and cell adhesion-related genes (n = 57) were downregulated. In addition, by comparing gene and miRNA expression with methylation change, we identified that three upregulated genes (ELF3, IL1??, and MMP13) known to be involved in inflammatory responses and cell growth were significantly hypomethylated in the promoter region. We further detected target candidates for age-related, downregulated miRNAs (hsa-mir-124-3, hsa-mir-204, and hsa-mir-125b-2) in old STAD subjects. This is the first report of the results from a study exploring age-related epigenetic biomarkers of STAD using high-throughput data and provides evidence for a complex clinicopathological condition expressed by the age-related STAD progression. © the authors, publisher and licensee Libertas Academica Limitedopen

    COMUS: Clinician-Oriented locus-specific MUtation detection and deposition System

    Get PDF
    Background: A disease-causing mutation refers to a heritable genetic change that is associated with a specific phenotype (disease). The detection of a mutation from a patient's sample is critical for the diagnosis, treatment, and prognosis of the disease. There are numerous databases and applications with which to archive mutation data. However, none of them have been implemented with any automated bioinformatics tools for mutation detection and analysis starting from raw data materials from patients. We present a Locus Specific mutation DB (LSDB) construction system that supports both mutation detection and deposition in one package. Results: COMUS (Clinician-Oriented locus specific MUtation detection and deposition System) is a mutation detection and deposition system for developing specific LSDBs. COMUS contains 1) a DNA sequence mutation analysis method for clinicians' mutation data identification and deposition and 2) a curation system for variation detection from clinicians' input data. To embody the COMUS system and to validate its clinical utility, we have chosen the disease hemophilia as a test database. A set of data files from bench experiments and clinical information from hemophilia patients were tested on the LSDB, KoHemGene http://www.kohemgene.org, which has proven to be a clinician-friendly interface for mutation detection and deposition. Conclusion: COMUS is a bioinformatics system for detecting and depositing new mutations from patient DNA with a clinician-friendly interface. LSDBs made using COMUS will promote the clinical utility of LSDBs. COMUS is available at http://www.comus.info. © 2009 Jho et al; licensee BioMed Central Ltdclose
    • โ€ฆ
    corecore