493,337 research outputs found

    SIG-DB: leveraging homomorphic encryption to Securely Interrogate privately held Genomic DataBases

    Full text link
    Genomic data are becoming increasingly valuable as we develop methods to utilize the information at scale and gain a greater understanding of how genetic information relates to biological function. Advances in synthetic biology and the decreased cost of sequencing are increasing the amount of privately held genomic data. As the quantity and value of private genomic data grows, so does the incentive to acquire and protect such data, which creates a need to store and process these data securely. We present an algorithm for the Secure Interrogation of Genomic DataBases (SIG-DB). The SIG-DB algorithm enables databases of genomic sequences to be searched with an encrypted query sequence without revealing the query sequence to the Database Owner or any of the database sequences to the Querier. SIG-DB is the first application of its kind to take advantage of locality-sensitive hashing and homomorphic encryption to allow generalized sequence-to-sequence comparisons of genomic data.Comment: 38 pages, 3 figures, 4 tables, 1 supplemental table, 7 supplemental figure

    The RNA Ontology (RNAO): An ontology for integrating RNA sequence and structure data

    Get PDF
    Biomedical Ontologies are intended to integrate diverse biomedical data to enable intelligent data-mining and facilitate translation of basic research into useful clinical knowledge. We present the first version of RNAO, an ontology for integrating RNA 3D structural, biochemical and sequence data. While each 3D data file depicts the structure of a specific molecule, such data have broader significance as representatives of classes of homologous molecules, which, while differing in sequence, generally share core structural features of functional importance. Thus, 3D structure data gain value by being linked to homologous sequences in genomic data and databases of sequence alignments. Likewise genomic data can increase in value by annotation of shared structural features, especially when these can be linked to specific functions. The RNAO is being developed in line with the developing standards of the Open Biomedical Ontologies (OBO) Consortium

    The Genomic HyperBrowser: inferential genomics at the sequence level

    Get PDF
    The immense increase in the generation of genomic scale data poses an unmet analytical challenge, due to a lack of established methodology with the required flexibility and power. We propose a first principled approach to statistical analysis of sequence-level genomic information. We provide a growing collection of generic biological investigations that query pairwise relations between tracks, represented as mathematical objects, along the genome. The Genomic HyperBrowser implements the approach and is available at http://hyperbrowser.uio.no

    WormBase: a multi-species resource for nematode biology and genomics

    Get PDF
    WormBase (http://www.wormbase.org/) is the central data repository for information about Caenorhabditis elegans and related nematodes. As a model organism database, WormBase extends beyond the genomic sequence, integrating experimental results with extensively annotated views of the genome. The WormBase Consortium continues to expand the biological scope and utility of WormBase with the inclusion of large-scale genomic analyses, through active data and literature curation, through new analysis and visualization tools, and through refinement of the user interface. Over the past year, the nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species. Extensive site-wide refinement of the user interface now provides quick access to the most frequently accessed resources and a consistent browsing experience across the site. Unified single-page views now provide complete summaries of commonly accessed entries like genes. These advances continue to increase the utility of WormBase for C.elegans researchers, as well as for those researchers exploring problems in functional and comparative genomics in the context of a powerful genetic system

    SpBase: the sea urchin genome database and web site

    Get PDF
    SpBase is a system of databases focused on the genomic information from sea urchins and related echinoderms. It is exposed to the public through a web site served with open source software (http://spbase.org/). The enterprise was undertaken to provide an easily used collection of information to directly support experimental work on these useful research models in cell and developmental biology. The information served from the databases emerges from the draft genomic sequence of the purple sea urchin, Strongylocentrotus purpuratus and includes sequence data and genomic resource descriptions for other members of the echinoderm clade which in total span 540 million years of evolutionary time. This version of the system contains two assemblies of the purple sea urchin genome, associated expressed sequences, gene annotations and accessory resources. Search mechanisms for the sequences and the gene annotations are provided. Because the system is maintained along with the Sea Urchin Genome resource, a database of sequenced clones is also provided

    Genome-wide association study for calving performance using high-density genotypes in dairy and beef cattle

    Get PDF
    peer-reviewedBackground Calving difficulty and perinatal mortality are prevalent in modern-day cattle production systems. It is well-established that there is a genetic component to both traits, yet little is known about their underlying genomic architecture, particularly in beef breeds. Therefore, we performed a genome-wide association study using high-density genotypes to elucidate the genomic architecture of these traits and to identify regions of the bovine genome associated with them. Results Genomic regions associated with calving difficulty (direct and maternal) and perinatal mortality were detected using two statistical approaches: (1) single-SNP (single nucleotide polymorphism) regression and (2) a Bayesian approach. Data included high-density genotypes on 770 Holstein-Friesian, 927 Charolais and 963 Limousin bulls. Several novel or previously identified genomic regions were detected but associations differed by breed. For example, two genomic associations, one each on chromosomes 18 and 2 explained 2.49 % and 3.13 % of the genetic variance in direct calving difficulty in the Holstein-Friesian and Charolais populations, respectively. Imputed Holstein-Friesian sequence data was used to refine the genomic regions responsible for significant associations. Several candidate genes on chromosome 18 were identified and four highly significant missense variants were detected within three of these genes (SIGLEC12, CTU1, and ZNF615). Nevertheless, only CTU1 contained a missense variant with a putative impact on direct calving difficulty based on SIFT (0.06) and Polyphen (0.95) scores. Using imputed sequence data, we refined a genomic region on chromosome 4 associated with maternal calving difficulty in the Holstein-Friesian population and found the strongest association with an intronic variant in the PCLO gene. A meta-analysis was performed across the three breeds for each calving performance trait to identify common variants associated with these traits in the three breeds. Our results suggest that a portion of the genetic variation in calving performance is common to all three breeds. Conclusion The genomic architecture of calving performance is complex and mainly influenced by many polymorphisms of small effect. We identified several associations of moderate effect size but the majority were breed-specific, indicating that breed-specific alleles exist for calving performance or that the linkage phase between genotyped allele and causal mutation varies between breeds
    corecore