321 research outputs found

    Focal Spot, Spring 2005

    Get PDF
    https://digitalcommons.wustl.edu/focal_spot_archives/1099/thumbnail.jp

    The Semantic Automated Discovery and Integration (SADI) Web service Design-Pattern, API and Reference Implementation

    Get PDF
    Background. 
The complexity and inter-related nature of biological data poses a difficult challenge for data and tool integration. There has been a proliferation of interoperability standards and projects over the past decade, none of which has been widely adopted by the bioinformatics community. Recent attempts have focused on the use of semantics to assist integration, and Semantic Web technologies are being welcomed by this community.

Description. 
SADI – Semantic Automated Discovery and Integration – is a lightweight set of fully standards-compliant Semantic Web service design patterns that simplify the publication of services of the type commonly found in bioinformatics and other scientific domains. Using Semantic Web technologies at every level of the Web services “stack”, SADI services consume and produce instances of OWL Classes following a small number of very straightforward best-practices. In addition, we provide codebases that support these best-practices, and plug-in tools to popular developer and client software that dramatically simplify deployment of services by providers, and the discovery and utilization of those services by their consumers.

Conclusions.
SADI Services are fully compliant with, and utilize only foundational Web standards; are simple to create and maintain for service providers; and can be discovered and utilized in a very intuitive way by biologist end-users. In addition, the SADI design patterns significantly improve the ability of software to automatically discover appropriate services based on user-needs, and automatically chain these into complex analytical workflows. We show that, when resources are exposed through SADI, data compliant with a given ontological model can be automatically gathered, or generated, from these distributed, non-coordinating resources - a behavior we have not observed in any other Semantic system. Finally, we show that, using SADI, data dynamically generated from Web services can be explored in a manner very similar to data housed in static triple-stores, thus facilitating the intersection of Web services and Semantic Web technologies

    Investigating subsumption in DL-based terminologies: A case study in SNOMED CT

    Get PDF
    Formalisms such as description logics (DL) are sometimes expected to help terminologies ensure compliance with sound ontological principles. The objective of this paper is to study the degree to which one DL-based biomedical terminology (SNOMED CT) complies with such principles. We defined seven ontological principles (for example: each class must have at least one parent, each class must differ from its parent) and examined the properties of SNOMED CT classes with respect to these principles. Our major results are: 31% of the classes have a single child; 27% have multiple parents; 51% do not exhibit any differentiae between the description of the parent and that of the child. The applications of this study to quality assurance for ontologies are discussed and suggestions are made for dealing with multiple inheritance

    GeneKeyDB: A lightweight, gene-centric, relational database to support data mining environments

    Get PDF
    BACKGROUND: The analysis of biological data is greatly enhanced by existing or emerging databases. Most existing databases, with few exceptions are not designed to easily support large scale computational analysis, but rather offer exclusively a web interface to the resource. We have recognized the growing need for a database which can be used successfully as a backend to computational analysis tools and pipelines. Such database should be sufficiently versatile to allow easy system integration. RESULTS: GeneKeyDB is a gene-centered relational database developed to enhance data mining in biological data sets. The system provides an underlying data layer for computational analysis tools and visualization tools. GeneKeyDB relies primarily on existing database identifiers derived from community databases (NCBI, GO, Ensembl, et al.) as well as the known relationships among those identifiers. It is a lightweight, portable, and extensible platform for integration with computational tools and analysis environments. CONCLUSION: GeneKeyDB can enable analysis tools and users to manipulate the intersections, unions, and differences among different data sets

    Drug-gene interactions of antihypertensive medications and risk of incident cardiovascular disease: a pharmacogenomics study from the CHARGE consortium

    Get PDF
    Background Hypertension is a major risk factor for a spectrum of cardiovascular diseases (CVD), including myocardial infarction, sudden death, and stroke. In the US, over 65 million people have high blood pressure and a large proportion of these individuals are prescribed antihypertensive medications. Although large long-term clinical trials conducted in the last several decades have identified a number of effective antihypertensive treatments that reduce the risk of future clinical complications, responses to therapy and protection from cardiovascular events vary among individuals. Methods Using a genome-wide association study among 21,267 participants with pharmaceutically treated hypertension, we explored the hypothesis that genetic variants might influence or modify the effectiveness of common antihypertensive therapies on the risk of major cardiovascular outcomes. The classes of drug treatments included angiotensin-converting enzyme inhibitors, beta-blockers, calcium channel blockers, and diuretics. In the setting of the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, each study performed array-based genome-wide genotyping, imputed to HapMap Phase II reference panels, and used additive genetic models in proportional hazards or logistic regression models to evaluate drug-gene interactions for each of four therapeutic drug classes. We used meta-analysis to combine study-specific interaction estimates for approximately 2 million single nucleotide polymorphisms (SNPs) in a discovery analysis among 15,375 European Ancestry participants (3,527 CVD cases) with targeted follow-up in a case-only study of 1,751 European Ancestry GenHAT participants as well as among 4,141 African-Americans (1,267 CVD cases). Results Although drug-SNP interactions were biologically plausible, exposures and outcomes were well measured, and power was sufficient to detect modest interactions, we did not identify any statistically significant interactions from the four antihypertensive therapy meta-analyses (Pinteraction > 5.0×10−8). Similarly, findings were null for meta-analyses restricted to 66 SNPs with significant main effects on coronary artery disease or blood pressure from large published genome-wide association studies (Pinteraction ≥ 0.01). Our results suggest that there are no major pharmacogenetic influences of common SNPs on the relationship between blood pressure medications and the risk of incident CVD

    France's search for institutional schemes to promote innovation : the case of genomics

    Get PDF
    The subject of this paper is the relationship between the policy-making and the innovation performance in genomics and biomedical related biotechnologies in the national research and innovation system in France in 1990'. The aim is to highlight the relative effectiveness of the different public policies and their instruments compared to the action of the non for profit sector. Government policy has recently supported a development of the biotechnology sector by encouraging start-ups and creating favourable framework conditions such as incubators, a specialised stock exchange, financial institutions or technopoles. By studying the co-ordination mechanisms between the different organisations (non for profit organisations, public authorities, public sector research, biotech SMEs and large firms, especially in the biomedical sector), this paper shows that the path dependant institutions and the contradiction between the different policy tools to promote science base knowledge commercialisation can explain the poor development of biotech sector in France in the last few years, in spite of a high investment in the basic scientific researcb in life sciences.innovation system; S§T policy; biotechnology; genomics; Triple Helix model; France; policy-making; diffusion-oriented policy; science base knowledge

    A case study for cloud based high throughput analysis of NGS data using the globus genomics system

    Get PDF
    AbstractNext generation sequencing (NGS) technologies produce massive amounts of data requiring a powerful computational infrastructure, high quality bioinformatics software, and skilled personnel to operate the tools. We present a case study of a practical solution to this data management and analysis challenge that simplifies terabyte scale data handling and provides advanced tools for NGS data analysis. These capabilities are implemented using the “Globus Genomics” system, which is an enhanced Galaxy workflow system made available as a service that offers users the capability to process and transfer data easily, reliably and quickly to address end-to-endNGS analysis requirements. The Globus Genomics system is built on Amazon's cloud computing infrastructure. The system takes advantage of elastic scaling of compute resources to run multiple workflows in parallel and it also helps meet the scale-out analysis needs of modern translational genomics research
    corecore