364 research outputs found

    Enhancing Water Quality Data Service Discovery And Access Using Standard Vocabularies

    Full text link
    There is a growing need for consistency across the publishing, discovering, integrating and access to scientific datasets, such as water quality data. Such datasets may have varying formats and service interfaces. The Network Common Data Form (NetCDF) is both a software package and a data format for producing array-oriented scientific data, which is commonly used to exchange data, including water quality data. NetCDF datasets are also published through service interfaces using the THREDDS data server. Alternatively water quality datasets can be encoded with standard XML formats such as WaterML 2.0, which can be published with services such as the Open Geospatial Consortium (OGC) community\u27s Web Feature Service interface standard (WFS). However, appropriate interpretation of the content, discovery and interoperability of data depends on common models, schemas and vocabularies, though these may not always be available. Using the water quality vocabulary we have developed, formalized using the Resource Description Framework (RDF) language, and published as Linked Data, we demonstrate the use of such standard vocabularies in existing data services for providing service capability metadata. We also present methods for augmenting existing metadata fields for water quality data specifically in formats such as NetCDF, WaterML 2.0 using standard vocabularies. We show how using standard vocabularies that are encoded and published using semantic technologies can enhance discovery, integration and access to existing data services delivering water quality datasets

    A Harmonized Vocabulary For Water Quality

    Full text link
    Interoperability of water quality data depends on the use of common models, schemas and vocabularies. However, terms are usually collected during different activities and projects in isolation of one another, resulting in vocabularies that have the same scope being represented with different terms, using different formats and formalisms, and published in various access methods. Significantly, most water quality vocabularies conflate multiple concepts in a single term, e.g. quantity kind, units of measure, substance or taxon, medium and procedure. This bundles information associated with separate elements from the OGC Observations and Measurements (O&M) model into a single slot. We have developed a water quality vocabulary, formalized using RDF, and published as Linked Data. The terms were extracted from existing water quality vocabularies. The observable property model is inspired by O&M but aligned with existing ontologies. The core is an OWL ontology that extends the QUDT ontology for Unit and QuantityKind definitions. We add classes to generalize the QuantityKind model, and properties for explicit description of the conflated concepts. The key elements are defined to be sub-classes or sub-properties of SKOS elements, which enables a SKOS view to be published through standard vocabulary APIs, alongside the full view. QUDT terms are re-used where possible, supplemented with additional Unit and QuantityKind entries required for water quality. Along with items from separate vocabularies developed for objects, media, and procedures, these are linked into definitions in the actual observable property vocabulary. Definitions of objects related to chemical substances are linked to items from the Chemical Entities of Biological Interest (ChEBI) ontology. Mappings to other vocabularies, such as DBPedia, are in separately maintained files. By formalizing the model for observable properties, and clearly labelling the separate concerns, water quality observations from different sources may be more easily merged and also transformed to O&M for cross-domain applications

    Harmonization Of Vocabularies For Water Data

    Full text link
    Observational data encodes values of properties associated with a feature of interest, estimated by a specified procedure. For water the properties are physical parameters like level, volume, flow and pressure, and concentrations and counts of chemicals, substances and organisms. Water property vocabularies have been assembled at project, agency and jurisdictional level. Organizations such as EPA, USGS, CEH, GA and BoM maintain vocabularies for internal use, and may make them available externally as text files. BODC and MMI have harvested many water vocabularies alongside others of interest in their domain, formalized the content using SKOS, and published them through web interfaces. Scope is highly variable both within and between vocabularies. Individual items may conflate multiple concerns (e.g. property, instrument, statistical procedure, units). There is significant duplication between vocabularies. Semantic web technologies provide the opportunity both to publish vocabularies more effectively, and achieve harmonization to support greater interoperability between datasets. - Models for vocabulary items (property, substance/taxon, process, unit-of-measure, etc) may be formalized OWL ontologies, supporting semantic relations between items in related vocabularies; - By specializing the ontology elements from SKOS concepts and properties, diverse vocabularies may be published through a common interface; - Properties from standard vocabularies (e.g. OWL, SKOS, PROV-O and VAEM) support mappings between vocabularies having a similar scope - Existing items from various sources may be assembled into new virtual vocabularies However, there are a number of challenges: - use of standard properties such as sameAs/exactMatch/equivalentClass require reasoning support; - items have been conceptualised as both classes and individuals, complicating the mapping mechanics; - re-use of items across vocabularies may conflict with expectations concerning URI patterns; - versioning complicates cross-references and re-use. This presentation will discuss ways to harness semantic web technologies to publish harmonized vocabularies, and will summarise how many of the challenges may be addressed

    Measurement of the Bottom-Strange Meson Mixing Phase in the Full CDF Data Set

    Get PDF
    We report a measurement of the bottom-strange meson mixing phase \beta_s using the time evolution of B0_s -> J/\psi (->\mu+\mu-) \phi (-> K+ K-) decays in which the quark-flavor content of the bottom-strange meson is identified at production. This measurement uses the full data set of proton-antiproton collisions at sqrt(s)= 1.96 TeV collected by the Collider Detector experiment at the Fermilab Tevatron, corresponding to 9.6 fb-1 of integrated luminosity. We report confidence regions in the two-dimensional space of \beta_s and the B0_s decay-width difference \Delta\Gamma_s, and measure \beta_s in [-\pi/2, -1.51] U [-0.06, 0.30] U [1.26, \pi/2] at the 68% confidence level, in agreement with the standard model expectation. Assuming the standard model value of \beta_s, we also determine \Delta\Gamma_s = 0.068 +- 0.026 (stat) +- 0.009 (syst) ps-1 and the mean B0_s lifetime, \tau_s = 1.528 +- 0.019 (stat) +- 0.009 (syst) ps, which are consistent and competitive with determinations by other experiments.Comment: 8 pages, 2 figures, Phys. Rev. Lett 109, 171802 (2012

    Number preferences in lotteries

    Get PDF
    We explore people's preferences for numbers in large proprietary data sets from two different lottery games. We find that choice is far from uniform, and exhibits some familiar and some new tendencies and biases. Players favor personally meaningful and situationally available numbers, and are attracted towards numbers in the center of the choice form. Frequent players avoid winning numbers from recent draws, whereas infrequent players chase these. Combinations of numbers are formed with an eye for aesthetics, and players tend to spread their numbers relatively evenly across the possible range

    Integrated motor drives: state of the art and future trends

    Get PDF
    With increased need for high power density, high efficiency and high temperature capabilities in Aerospace and Automotive applications, Integrated Motor Drives (IMD) offers a potential solution. However, close physical integration of the converter and the machine may also lead to an increase in components temperature. This requires careful mechanical, structural and thermal analysis; and design of the IMD system. This paper reviews existing IMD technologies and their thermal effects on the IMD system. The effects of the power electronics (PE) position on the IMD system and its respective thermal management concepts are also investigated. The challenges faced in designing and manufacturing of an IMD along with the mechanical and structural impacts of close physical integration is also discussed and potential solutions are provided. Potential converter topologies for an IMD like the Matrix converter, 2-level Bridge, 3-level NPC and Multiphase full bridge converters are also reviewed. Wide band gap devices like SiC and GaN and their packaging in power modules for IMDs are also discussed. Power modules components and packaging technologies are also presented

    Genetic predisposition to ductal carcinoma in situ of the breast

    Get PDF
    Background: Ductal carcinoma in situ (DCIS) is a non-invasive form of breast cancer. It is often associated with invasive ductal carcinoma (IDC), and is considered to be a non-obligate precursor of IDC. It is not clear to what extent these two forms of cancer share low-risk susceptibility loci, or whether there are differences in the strength of association for shared loci. Methods: To identify genetic polymorphisms that predispose to DCIS, we pooled data from 38 studies comprising 5,067 cases of DCIS, 24,584 cases of IDC and 37,467 controls, all genotyped using the iCOGS chip. Results: Most (67 %) of the 76 known breast cancer predisposition loci showed an association with DCIS in the same direction as previously reported for invasive breast cancer. Case-only analysis showed no evidence for differences between associations for IDC and DCIS after considering multiple testing. Analysis by estrogen receptor (ER) status confirmed that loci associated with ER positive IDC were also associated with ER positive DCIS. Analysis of DCIS by grade suggested that two independent SNPs at 11q13.3 near CCND1 were specific to low/intermediate grade DCIS (rs75915166, rs554219). These associations with grade remained after adjusting for ER status and were also found in IDC. We found no novel DCIS-specific loci at a genome wide significance level of P < 5.0x10-8. Conclusion: In conclusion, this study provides the strongest evidence to date of a shared genetic susceptibility for IDC and DCIS. Studies with larger numbers of DCIS are needed to determine if IDC or DCIS specific loci exist

    Refined histopathological predictors of BRCA1 and BRCA2 mutation status: A large-scale analysis of breast cancer characteristics from the BCAC, CIMBA, and ENIGMA consortia

    Get PDF
    Introduction: The distribution of histopathological features of invasive breast tumors in BRCA1 or BRCA2 germline mutation carriers differs from that of individuals with no known mutation. Histopathological features thus have utility for mutation prediction, including statistical modeling to assess pathogenicity of BRCA1 or BRCA2 variants of uncertain clinical significance. We analyzed large pathology datasets accrued by the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA) and the Breast Cancer Association Consortium (BCAC) to reassess histopathological predictors of BRCA1 and BRCA2 mutation status, and provide robust likelihood ratio (LR) estimates for statistical modeling. Methods: Selection criteria for study/center inclusion were estrogen receptor (ER) status or grade data available for invasive breast cancer diagnosed younger than 70 years. The dataset included 4,477 BRCA1 mutation carriers, 2,565 BRCA2 mutation carriers, and 47,565 BCAC breast cancer cases. Country-stratified estimates of the

    No evidence that protein truncating variants in BRIP1 are associated with breast cancer risk: implications for gene panel testing.

    Get PDF
    BACKGROUND: BRCA1 interacting protein C-terminal helicase 1 (BRIP1) is one of the Fanconi Anaemia Complementation (FANC) group family of DNA repair proteins. Biallelic mutations in BRIP1 are responsible for FANC group J, and previous studies have also suggested that rare protein truncating variants in BRIP1 are associated with an increased risk of breast cancer. These studies have led to inclusion of BRIP1 on targeted sequencing panels for breast cancer risk prediction. METHODS: We evaluated a truncating variant, p.Arg798Ter (rs137852986), and 10 missense variants of BRIP1, in 48 144 cases and 43 607 controls of European origin, drawn from 41 studies participating in the Breast Cancer Association Consortium (BCAC). Additionally, we sequenced the coding regions of BRIP1 in 13 213 cases and 5242 controls from the UK, 1313 cases and 1123 controls from three population-based studies as part of the Breast Cancer Family Registry, and 1853 familial cases and 2001 controls from Australia. RESULTS: The rare truncating allele of rs137852986 was observed in 23 cases and 18 controls in Europeans in BCAC (OR 1.09, 95% CI 0.58 to 2.03, p=0.79). Truncating variants were found in the sequencing studies in 34 cases (0.21%) and 19 controls (0.23%) (combined OR 0.90, 95% CI 0.48 to 1.70, p=0.75). CONCLUSIONS: These results suggest that truncating variants in BRIP1, and in particular p.Arg798Ter, are not associated with a substantial increase in breast cancer risk. Such observations have important implications for the reporting of results from breast cancer screening panels.The COGS project is funded through a European Commission's Seventh Framework Programme grant (agreement number 223175 - HEALTH-F2-2009-223175). BCAC is funded by Cancer Research UK [C1287/A10118, C1287/A12014] and by the European Community´s Seventh Framework Programme under grant agreement number 223175 (grant number HEALTH-F2-2009-223175) (COGS). Funding for the iCOGS infrastructure came from: the European Community's Seventh Framework Programme under grant agreement n° 223175 (HEALTH-F2-2009-223175) (COGS), Cancer Research UK (C1287/A10118, C1287/A 10710, C12292/A11174, C1281/A12014, C5047/A8384, C5047/A15007, C5047/A10692, C8197/A16565), the National Institutes of Health (CA128978) and Post-Cancer GWAS initiative (1U19 CA148537, 1U19 16 CA148065 and 1U19 CA148112 - the GAME-ON initiative), the Department of Defense (W81XWH-10-1- 0341), the Canadian Institutes of Health Research (CIHR) for the CIHR Team in Familial Risks of Breast Cancer, Komen Foundation for the Cure, the Breast Cancer Research Foundation, and the Ovarian Cancer Research Fund. This study made use of data generated by the Wellcome Trust Case Control consortium. Funding for the project was provided by the Wellcome Trust under award 076113. The results published here are in part based upon data generated by The Cancer Genome Atlas Project established by the National Cancer Institute and National Human Genome Research Institute.This is the author accepted manuscript. The final version is available from BMJ Group at http://dx.doi.org/10.1136/jmedgenet-2015-103529

    Identification of independent association signals and putative functional variants for breast cancer risk through fine-scale mapping of the 12p11 locus.

    Get PDF
    BACKGROUND: Multiple recent genome-wide association studies (GWAS) have identified a single nucleotide polymorphism (SNP), rs10771399, at 12p11 that is associated with breast cancer risk. METHOD: We performed a fine-scale mapping study of a 700 kb region including 441 genotyped and more than 1300 imputed genetic variants in 48,155 cases and 43,612 controls of European descent, 6269 cases and 6624 controls of East Asian descent and 1116 cases and 932 controls of African descent in the Breast Cancer Association Consortium (BCAC; http://bcac.ccge.medschl.cam.ac.uk/ ), and in 15,252 BRCA1 mutation carriers in the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA). Stepwise regression analyses were performed to identify independent association signals. Data from the Encyclopedia of DNA Elements project (ENCODE) and the Cancer Genome Atlas (TCGA) were used for functional annotation. RESULTS: Analysis of data from European descendants found evidence for four independent association signals at 12p11, represented by rs7297051 (odds ratio (OR) = 1.09, 95 % confidence interval (CI) = 1.06-1.12; P = 3 × 10(-9)), rs805510 (OR = 1.08, 95 % CI = 1.04-1.12, P = 2 × 10(-5)), and rs1871152 (OR = 1.04, 95 % CI = 1.02-1.06; P = 2 × 10(-4)) identified in the general populations, and rs113824616 (P = 7 × 10(-5)) identified in the meta-analysis of BCAC ER-negative cases and BRCA1 mutation carriers. SNPs rs7297051, rs805510 and rs113824616 were also associated with breast cancer risk at P < 0.05 in East Asians, but none of the associations were statistically significant in African descendants. Multiple candidate functional variants are located in putative enhancer sequences. Chromatin interaction data suggested that PTHLH was the likely target gene of these enhancers. Of the six variants with the strongest evidence of potential functionality, rs11049453 was statistically significantly associated with the expression of PTHLH and its nearby gene CCDC91 at P < 0.05. CONCLUSION: This study identified four independent association signals at 12p11 and revealed potentially functional variants, providing additional insights into the underlying biological mechanism(s) for the association observed between variants at 12p11 and breast cancer risk.UK funding includes Cancer Research UK and NIH.This is the final version of the article. It first appeared from BioMed Central via http://dx.doi.org/10.1186/s13058-016-0718-
    corecore