48 research outputs found

    Efficient Versioning for Scientific Array Databases

    In this paper, we describe a versioned database storage manager we are developing for the SciDB scientific database. The system is designed to efficiently store and retrieve array-oriented data, exposing a "no-overwrite" storage model in which each update creates a new "version" of an array. This makes it possible to perform comparisons of versions produced at different times or by different algorithms, and to create complex chains and trees of versions. We present algorithms to efficiently encode these versions, minimizing storage space while still providing efficient access to the data. Additionally, we present an optimal algorithm that, given a long sequence of versions, determines which versions to encode in terms of each other (using delta compression) to minimize total storage space or query execution cost. We compare the performance of these algorithms on real-world data sets from the National Oceanic and Atmospheric Administration (NOAA), Open Street Maps, and several other sources. We show that our algorithms provide better performance than existing version control systems not optimized for array data, both in terms of storage size and access time, and that our delta-compression algorithms are able to substantially reduce the total storage space when versions exist with a high degree of similarity. Funding: National Science Foundation (U.S.) (Grant IIS/III-1111371); National Science Foundation (U.S.) (Grant SI2-1047955).

    Self-Enforcing Access Control for Encrypted RDF

    The amount of raw data exchanged via web protocols is steadily increasing. Although the Linked Data infrastructure could potentially be used to selectively share RDF data with different individuals or organisations, the primary focus remains on the unrestricted sharing of public data. In order to extend the Linked Data paradigm to cater for closed data, there is a need to augment the existing infrastructure with robust security mechanisms. At the most basic level, both access control and encryption mechanisms are required. In this paper, we propose a flexible and dynamic mechanism for securely storing and efficiently querying RDF datasets. By employing an encryption strategy based on Functional Encryption (FE), in which controlled data access does not require a trusted mediator but is instead enforced by the cryptographic approach itself, we allow for fine-grained access control over encrypted RDF data while at the same time reducing the administrative overhead associated with access control management.

    An integrated approach for increasing breeding efficiency in apple and peach in Europe

    Despite the availability of whole genome sequences of apple and peach, there has been a considerable gap between genomics and breeding. To bridge the gap, the European Union funded the FruitBreedomics project (March 2011 to August 2015), involving 28 research institutes and private companies. Three complementary approaches were pursued: (i) tool and software development, (ii) deciphering the genetic control of main horticultural traits, taking into account allelic diversity, and (iii) developing plant materials, tools and methodologies for breeders. Decisive breakthroughs were made, including ready-to-go DNA diagnostic tests for Marker Assisted Breeding, new, dense SNP arrays in apple and peach, new phenotypic methods for some complex traits, and software for gene/QTL discovery on breeding germplasm via Pedigree Based Analysis (PBA). This resulted in the discovery of highly predictive molecular markers for traits of horticultural interest via PBA and via Genome Wide Association Studies (GWAS) on several European genebank collections. FruitBreedomics also developed pre-breeding plant materials in which multiple sources of resistance were pyramided, and software that can support breeders in their selection activities. Through FruitBreedomics, significant progress was made in apple and peach breeding, genetics, genomics and bioinformatics, from which breeders, germplasm curators and scientists will benefit. A major part of the data collected during the project has been stored in the FruitBreedomics database and made available to the public. This review covers the scientific discoveries made in this major endeavour and the perspectives for apple and peach breeding and genomics in Europe and beyond.

    H2O: A Hands-free Adaptive Store

    Modern state-of-the-art database systems are designed around a single data storage layout. This is a fixed decision that drives the whole architectural design of a database system, e.g., row-stores or column-stores. However, none of those choices is a universally good solution; different workloads require different storage layouts and data access methods in order to achieve good performance. In this paper, we present the H2O system, which introduces two novel concepts. First, it is flexible enough to support multiple storage layouts and data access patterns in a single engine. Second, and most importantly, it decides on-the-fly, i.e., during query processing, which design is best for classes of queries and the respective data parts. At any given point in time, parts of the data might be materialized in various patterns purely depending on the query workload; as the workload changes and with every single query, the storage and access patterns continuously adapt. In this way, H2O makes no a priori and fixed decisions on how data should be stored, allowing each single query to enjoy a storage and access pattern which is tailored to its specific properties. We present a detailed analysis of H2O using both synthetic benchmarks and realistic scientific workloads. We demonstrate that while existing systems cannot achieve maximum performance across all workloads, H2O can always match the best case performance without requiring any tuning or workload knowledge.

    Thoracic spine pain in the general population: Prevalence, incidence and associated factors in children, adolescents and adults. A systematic review

    Background: Thoracic spine pain (TSP) is experienced across the lifespan by healthy individuals and is a common presentation in primary healthcare clinical practice. However, the epidemiological characteristics of TSP are not well documented compared to neck and low back pain. A rigorous evaluation of the prevalence, incidence, correlates and risk factors needs to be undertaken in order for epidemiologic data to be meaningfully used to develop evidence-based prevention and treatment recommendations for TSP. Methods: A systematic review method was followed to report the evidence describing prevalence, incidence, associated factors and risk factors for TSP among the general population. Nine electronic databases were systematically searched to identify studies that reported either prevalence, incidence, associated factors (cross-sectional study) or risk factors (prospective study) for TSP in healthy children, adolescents or adults. Studies were evaluated for level of evidence and method quality. Results: Of the 1389 studies identified in the literature, 33 met the inclusion criteria for this systematic review. The mean (SD) quality score (out of 15) for the included studies was 10.5 (2.0). TSP prevalence data ranged from 4.0–72.0% (point), 0.5–51.4% (7-day), 1.4–34.8% (1-month), 4.8–7.0% (3-month), 3.5–34.8% (1-year) and 15.6–19.5% (lifetime). TSP prevalence varied according to the operational definition of TSP. Prevalence for any TSP ranged from 0.5–23.0%, 15.8–34.8%, 15.0–27.5% and 12.0–31.2% for 7-day, 1-month, 1-year and lifetime periods, respectively. TSP associated with backpack use varied from 6.0–72.0% and 22.9–51.4% for point and 7-day periods, respectively. TSP interfering with school or leisure ranged from 3.5–9.7% for 1-year prevalence. Generally, studies reported a higher prevalence for TSP in child and adolescent populations, and particularly for females. The 1-month, 6-month, 1-year and 25-year incidences were 0–0.9%, 10.3%, 3.8–35.3% and 9.8%, respectively. TSP was significantly associated with: concurrent musculoskeletal pain; growth and physical; lifestyle and social; backpack; postural; psychological; and environmental factors. Risk factors identified for TSP in adolescents included age (being older) and poorer mental health. Conclusion: TSP is a common condition in the general population. While there is some evidence for biopsychosocial associations, it is limited, and further prospectively designed research is required to inform prevention and management strategies.

    In search of attributes that support self-regulation in blended learning environments


    Data-driven process discovery and analysis
