406 research outputs found

    Don't hold my data hostage - A case for client protocol redesign

    Get PDF
    Transferring a large amount of data from a database to a client program is a surprisingly expensive operation. The time this requires can easily dominate the query execution time for large result sets. This represents a significant hurdle for external data analysis, for example when using statistical software. In this paper, we explore and analyse the result set serialization design space. We present experimental results from a large chunk of the database market and show the inefficiencies of current approaches. We then propose a columnar serialization method that improves transmission performance by an order of magnitude

    Data Management for Data Science - Towards Embedded Analytics

    Get PDF
    The rise of Data Science has caused an influx of new usersin need of data management solutions. However, insteadof utilizing existing RDBMS solutions they are opting touse a stack of independent solutions for data storage andprocessing glued together by scripting languages. This is notbecause they do not need the functionality that an integratedRDBMS provides, but rather because existing RDBMS im-plementations do not cater to their use case. To solve theseissues, we propose a new class of data management systems:embedded analytical systems. These systems are tightlyintegrated with analytical tools, and provide fast and effi-cient access to the data stored within them. In this work,we describe the unique challenges and opportunities w.r.tworkloads, resilience and cooperation that are faced by thisnew class of systems and the steps we have taken towardsaddressing them in the DuckDB system

    Методика оценки деятельности персонала предприятия на основе системы сбалансированных показателей

    Get PDF
    This article is about the mothods to Measure Employee Performance with KPI and balanced scorecard (BSC. It is consider the advantages and disadvantages of this system. Thus, a company gets a management tool linking strategy and daily operations

    BATSE observations of BL Lac Objects

    Get PDF
    The Burst and Transient Source Experiment (BATSE) on the Compton Gamma-Ray Observatory has been shown to be sensitive to non-transient hard X-ray sources in our galaxy, down to flux levels of 100 mCrab for daily measurements, 3 mCrab for integrations over several years. We use the continuous BATSE database and the Earth Occultation technique to extract average flux values between 20 and 200 keV from complete radio- and X-ray- selected BL Lac samples over a 2 year period

    Deep integration of machine learning Into column stores

    Get PDF
    We leverage vectorized User-Defined Functions (UDFs) to efficiently integrate unchanged machine learning pipelines into an analytical data management system. The entire pipelines including data, models, parameters and evaluation outcomes are stored and executed inside the database system. Experiments using our MonetDB/Python UDFs show greatly improved performance due to reduced data movement and parallel processing opportunities. In addition, this integration enables meta-analysis of models using relational queries

    Deployment of RDFa, Microdata, and Microformats on the Web – A Quantitative Analysis

    Get PDF
    More and more websites embed structured data describing for instance products, reviews, blog posts, people, organizations, events, and cooking recipes into their HTML pages using markup standards such as Microformats, Microdata and RDFa. This development has accelerated in the last two years as major Web companies, such as Google, Facebook, Yahoo!, and Microsoft, have started to use the embedded data within their applications. In this paper, we analyze the adoption of RDFa, Microdata, and Microformats across the Web. Our study is based on a large public Web crawl dating from early 2012 and consisting of 3 billion HTML pages which originate from over 40 million websites. The analysis reveals the deployment of the different markup standards, the main topical areas of the published data as well as the different vocabularies that are used within each topical area to represent data. What distinguishes our work from earlier studies, published by the large Web companies, is that the analyzed crawl as well as the extracted data are publicly available. This allows our findings to be verified and to be used as starting points for further domain-specific investigations as well as for focused information extraction endeavors

    The long-term effectiveness of the New Zealand Green Prescription primary health care intervention on Christchurch residents

    Get PDF
    The aim of this research was to evaluate the long term effectiveness of the ‘Green Prescription’ programme, (GRx) in encouraging an increase in physical activity levels in previously inactive individuals, between Janurary 1st 2012 – May 1st 2014. Participants were a non-randomised subset of a larger GRx population. Prescribed Christchurch residents were seperated into two groups, the intervention group (discharged-independently active from the programme) and the control group (discharged-not registered-declined programme and discharged-registered-declined programme). These groups were then randomly selected using Microsoft Excel. A retrospective survey was administered and conducted via telephone. Completed surveys were attained from 147 of 498 participants, a total response rate of 29.9% between the two groups. Forty-one percent of participants in the intervention group reported increases in physical activity levels since being prescribed the GRx programme, 23.1% meet the national physical activity guidelines, and 73.6% were classified as non-sedentary. A higher proportion of the control group (46.4%) were classified as sedentary and only 16.1% met the national physical activity guidelines. Participants who had completed a GRx averaged 146.9 ± 173.5 (mean ± SD) physical activity minutes per week in comparision to the control group 83.1 ± 100.3. A decrease in meeting physical activity guidelines was observed the longer participants were off the Green Prescription Programme. Participants in the intervention group also reported higher levels of energy, increased mobility, a decrease in medication, body weight and aches and pains, had fewer breathing difficulties, felt stronger and more mentally relaxed compared to those in the control group
    corecore