306,163 research outputs found

    OFFICIAL STATISTICS: ABOVE AND BELOW THE PUBLIC DEBATE. THIRTIETH GEARY LECTURE, 1999

    Get PDF
    Roy Geary was a person of great distinction, recognised for a wide range of achievements. He was a first class mathematician who made significant contributions to statistical theory. He was an Official Statistician of distinction and he made great contributions to the development of economic statistics and to the use of statistics for policy purposes in fields as diverse as demography and economic statistics. He was the first Director of the Central Statistics Office when it was created in 1949 and I am delighted to be asked to present this lecture in the CSO’s 50th birthday year

    The development of social class sensitive proxies for infant mortality at the PCT level: An appraisal of candiate indicators for the commission for health improvement

    Get PDF
    The main aim of the work is to identify social class-sensitive proxies for infant mortality at Primary Care Trust level that could be used in the CHI performance ratings process for PCTs in 2003/4

    Identifying Web Tables - Supporting a Neglected Type of Content on the Web

    Full text link
    The abundance of the data in the Internet facilitates the improvement of extraction and processing tools. The trend in the open data publishing encourages the adoption of structured formats like CSV and RDF. However, there is still a plethora of unstructured data on the Web which we assume contain semantics. For this reason, we propose an approach to derive semantics from web tables which are still the most popular publishing tool on the Web. The paper also discusses methods and services of unstructured data extraction and processing as well as machine learning techniques to enhance such a workflow. The eventual result is a framework to process, publish and visualize linked open data. The software enables tables extraction from various open data sources in the HTML format and an automatic export to the RDF format making the data linked. The paper also gives the evaluation of machine learning techniques in conjunction with string similarity functions to be applied in a tables recognition task.Comment: 9 pages, 4 figure

    Use of record-linkage to handle non-response and improve alcohol consumption estimates in health survey data: a study protocol

    Get PDF
    <p>Introduction: Reliable estimates of health-related behaviours, such as levels of alcohol consumption in the population, are required to formulate and evaluate policies. National surveys provide such data; validity depends on generalisability, but this is threatened by declining response levels. Attempts to address bias arising from non-response are typically limited to survey weights based on sociodemographic characteristics, which do not capture differential health and related behaviours within categories. This project aims to explore and address non-response bias in health surveys with a focus on alcohol consumption.</p> <p>Methods and analysis: The Scottish Health Surveys (SHeS) aim to provide estimates representative of the Scottish population living in private households. Survey data of consenting participants (92% of the achieved sample) have been record-linked to routine hospital admission (Scottish Morbidity Records (SMR)) and mortality (from National Records of Scotland (NRS)) data for surveys conducted in 1995, 1998, 2003, 2008, 2009 and 2010 (total adult sample size around 40 000), with maximum follow-up of 16 years. Also available are census information and SMR/NRS data for the general population. Comparisons of alcohol-related mortality and hospital admission rates in the linked SHeS-SMR/NRS with those in the general population will be made. Survey data will be augmented by quantification of differences to refine alcohol consumption estimates through the application of multiple imputation or inverse probability weighting. The resulting corrected estimates of population alcohol consumption will enable superior policy evaluation. An advanced weighting procedure will be developed for wider use.</p> <p>Ethics and dissemination: Ethics approval for SHeS has been given by the National Health Service (NHS) Multi-Centre Research Ethics Committee and use of linked data has been approved by the Privacy Advisory Committee to the Board of NHS National Services Scotland and Registrar General. Funding has been granted by the MRC. The outputs will include four or five public health and statistical methodological international journal and conference papers.</p&gt

    A visual exploration workflow as enabler for the exploitation of Linked Open Data

    Get PDF
    Abstract. Semantically annotating and interlinking Open Data results in Linked Open Data which concisely and unambiguously describes a knowledge domain. However, the uptake of the Linked Data depends on its usefulness to non-Semantic Web experts. Failing to support data consumers to understand the added-value of Linked Data and possible exploitation opportunities could inhibit its diffusion. In this paper, we propose an interactive visual workflow for discovering and ex-ploring Linked Open Data. We implemented the workflow considering academic library metadata and carried out a qualitative evaluation. We assessed the work-flow’s potential impact on data consumers which bridges the offer: published Linked Open Data; and the demand as requests for: (i) higher quality data; and (ii) more applications that re-use data. More than 70 % of the 34 test users agreed that the workflow fulfills its goal: it facilitates non-Semantic Web experts to un-derstand the potential of Linked Open Data.
    corecore