25,782 research outputs found

    Probabilistic linkage without personal information successfully linked national clinical datasets: Linkage of national clinical datasets without patient identifiers using probabilistic methods.

    Get PDF
    BACKGROUND: Probabilistic linkage can link patients from different clinical databases without the need for personal information. If accurate linkage can be achieved, it would accelerate the use of linked datasets to address important clinical and public health questions. OBJECTIVE: We developed a step-by-step process for probabilistic linkage of national clinical and administrative datasets without personal information, and validated it against deterministic linkage using patient identifiers. STUDY DESIGN AND SETTING: We used electronic health records from the National Bowel Cancer Audit (NBOCA) and Hospital Episode Statistics (HES) databases for 10,566 bowel cancer patients undergoing emergency surgery in the English National Health Service. RESULTS: Probabilistic linkage linked 81.4% of NBOCA records to HES, versus 82.8% using deterministic linkage. No systematic differences were seen between patients that were and were not linked, and regression models for mortality and length of hospital stay according to patient and tumour characteristics were not sensitive to the linkage approach. CONCLUSION: Probabilistic linkage was successful in linking national clinical and administrative datasets for patients undergoing a major surgical procedure. It allows analysts outside highly secure data environments to undertake linkage while minimising costs and delays, protecting data security, and maintaining linkage quality

    National Aeronautics and Space Administration fundamental research program. Information utilization and evaluation

    Get PDF
    In the second half of the 1980's NASA can expect to face difficult choices among alternative fundamental and applied research, and development projects that could potentially lead to improvements in the information systems used to manage renewable resources. The working group on information utilization and evaluation believes that effective choices cannot be made without a better understanding of the current and prospective problems and opportunities involved in the application of remote sensing to improve renewable research information systems. A renewable resources information system is defined in a broad context to include a flow of data/information from: acquisition through processing, storage, integration with other data, analysis, graphic presentation, decision making, and assessment of the affects of those decisions

    Low genetic variability, female-biased dispersal and high movement rates in an urban population of Eurasian badgersMeles meles

    Get PDF
    1. Urban and rural populations of animals can differ in their behaviour, both in order to meet their ecological requirements and due to the constraints imposed by different environments. The study of urban populations can therefore offer useful insights into the behavioural flexibility of a species as a whole, as well as indicating how the species in question adapts to a specifically urban environment. 2. The genetic structure of a population can provide information about social structure and movement patterns that is difficult to obtain by other means. Using non-invasively collected hair samples, we estimated the population size of Eurasian badgers Meles meles in the city of Brighton, England, and calculated population-specific parameters of genetic variability and sex-specific rates of outbreeding and dispersal. 3. Population density was high in the context of badger densities reported throughout their range. This was due to a high density of social groups rather than large numbers of individuals per group. 4. The allelic richness of the population was low compared with other British populations. However, the rate of extra-group paternity and the relatively frequent (mainly temporary) intergroup movements suggest that, on a local scale, the population was outbred. Although members of both sexes visited other groups, there was a trend for more females to make intergroup movements. 5. The results reveal that urban badgers can achieve high densities and suggest that while some population parameters are similar between urban and rural populations, the frequency of intergroup movements is higher among urban badgers. In a wider context, these results demonstrate the ability of non-invasive genetic sampling to provide information about the population density, social structure and behaviour of urban wildlife

    Blockchain-backed analytics. Adding blockchain-based quality gates to data science projects

    Full text link
    [EN] A typical analytical lifecycle in data science projects starts with the process of data generation and collection, continues with data preparation and preprocessing and heads towards project specific analytics, visualizations and presentations. In order to ensure high quality trusted analytics, every relevant step of the data-model-result linkage needs to meet certain quality standards that furthermore should be certified by trusted quality gate mechanisms.We propose “blockchain-backed analytics”, a scalable and easy-to-use generic approach to introduce quality gates to data science projects, backed by the immutable records of a blockchain. For that reason, data, models and results are stored as cryptographically hashed fingerprints with mutually linked transactions in a public blockchain database.This approach enables stakeholders of data science projects to track and trace the linkage of data, applied models and modeling results without the need of trust validation of escrow systems or any other third party.Herrmann, M.; Petzold, J.; Bombatkar, V. (2018). Blockchain-backed analytics. Adding blockchain-based quality gates to data science projects. En 2nd International Conference on Advanced Reserach Methods and Analytics (CARMA 2018). Editorial Universitat Politècnica de València. 1-9. https://doi.org/10.4995/CARMA2018.2018.8292OCS1

    Data Mining in Electronic Commerce

    Full text link
    Modern business is rushing toward e-commerce. If the transition is done properly, it enables better management, new services, lower transaction costs and better customer relations. Success depends on skilled information technologists, among whom are statisticians. This paper focuses on some of the contributions that statisticians are making to help change the business world, especially through the development and application of data mining methods. This is a very large area, and the topics we cover are chosen to avoid overlap with other papers in this special issue, as well as to respect the limitations of our expertise. Inevitably, electronic commerce has raised and is raising fresh research problems in a very wide range of statistical areas, and we try to emphasize those challenges.Comment: Published at http://dx.doi.org/10.1214/088342306000000204 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Time Slot Management in Attended Home Delivery

    Get PDF
    Many e-tailers providing attended home delivery, especially e-grocers, offer narrow delivery time slots to ensure satisfactory customer service. The choice of delivery time slots has to balance marketing and operational considerations, which results in a complex planning problem. We study the problem of selecting the set of time slots to offer in each of the zip codes in a service region. The selection needs to facilitate cost-effective delivery routes, but also needs to ensure an acceptable level of service to the customer. We present two fully-automated approaches that are capable of producing high-quality delivery time slot offerings in a reasonable amount of time. Computational experiments reveal the value of these approaches and the impact of the environment on the underlying trade-offs.integer programming;vehicle routing;continuous approximation;e-grocery;home delivery;time slots
    corecore