35 research outputs found

    De-identifying a public use microdata file from the Canadian national discharge abstract database

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Canadian Institute for Health Information (CIHI) collects hospital discharge abstract data (DAD) from Canadian provinces and territories. There are many demands for the disclosure of this data for research and analysis to inform policy making. To expedite the disclosure of data for some of these purposes, the construction of a DAD public use microdata file (PUMF) was considered. Such purposes include: confirming some published results, providing broader feedback to CIHI to improve data quality, training students and fellows, providing an easily accessible data set for researchers to prepare for analyses on the full DAD data set, and serve as a large health data set for computer scientists and statisticians to evaluate analysis and data mining techniques. The objective of this study was to measure the probability of re-identification for records in a PUMF, and to de-identify a national DAD PUMF consisting of 10% of records.</p> <p>Methods</p> <p>Plausible attacks on a PUMF were evaluated. Based on these attacks, the 2008-2009 national DAD was de-identified. A new algorithm was developed to minimize the amount of suppression while maximizing the precision of the data. The acceptable threshold for the probability of correct re-identification of a record was set at between 0.04 and 0.05. Information loss was measured in terms of the extent of suppression and entropy.</p> <p>Results</p> <p>Two different PUMF files were produced, one with geographic information, and one with no geographic information but more clinical information. At a threshold of 0.05, the maximum proportion of records with the diagnosis code suppressed was 20%, but these suppressions represented only 8-9% of all values in the DAD. Our suppression algorithm has less information loss than a more traditional approach to suppression. Smaller regions, patients with longer stays, and age groups that are infrequently admitted to hospitals tend to be the ones with the highest rates of suppression.</p> <p>Conclusions</p> <p>The strategies we used to maximize data utility and minimize information loss can result in a PUMF that would be useful for the specific purposes noted earlier. However, to create a more detailed file with less information loss suitable for more complex health services research, the risk would need to be mitigated by requiring the data recipient to commit to a data sharing agreement.</p

    Spationomy

    Get PDF
    This open access book is based on "Spationomy ā€“ Spatial Exploration of Economic Data", an interdisciplinary and international project in the frame of ERASMUS+ funded by the European Union. The project aims to exchange interdisciplinary knowledge in the fields of economics and geomatics. For the newly introduced courses, interdisciplinary learning materials have been developed by a team of lecturers from four different universities in three countries. In a first study block, students were taught methods from the two main research fields. Afterwards, the knowledge gained had to be applied in a project. For this international project, teams were formed, consisting of one student from each university participating in the project. The achieved results were presented in a summer school a few months later. At this event, more methodological knowledge was imparted to prepare students for a final simulation game about spatial and economic decision making. In a broader sense, the chapters will present the methodological background of the project, give case studies and show how visualisation and the simulation game works

    Spationomy

    Get PDF
    This open access book is based on "Spationomy ā€“ Spatial Exploration of Economic Data", an interdisciplinary and international project in the frame of ERASMUS+ funded by the European Union. The project aims to exchange interdisciplinary knowledge in the fields of economics and geomatics. For the newly introduced courses, interdisciplinary learning materials have been developed by a team of lecturers from four different universities in three countries. In a first study block, students were taught methods from the two main research fields. Afterwards, the knowledge gained had to be applied in a project. For this international project, teams were formed, consisting of one student from each university participating in the project. The achieved results were presented in a summer school a few months later. At this event, more methodological knowledge was imparted to prepare students for a final simulation game about spatial and economic decision making. In a broader sense, the chapters will present the methodological background of the project, give case studies and show how visualisation and the simulation game works

    Women and Stability: A Topological View of the Relationship between Women and Armed Conflict in West Africa

    Get PDF
    The relationship between women and stability, if any, is a topic of much debate and research. Several large and influential organizations have all researched women\u27s effect on stability. Furthermore, several of these world organizations, the United Nations, in particular, have declared gender equality to be a driving force in promoting stability and conflict prevention. Due to the United States active involvement in conflict prevention in such regions as West Africa, research concerning the relationship between women and stability is of particular interest to the United States Africa Command. As such, this research applied Topological Data Analysis, combined with other machine learning algorithms, to Demographic and Health Survey Program data combined with Armed Conflict Location and Event Data so as to observe the relationship between women\u27s status and armed conflicts in the West African region. While this team did not observe any direct correlation between women\u27s well-being and stability - defined as a lack of armed conflict events - the chosen methodologies and data usage have potential implications for future research concerning stability and conflict

    Aggregating privatized medical data for secure querying applications

    Full text link
    &nbsp;This thesis analyses and examines the challenges of aggregation of sensitive data and data querying on aggregated data at cloud server. This thesis also delineates applications of aggregation of sensitive medical data in several application scenarios, and tests privatization techniques to assist in improving the strength of privacy and utility

    Biometric recognition in automated border control : a survey

    Get PDF
    The increasing demand for traveler clearance at international border crossing points (BCPs) has motivated research for finding more efficient solutions. Automated border control (ABC) is emerging as a solution to enhance the convenience of travelers, the throughput of BCPs, and national security. This is the first comprehensive survey on the biometric techniques and systems that enable automatic identity verification in ABC. We survey the biometric literature relevant to identity verification and summarize the best practices and biometric techniques applicable to ABC, relying on real experience collected in the field. Furthermore, we select some of the major biometric issues raised and highlight the open research areas

    Semantic discovery and reuse of business process patterns

    Get PDF
    Patterns currently play an important role in modern information systems (IS) development and their use has mainly been restricted to the design and implementation phases of the development lifecycle. Given the increasing significance of business modelling in IS development, patterns have the potential of providing a viable solution for promoting reusability of recurrent generalized models in the very early stages of development. As a statement of research-in-progress this paper focuses on business process patterns and proposes an initial methodological framework for the discovery and reuse of business process patterns within the IS development lifecycle. The framework borrows ideas from the domain engineering literature and proposes the use of semantics to drive both the discovery of patterns as well as their reuse

    Happiness and environmental quality

    Get PDF
    Subjective wellbeing ā€” happiness ā€” is of increasing interest to economists, including environmental economists. There are several reasons for thinking that environmental quality (EQ), deļ¬ned as high levels of environmental goods and low levels of environmental ā€˜badsā€™, will be positively related to happiness. Quantitative evidence on this remains limited, however. Some papers use cross-sectional data aggregated at country level, but it is open to doubt whether these aggregated measures reļ¬‚ect individualsā€™ real EQ exposures. Other papers use individual-level data, but in general have spatial data at very coarse resolution, and consider a limited range of EQ variables, exclusively around individualsā€™ homes. This thesis reports two related strands of work. The ļ¬rst designs, implements and analyses data from two new cross-sectional surveys. It builds on earlier work by using spatial data at very high resolution, and advanced Geographical Information Systems (GIS) techniques; by simultaneously considering multiple EQ characteristics, around both homes and workplaces; and by investigating the sensitivity of results to the choice of happiness indicator. The second strand develops and implements a new methodology focused on individualsā€™ momentary experiences of the environment. It extends a protocol known by psychologists as the Experience Sampling Method (ESM) to incorporate satellite (GPS) location data. Using an app for participantsā€™ own smartphones, called Mappiness, it collects a panel data set comprising millions of geo-located responses from thousands of volunteers. EQ indicators are again joined to this data set using GIS. Results of the ļ¬rst strand of work are mixed, but support some links between happiness and the accessibility of natural environments, providing quantitative (including monetary) estimates of their strength. The second strand demonstrates that individuals are signiļ¬cantly and substantially happier outdoors in natural environments than continuous urban ones. It introduces a valuable new line of evidence on this question, which has great potential for future development

    Tradition and Innovation in Construction Project Management

    Get PDF
    This book is a reprint of the Special Issue 'Tradition and Innovation in Construction Project Management' that was published in the journal Buildings

    Advanced Location-Based Technologies and Services

    Get PDF
    Since the publication of the first edition in 2004, advances in mobile devices, positioning sensors, WiFi fingerprinting, and wireless communications, among others, have paved the way for developing new and advanced location-based services (LBSs). This second edition provides up-to-date information on LBSs, including WiFi fingerprinting, mobile computing, geospatial clouds, geospatial data mining, location privacy, and location-based social networking. It also includes new chapters on application areas such as LBSs for public health, indoor navigation, and advertising. In addition, the chapter on remote sensing has been revised to address advancements
    corecore