10 research outputs found

    Suicide Information Database-Cymru: a protocol for a population-based, routinely collected data linkage study to explore risks and patterns of healthcare contact prior to suicide to identify opportunities for intervention

    Get PDF
    Introduction Prevention of suicide is a global public health challenge extending beyond mental health services. Linking routinely collected health and social care system data records for the same individual across different services and over time has enormous potential in suicide research. Most previous research linking suicide mortality data with routinely collected electronic health records involves only one or two domains of healthcare provision such as psychiatric inpatient care. This protocol paper describes the development of a population-based, routinely collected data linkage study: the Suicide Information Database Cymru (SID-Cymru). SID-Cymru aims to contribute to the information available on people who complete suicide. Methods and analysis SID-Cymru will facilitate a series of electronic case–control studies based in the Secure Anonymised Information Linkage (SAIL) Databank. We have identified 2664 cases of suicide in Wales between 2003 and 2011 from routinely collected mortality data using International Classification of Diseases, Tenth Revision, codes X60–X84 (intentional self-harm) and Y10–Y34 (undetermined intent). Each case will be matched by age and sex to at least five controls. Records will be collated and linked from routinely collected health and social data in Wales for each individual. Conditional logistic regression will be applied to produce crude and confounder (including general practice, socioeconomic status) adjusted ORs. Ethics and dissemination The SAIL Databank has the required ethical permissions in place to analyse anonymised data. Ethical approval has been granted by the Information Governance Review Panel (IGRP). Findings will be disseminated through peer-reviewed publications, consultations with stakeholders and national/international conference presentations. The improved understanding of the prior health, nature of previous contacts with services and wider social circumstances of those who complete suicide will assist in prevention policy, service organisation and delivery. SID-Cymru is funded through the National Institute for Social Care and Health Research, Welsh Government (RFS-12-25)

    Reusable, set-based selection algorithm for matched control groups

    Get PDF
    ABSTRACT Aims The wealth of data available in linked administrative datasets offers great potential for research, but researchers face methodological and computational challenges in data preparation, due to the size and complexity of the data. The creation of matched control groups in the Secure Anonymised Information Linkage (SAIL) Databank illustrates this point: SAIL contains multiple health datasets describing millions of individuals in Wales. The volume of data creates the potential for more precise matching, but only if an appropriate algorithm can be applied. We aimed to create such an algorithm for reuse by many research projects. Methods We developed set-based code in SQL that efficiently selects matches from millions of potential combinations in a relational database environment. It is parameterized to allow different matching criteria to be employed as needed, including follow-up time around an index event. A combinatorial optimisation problem occurs when a potential control could match more than one subject, which we solved by ranking potential match pairs first by subject with the fewest potential matches, then by closeness of the match. Results One example of the algorithm’s use was the Suicide Information Database Cymru, an electronic case-control study on suicide in Wales between 2003 and 2011. Subjects who had a cause of death recorded as self-harm were each matched to twenty controls who were alive at the subject’s date of death and had the same gender and similar birth week. The rate of matching success was >99.9%, with all subjects but one matching the full twenty controls. >99.99% of the matched controls had a week of birth that was identical to the subject. The second example was a matched cohort study looking at hospital admissions and type 1 diabetes, using the Brecon register of childhood diabetes in Wales, with matching based on week of birth within two weeks, gender, county of residence, deprivation quintile, and residence in Wales at time of diagnosis. This study had a matching rate of 98.9%; 97.5% of subjects matched to five controls, and 69.8% of matches had the same week of birth. Conclusions This algorithm provides good matching performance while executing efficiently and scalably on large datasets. Its implementation as reusable code will facilitate more efficient, high-quality research in SAIL. Instead of spending many hours developing a custom solution, analysts can execute parameterized code in a few minutes. We hope it to be useful more widely beyond SAIL as well
    corecore