492 research outputs found

    Stability metrics for multi-source biomedical data based on simplicial projections from probability distribution distances

    Biomedical data may be composed of individuals generated from distinct, meaningful sources. Due to possible contextual biases in the processes that generate data, there may exist an undesirable and unexpected variability among the probability distribution functions (PDFs) of the source subsamples, which, when uncontrolled, may lead to inaccurate or unreproducible research results. Classical statistical methods may have difficulty uncovering such variability when dealing with multi-modal, multi-type, multi-variate data. This work proposes two metrics for the analysis of stability among multiple data sources, robust to the aforementioned conditions, and defined in the context of data quality assessment. Specifically, a global probabilistic deviation (GPD) metric and a source probabilistic outlyingness (SPO) metric are proposed. The first provides a bounded degree of the global multi-source variability, designed as an estimator equivalent to the notion of normalized standard deviation of PDFs. The second provides a bounded degree of the dissimilarity of each source to a latent central distribution. The metrics are based on the projection of a simplex geometrical structure constructed from the Jensen-Shannon distances among the sources' PDFs. The metrics were evaluated and behaved correctly on a simulated benchmark and on real multi-source biomedical data from the UCI Heart Disease dataset. 
The biomedical data quality assessment based on the proposed stability metrics may improve the efficiency and effectiveness of biomedical data exploitation and research. The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by own IBIME funds under the UPV project Servicio de evaluacion y rating de la calidad de repositorios de datos biomedicos [UPV-2014-872] and the EU FP7 Project Help4Mood - A Computational Distributed System to Support the Treatment of Patients with Major Depression [ICT-248765]. Sáez Silvestre, C.; Robles Viejo, M.; García Gómez, JM. (2014). Stability metrics for multi-source biomedical data based on simplicial projections from probability distribution distances. Statistical Methods in Medical Research. 1-25. https://doi.org/10.1177/0962280214545122

    Comparison of respiratory disease prevalence among voluntary monitoring systems for pig health and welfare in the UK

    Surveillance of animal diseases provides information essential for the protection of animal health and ultimately public health. The voluntary pig health schemes, implemented in the United Kingdom, are integrated systems which capture information on different macroscopic disease conditions detected in slaughtered pigs. Many of these conditions have been associated with a reduction in performance traits and consequent increases in production costs. The schemes are the Wholesome Pigs Scotland in Scotland, the BPEX Pig Health Scheme in England and Wales and the Pig Regen Ltd. health and welfare checks done in Northern Ireland. This report set out to compare the prevalence of four respiratory conditions (enzootic pneumonia-like lesions, pleurisy, pleuropneumonia lesions and abscesses in the lung) assessed by these three Pig Health Schemes. The seasonal variations and year trends associated with the conditions in each scheme are presented. The paper also highlights the differences in prevalence for each condition across these schemes and areas where further research is needed. A general increase in the prevalence of enzootic pneumonia-like lesions has been observed in Scotland, England and Wales since 2009, while a general decrease was observed in Northern Ireland over the years of the scheme. Pleurisy prevalence has increased since 2010 in all three schemes, whilst pleuropneumonia has been decreasing. Prevalence of abscesses in the lung has decreased in England, Wales and Northern Ireland but has increased in Scotland. This analysis highlights the value of surveillance schemes based on abattoir pathology monitoring of four respiratory lesions. The outputs at scheme level have significant value as indicators of endemic and emerging disease, and for producers and herd veterinarians in planning and evaluating herd health control programs when comparing individual farm results with national averages.

    Health services research in the public healthcare system in Hong Kong: An analysis of over 1 million antihypertensive prescriptions between 2004-2007 as an example of the potential and pitfalls of using routinely collected electronic patient data

    Objectives: Increasing use is being made of routinely collected electronic patient data in health services research. The aim of the present study was to evaluate the potential usefulness of a comprehensive database used routinely in the public healthcare system in Hong Kong, using antihypertensive drug prescriptions in primary care as an example. Methods: Data on antihypertensive drug prescriptions were retrieved from the electronic Clinical Management System (e-CMS) of all primary care clinics run by the Health Authority (HA) in the New Territory East (NTE) cluster of Hong Kong between January 2004 and June 2007. Information was also retrieved on patients’ demographic and socioeconomic characteristics, visit type (new or follow-up), and relevant diseases (International Classification of Primary Care, ICPC codes). Results: 1,096,282 visit episodes were accessed, representing 93,450 patients. Patients’ demographic and socio-economic details were recorded in all cases. Prescription details for anti-hypertensive drugs were missing in only 18 patients (0.02%). However, the ICPC code was missing for 36,409 patients (39%). Significant independent predictors of whether disease codes were applied included patient age > 70 years (OR 2.18), female gender (OR 1.20), district of residence (range of ORs in more rural districts: 0.32-0.41), type of clinic (OR in Family Medicine Specialist Clinics: 1.45) and type of visit (OR for follow-up visit: 2.39). In the 57,041 patients with an ICPC code, uncomplicated hypertension (ICPC K86) was recorded in 45,859 patients (82.1%). The characteristics of these patients were very similar to those of the non-coded group, suggesting that most non-coded patients on antihypertensive drugs are likely to have uncomplicated hypertension. Conclusion: The e-CMS database of the HA in Hong Kong varies in quality in terms of recorded information. 
Future health services research using demographic and prescription information is highly feasible, but some caution is warranted for disease-specific research dependent on ICPC codes. In the case of uncomplicated hypertension, future research on pharmaco-epidemiology (such as prescription patterns) and clinical issues (such as side-effects of medications on metabolic parameters) seems feasible given the large size of the data set and the comparability of coded and non-coded patients.

    E-Cadherin Destabilization Accounts for the Pathogenicity of Missense Mutations in Hereditary Diffuse Gastric Cancer

    E-cadherin is critical for the maintenance of tissue architecture due to its role in cell-cell adhesion. E-cadherin mutations are the genetic cause of Hereditary Diffuse Gastric Cancer (HDGC) and missense mutations represent a clinical burden, due to the uncertainty of their pathogenic role. In vitro and in vivo, most mutations lead to loss-of-function, although the causal factor is unknown for the majority. We hypothesized that destabilization could account for the pathogenicity of E-cadherin missense mutations in HDGC, and tested our hypothesis using in silico and in vitro tools. The FoldX algorithm was used to calculate the impact of each mutation on E-cadherin native-state stability, and the analysis was complemented with evolutionary conservation analysis by SIFT. Interestingly, HDGC patients harbouring germline E-cadherin destabilizing mutants present a younger age at diagnosis or death, suggesting that the loss of native-state stability of E-cadherin accounts for the disease phenotype. To elucidate the biological relevance of E-cadherin destabilization in HDGC, we investigated a group of newly identified HDGC-associated mutations (E185V, S232C and L583R), of which L583R is predicted to be destabilizing. We show that this mutation is not functional in vitro, exhibits a shorter half-life and is unable to mature, due to premature proteasome-dependent degradation, a phenotype reverted by stabilization with the artificial mutation L583I (structurally tolerated). Herein we report E-cadherin structural models suitable to predict the impact of the majority of cancer-associated missense mutations and we show that E-cadherin destabilization leads to loss-of-function in vitro and increased pathogenicity in vivo.

    Measurement of the inclusive and dijet cross-sections of b-jets in pp collisions at sqrt(s) = 7 TeV with the ATLAS detector

    The inclusive and dijet production cross-sections have been measured for jets containing b-hadrons (b-jets) in proton-proton collisions at a centre-of-mass energy of sqrt(s) = 7 TeV, using the ATLAS detector at the LHC. The measurements use data corresponding to an integrated luminosity of 34 pb^-1. The b-jets are identified using either a lifetime-based method, where secondary decay vertices of b-hadrons in jets are reconstructed using information from the tracking detectors, or a muon-based method where the presence of a muon is used to identify semileptonic decays of b-hadrons inside jets. The inclusive b-jet cross-section is measured as a function of transverse momentum in the range 20 < pT < 400 GeV and rapidity in the range |y| < 2.1. The bbbar-dijet cross-section is measured as a function of the dijet invariant mass in the range 110 < m_jj < 760 GeV, the azimuthal angle difference between the two jets and the angular variable chi in two dijet mass regions. The results are compared with next-to-leading-order QCD predictions. Good agreement is observed between the measured cross-sections and the predictions obtained using POWHEG + Pythia. MC@NLO + Herwig shows good agreement with the measured bbbar-dijet cross-section. However, it does not reproduce the measured inclusive cross-section well, particularly for central b-jets with large transverse momenta. Comment: 10 pages plus author list (21 pages total), 8 figures, 1 table, final version published in European Physical Journal.