27 research outputs found

    Distributed learning on 20 000+ lung cancer patients - The Personal Health Train

    Get PDF
    Background and purpose Access to healthcare data is indispensable for scientific progress and innovation. Sharing healthcare data is time-consuming and notoriously difficult due to privacy and regulatory concerns. The Personal Health Train (PHT) provides a privacy-by-design infrastructure connecting FAIR (Findable, Accessible, Interoperable, Reusable) data sources and allows distributed data analysis and machine learning. Patient data never leaves a healthcare institute. Materials and methods Lung cancer patient-specific databases (tumor staging and post-treatment survival information) of oncology departments were translated according to a FAIR data model and stored locally in a graph database. Software was installed locally to enable deployment of distributed machine learning algorithms via a central server. Algorithms (MATLAB, code and documentation publicly available) are patient privacy-preserving as only summary statistics and regression coefficients are exchanged with the central server. A logistic regression model to predict post-treatment two-year survival was trained and evaluated by receiver operating characteristic curves (ROC), root mean square prediction error (RMSE) and calibration plots. Results In 4 months, we connected databases with 23 203 patient cases across 8 healthcare institutes in 5 countries (Amsterdam, Cardiff, Maastricht, Manchester, Nijmegen, Rome, Rotterdam, Shanghai) using the PHT. Summary statistics were computed across databases. A distributed logistic regression model predicting post-treatment two-year survival was trained on 14 810 patients treated between 1978 and 2011 and validated on 8 393 patients treated between 2012 and 2015. Conclusion The PHT infrastructure demonstrably overcomes patient privacy barriers to healthcare data sharing and enables fast data analyses across multiple institutes from different countries with different regulatory regimens. This infrastructure promotes global evidence-based medicine while prioritizing patient privacy

    The ALFAM2 database on ammonia emission from field-applied manure: Description and illustrative analysis

    Get PDF
    peer-reviewedAmmonia (NH3) emission from animal manure contributes to air pollution and ecosystem degradation, and the loss of reactive nitrogen (N) from agricultural systems. Estimates of NH3 emission are necessary for national inventories and nutrient management, and NH3 emission from field-applied manure has been measured in many studies over the past few decades. In this work, we facilitate the use of these data by collecting and organizing them in the ALFAM2 database. In this paper we describe the development of the database and summarise its contents, quantify effects of application methods and other variables on emission using a data subset, and discuss challenges for data analysis and model development. The database contains measurements of emission, manure and soil properties, weather, application technique, and other variables for 1895 plots from 22 research institutes in 12 countries. Data on five manure types (cattle, pig, mink, poultry, mixed, as well as sludge and “other”) applied to three types of crops (grass, small grains, maize, as well as stubble and bare soil) are included. Application methods represented in the database include broadcast, trailing hose, trailing shoe (narrow band application), and open slot injection. Cattle manure application to grassland was the most common combination, and analysis of this subset (with dry matter (DM) limited to <15%) was carried out using mixed- and fixed-effects models in order to quantify effects of management and environment on ammonia emission, and to highlight challenges for use of the database. Measured emission in this subset ranged from <1% to 130% of applied ammonia after 48 h. Results showed clear, albeit variable, reductions in NH3 emission due to trailing hose, trailing shoe, and open slot injection of slurry compared to broadcast application. There was evidence of positive effects of air temperature and wind speed on NH3 emission, and limited evidence of effects of slurry DM. However, random-effects coefficients for differences among research institutes were among the largest model coefficients, and showed a deviation from the mean response by more than 100% in some cases. The source of these institute differences could not be determined with certainty, but there is some evidence that they are related to differences in soils, or differences in application or measurement methods. The ALFAM2 database should be useful for development and evaluation of both emission factors and emission models, but users need to recognize the limitations caused by confounding variables, imbalance in the dataset, and dependence among observations from the same institute. Variation among measurements and in reported variables highlights the importance of international agreement on how NH3 emission should be measured, along with necessary types of supporting data and standard protocols for their measurement. Both are needed in order to produce more accurate and useful ammonia emission measurements. Expansion of the ALFAM2 database will continue, and readers are invited to contact the corresponding author for information on data submission. The latest version of the database is available at http://www.alfam.dk

    Poor outcome in hypoxic endometrial carcinoma is related to vascular density

    Get PDF
    Background Identification of endometrial carcinoma (EC) patients at high risk of recurrence is lacking. In this study, the prognostic role of hypoxia and angiogenesis was investigated in EC patients. Methods Tumour slides from EC patients were stained by immunofluorescence for carbonic anhydrase IX (CAIX) as hypoxic marker and CD34 for assessment of microvessel density (MVD). CAIX expression was determined in epithelial tumour cells, with a cut-off of 1%. MVD was assessed according to the Weidner method. Correlations with disease-specific survival (DSS), disease-free survival (DFS) and distant disease-free survival (DDFS) were calculated using Kaplan–Meier curves and Cox regression analysis. Results Sixty-three (16.4%) of 385 ECs showed positive CAIX expression with high vascular density. These ECs had a reduced DSS compared to tumours with either hypoxia or high vascular density (log-rank p = 0.002). Multivariable analysis showed that hypoxic tumours with high vascular density had a reduced DSS (hazard ratio [HR] 3.71, p = 0.002), DDFS (HR 2.68, p = 0.009) and a trend for reduced DFS (HR 1.87, p = 0.054). Conclusions This study has shown that adverse outcome in hypoxic ECs is seen in the presence of high vascular density, suggesting an important role of angiogenesis in the metastatic process of hypoxic EC. Differential adjuvant treatment might be indicated for these patients.publishedVersio

    YKL-40 tissue expression and plasma levels in patients with ovarian cancer

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>YKL-40 (chitinase-3-like-1) is a member of "mammalian chitinase-like proteins". The protein is expressed in many types of cancer cells and the highest plasma YKL-40 levels have been found in patients with metastatic disease, short recurrence/progression-free intervals, and short overall survival. The aim of the study was to determine the expression of YKL-40 in tumor tissue and plasma in patients with borderline ovarian tumor or epithelial ovarian cancer (OC), and investigate prognostic value of this marker.</p> <p>Methods</p> <p>YKL-40 protein expression was determined by immunohistochemistry in tissue arrays from 181 borderline tumors and 473 OC. Plasma YKL-40 was determined by ELISA in preoperative samples from 19 patients with borderline tumor and 76 OC patients.</p> <p>Results</p> <p>YKL-40 protein expression was found in cancer cells, tumor associated macrophages, neutrophils and mast cells. The tumor cell expression was higher in OC than in borderline tumors (p = 0.001), and associated with FIGO stage (p < 0.0001) and histological subtype (p = 0.0009). Positive YKL-40 expression (≥ 5% staining) was not associated with reduced survival. Plasma YKL-40 was also higher in patients with OC than in patients with borderline tumors (p < 0.0001), and it was positively correlated to serum CA-125 (p < 0.0001) and FIGO stage (p = 0.0001). Univariate Cox analysis of plasma YKL-40 showed association with overall survival (p < 0.0001). Multivariate Cox analysis, including plasma YKL-40, serum CA125, FIGO stage, age and radicality after primary surgery as variables, showed that elevated plasma YKL-40 was associated with a shorter survival (HR = 2.13, 95% CI: 1.40–3.25, p = 0.0004).</p> <p>Conclusion</p> <p>YKL-40 in OC tissue and plasma are related to stage and histology, but only plasma YKL-40 is a prognostic biomarker in patients with OC.</p

    In vivo Identification and Specificity assessment of mRNA markers of hypoxia in human and mouse tumors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Tumor hypoxia is linked to poor prognosis, but identification and quantification of tissue hypoxia remains a challenge. The hypoxia-specificity of HIF-1Îą target genes in vivo has been questioned due to the confounding influence of other microenvironmental abnormalities known to affect gene expression (e.g., low pH). Here we describe a new technique that by exploiting intratumoral oxygenation heterogeneity allows us to identify and objectively rank the most robust mRNA hypoxia biomarkers.</p> <p>Methods</p> <p>Mice carrying human (FaDu<sub>dd</sub>) or murine (SCCVII) tumors were injected with the PET hypoxia tracer FAZA. Four hours post-injection tumors were removed, frozen, and crushed into milligram-sized fragments, which were transferred individually to pre-weighed tubes containing RNAlater and then weighed. For each fragment radioactivity per tissue mass and expression patterns of selected mRNA biomarkers were analyzed and compared.</p> <p>Results</p> <p>In both tumour models, fragmentation into pieces weighing 10 to 60 mg resulted in tissue fragments with highly variable relative content of hypoxic cells as evidenced by an up to 13-fold variation in FAZA radioactivity per mass of tissue. Linear regression analysis comparing FAZA retention with patterns of gene expression in individual tissue fragments revealed that CA9, GLUT1 and LOX mRNA levels were equally and strongly correlated to hypoxic extent in FaDu<sub>dd</sub>. The same link between hypoxia and gene expression profile was observed for CA9 and GLUT1, but not LOX, in SCCVII tumors. Apparent in vivo hypoxia-specificity for other putative molecular markers of tissue hypoxia was considerably weaker.</p> <p>Conclusions</p> <p>The portrayed technique allows multiple pairwise measurements of mRNA transcript levels and extent of hypoxia in individual tumors at a smallest possible volumetric scale which (by limiting averaging effects inherent to whole-tumor analysis) strengthen the conclusiveness on true hypoxia-specificity of candidate genes while limiting the required number of tumors. Among tested genes, our study identified CA9, GLUT1 and possibly LOX as highly specific biomarkers of tumor hypoxia in vivo.</p
    corecore