610 research outputs found

    Deep learning cardiac motion analysis for human survival prediction

    Get PDF
    Motion analysis is used in computer vision to understand the behaviour of moving objects in sequences of images. Optimising the interpretation of dynamic biological systems requires accurate and precise motion tracking as well as efficient representations of high-dimensional motion trajectories so that these can be used for prediction tasks. Here we use image sequences of the heart, acquired using cardiac magnetic resonance imaging, to create time-resolved three-dimensional segmentations using a fully convolutional network trained on anatomical shape priors. This dense motion model formed the input to a supervised denoising autoencoder (4Dsurvival), which is a hybrid network consisting of an autoencoder that learns a task-specific latent code representation trained on observed outcome data, yielding a latent representation optimised for survival prediction. To handle right-censored survival outcomes, our network used a Cox partial likelihood loss function. In a study of 302 patients the predictive accuracy (quantified by Harrell's C-index) was significantly higher (p < .0001) for our model C=0.73 (95%\% CI: 0.68 - 0.78) than the human benchmark of C=0.59 (95%\% CI: 0.53 - 0.65). This work demonstrates how a complex computer vision task using high-dimensional medical image data can efficiently predict human survival

    Genetic-Based Hypertension Subtype Identification Using Informative SNPs

    Get PDF
    In this work, we proposed a process to select informative genetic variants for identifying clinically meaningful subtypes of hypertensive patients. We studied 575 African American (AA) and 612 Caucasian hypertensive participants enrolled in the Hypertension Genetic Epidemiology Network (HyperGEN) study and analyzed each race-based group separately. All study participants underwent GWAS (Genome-Wide Association Studies) and echocardiography. We applied a variety of statistical methods and filtering criteria, including generalized linear models, F statistics, burden tests, deleterious variant filtering, and others to select the most informative hypertension-related genetic variants. We performed an unsupervised learning algorithm non-negative matrix factorization (NMF) to identify hypertension subtypes with similar genetic characteristics. Kruskal–Wallis tests were used to demonstrate the clinical meaningfulness of genetic-based hypertension subtypes. Two subgroups were identified for both African American and Caucasian HyperGEN participants. In both AAs and Caucasians, indices of cardiac mechanics differed significantly by hypertension subtypes. African Americans tend to have more genetic variants compared to Caucasians; therefore, using genetic information to distinguish the disease subtypes for this group of people is relatively challenging, but we were able to identify two subtypes whose cardiac mechanics have statistically different distributions using the proposed process. The research gives a promising direction in using statistical methods to select genetic information and identify subgroups of diseases, which may inform the development and trial of novel targeted therapies

    Machine learning based on biomarker profiles identifies distinct subgroups of heart failure with preserved ejection fraction

    Get PDF
    International audienceAims: The lack of effective therapies for patients with heart failure with preserved ejection fraction (HFpEF) is often ascribed to the heterogeneity of patients with HFpEF. We aimed to identify distinct pathophysiologic clusters of HFpEF based on circulating biomarkers.Methods and results: We performed an unsupervised cluster analysis using 363 biomarkers from 429 patient with HFpEF. Relative differences in expression profiles of the biomarkers between clusters were assessed and used for pathway over-representation analyses. We identified four distinct patients subgroups based on their biomarker profiles : cluster 1 with the highest prevalence of diabetes mellitus and renal disease; cluster 2 with oldest age and frequent age-related comorbidities; cluster 3 with youngest age, largest body size, least symptoms and lowest NT-proBNP levels; and cluster 4 with highest prevalence of ischemic aetiology, smoking and chronic lung disease, most symptoms, as well as highest NT-proBNP and troponin levels. Over a median follow-up of 21 months, the occurrence of death or HF hospitalization was highest in clusters 1 and 4 (62.1% and 62.8% respectively) and lowest in cluster 3 (25.6%). Pathway over-representation analyses revealed that the biomarker profile of patients in cluster 1 was associated with activation of inflammatory pathways while the biomarker profile of patients in cluster 4 was specifically associated with pathways implicated in cell proliferation regulation and cell survival.Conclusion: Unsupervised cluster analysis based on biomarker profiles identified mutually exclusive subgroups of patients with HFpEF with distinct biomarker profiles, clinical characteristics and outcomes, suggesting different underlying pathophysiological pathway

    Information Maximizing Component Analysis of Left Ventricular Remodeling Due to Myocardial Infarction

    Get PDF
    Background: Although adverse left ventricular shape changes (remodeling) after myocardial infarction (MI) are predictive of morbidity and mortality, current clinical assessment is limited to simple mass and volume measures, or dimension ratios such as length to width ratio. We hypothesized that information maximizing component analysis (IMCA), a supervised feature extraction method, can provide more efficient and sensitive indices of overall remodeling. Methods: IMCA was compared to linear discriminant analysis (LDA), both supervised methods, to extract the most discriminatory global shape changes associated with remodeling after MI. Finite element shape models from 300 patients with myocardial infarction from the DETERMINE study (age 31–86, mean age 63, 20 % women) were compared with 1991 asymptomatic cases from the MESA study (age 44–84, mean age 62, 52 % women) available from the Cardiac Atlas Project. IMCA and LDA were each used to identify a single mode of global remodeling best discriminating the two groups. Logistic regression was employed to determine the association between the remodeling index and MI. Goodness-of-fit results were compared against a baseline logistic model comprising standard clinical indices. Results: A single IMCA mode simultaneously describing end-diastolic and end-systolic shapes achieved best results (lowest Deviance, Akaike information criterion and Bayesian information criterion, and the largest area under the receiver-operating-characteristic curve). This mode provided a continuous scale where remodeling can be quantified and visualized, showing that MI patients tend to present larger size and more spherical shape, more bulging of the apex, and thinner wall thickness. Conclusions: IMCA enables better characterization of global remodeling than LDA, and can be used to quantify progression of disease and the effect of treatment. These data and results are available from the Cardiac Atlas Project (http://www.cardiacatlas.org)

    Identifying novel phenotypes of elevated left ventricular end diastolic pressure using hierarchical clustering of features derived from electromechanical waveform data

    Get PDF
    Introduction Elevated left ventricular end diastolic pressure (LVEDP) is a consequence of compromised left ventricular compliance and an important measure of myocardial dysfunction. An algorithm was developed to predict elevated LVEDP utilizing electro-mechanical (EM) waveform features. We examined the hierarchical clustering of selected features developed from these EM waveforms in order to identify important patient subgroups and assess their possible prognostic significance. Materials and methods Patients presenting with cardiovascular symptoms (N = 396) underwent EM data collection and direct LVEDP measurement by left heart catheterization. LVEDP was classified as non-elevated ( ≤ 12 mmHg) or elevated (≥25 mmHg). The 30 most contributive features to the algorithm output were extracted from EM data and input to an unsupervised hierarchical clustering algorithm. The resultant dendrogram was divided into five clusters, and patient metadata overlaid. Results The cluster with highest LVEDP (cluster 1) was most dissimilar from the lowest LVEDP cluster (cluster 5) in both clustering and with respect to clinical characteristics. In contrast to the cluster demonstrating the highest percentage of elevated LVEDP patients, the lowest was predominantly non-elevated LVEDP, younger, lower BMI, and males with a higher rate of significant coronary artery disease (CAD). The next adjacent cluster (cluster 2) to that of the highest LVEDP (cluster 1) had the second lowest LVEDP of all clusters. Cluster 2 differed from Cluster 1 primarily based on features extracted from the electrical data, and those that quantified predictability and variability of the signal. There was a low predictability and high variability in the highest LVEDP cluster 1, and the opposite in adjacent cluster 2. Conclusion This analysis identified subgroups of patients with varying degrees of LVEDP elevation based on waveform features. An approach to stratify movement between clusters and possible progression of myocardial dysfunction may include changes in features that differentiate clusters; specifically, reductions in electrical signal predictability and increases in variability. Identification of phenotypes of myocardial dysfunction evidenced by elevated LVEDP and knowledge of factors promoting transition to clusters with higher levels of left ventricular filling pressures could permit early risk stratification and improve patient selection for novel therapeutic interventions

    Automated deep phenotyping of the cardiovascular system using magnetic resonance imaging

    Get PDF
    Across a lifetime, the cardiovascular system must adapt to a great range of demands from the body. The individual changes in the cardiovascular system that occur in response to loading conditions are influenced by genetic susceptibility, and the pattern and extent of these changes have prognostic value. Brachial blood pressure (BP) and left ventricular ejection fraction (LVEF) are important biomarkers that capture this response, and their measurements are made at high resolution. Relatively, clinical analysis is crude, and may result in lost information and the introduction of noise. Digital information storage enables efficient extraction of information from a dataset, and this strategy may provide more precise and deeper measures to breakdown current phenotypes into their component parts. The aim of this thesis was to develop automated analysis of cardiovascular magnetic resonance (CMR) imaging for more detailed phenotyping, and apply these techniques for new biological insights into the cardiovascular response to different loading conditions. I therefore tested the feasibility and clinical utility of computational approaches for image and waveform analysis, recruiting and acquiring additional patient cohorts where necessary, and then applied these approaches prospectively to participants before and after six-months of exercise training for a first-time marathon. First, a multi-centre, multi-vendor, multi-field strength, multi-disease CMR resource of 110 patients undergoing repeat imaging in a short time-frame was assembled. The resource was used to assess whether automated analysis of LV structure and function is feasible on real-world data, and if it can improve upon human precision. This showed that clinicians can be confident in detecting a 9% change in EF or a 20g change in LV mass. This will be difficult to improve by clinicians because the greatest source of human error was attributable to the observer rather than modifiable factors. Having understood these errors, a convolutional neural network was trained on separate multi-centre data for automated analysis and was successfully generalizable to the real-world CMR data. Precision was similar to human analysis, and performance was 186 times faster. This real-world benchmarking resource has been made freely available (thevolumesresource.com). Precise automated segmentations were then used as a platform to delve further into the LV phenotype. Global LVEFs measured from CMR imaging in 116 patients with severe aortic stenosis were broken down into ~10 million regional measurements of structure and function, represented by computational three-dimensional LV models for each individual. A cardiac atlas approach was used to compile, label, segment and represent these data. Models were compared with healthy matched controls, and co-registered with follow-up one year after aortic valve replacement (AVR). This showed that there is a tendency to asymmetric septal hypertrophy in all patients with severe aortic stenosis (AS), rather than a characteristic specific to predisposed patients. This response to AS was more unfavourable in males than females (associated with higher NT-proBNP, and lower blood pressure), but was more modifiable with AVR. This was not detected using conventional analysis. Because cardiac function is coupled with the vasculature, a novel integrated assessment of the cardiovascular system was developed. Wave intensity theory was used to combine central blood pressure and CMR aortic blood flow-velocity waveforms to represent the interaction of the heart with the vessels in terms of traveling energy waves. This was performed and then validated in 206 individuals (the largest cohort to date), demonstrating inefficient ventriculo-arterial coupling in female sex and healthy ageing. CMR imaging was performed in 236 individuals before training for a first-time marathon and 138 individuals were followed-up after marathon completion. After training, systolic/diastolic blood pressure reduced by 4/3mmHg, descending aortic stiffness decreased by 16%, and ventriculo-arterial coupling improved by 14%. LV mass increased slightly, with a tendency to more symmetrical hypertrophy. The reduction in aortic stiffness was equivalent to a 4-year reduction in estimated biological aortic age, and the benefit was greater in older, male, and slower individuals. In conclusion, this thesis demonstrates that automating analysis of clinical cardiovascular phenotypes is precise with significant time-saving. Complex data that is usually discarded can be used efficiently to identify new biology. Deeper phenotypes developed in this work inform risk reduction behaviour in healthy individuals, and demonstrably deliver a more sensitive marker of LV remodelling, potentially enhancing risk prediction in severe aortic stenosis

    Applications of Machine Learning Methods in Health Outcomes Research: Heart Failure in Women

    Get PDF
    There is robust evidence that heart failure (HF) is associated with substantial mortality, morbidity, poor health-related quality of life, healthcare utilization, and economic burden. Previous research has revealed that there are sex differences in the epidemiology, etiology, and disease burden of HF. However, research on HF among women, especially postmenopausal women, is limited. To fill the knowledge gap, the three related aims of this dissertation were to: (1) identify knowledge gaps in HF research among women, especially postmenopausal women, using unsupervised machine learning methods and big data (i.e., articles published in PubMed); (2) identify emerging predictors (i.e., polypharmacy and some prescription medications) of incident HF among postmenopausal women using supervised machine learning methods; (3) identify leading predictors of HF-related emergency room use among postmenopausal women using supervised machine learning methods with data from a large commercial insurance claims database in the United States. This study utilized machine learning methods. In the first aim, non-negative matrix factorization algorithms were used to cluster HF articles based on the primary topic. Clusters were independently validated and labeled by three investigators familiar with HF research. The most understudied area among women was atrial fibrillation. Among postmenopausal women, the most understudied topic was stress-induced cardiomyopathy. For the second and third aims, a retrospective cohort design and Optum’s de-identified Clinformatics® Data Mart Database (Optum, Eden Prairie, MN), de-identified health insurance claims data, were used. In the second aim, multivariable logistic regression and three classification machine learning algorithms (cross-validated logistic regression (CVLR), random forest (RF), and eXtreme Gradient Boosting (XGBoost) algorithms) were used to identify predictors of incident HF among postmenopausal women. The associations of the leading predictors to incident HF were explored with an interpretable machine learning SHapley Additive exPlanations (SHAP) technique. The eight leading predictors of incident HF consistent across all models were: older age, arrhythmia, polypharmacy, Medicare, chronic obstructive pulmonary disease (COPD), coronary artery disease, hypertension, and chronic kidney disease. Some prescription medications such as sulfonylureas and antibiotics other than fluoroquinolones predicted incident HF in some machine learning algorithms. In the third aim, a random forest algorithm was used to identify predictors of HF-related emergency room use among postmenopausal women. Interpretable machine learning techniques were used to explain the association of leading predictors to HF-related emergency room use. Random forest algorithm had high predictive accuracy in the test dataset (Area Under the Curve: 94%, sensitivity: 93%, specificity: 77%, and accuracy: 0.81). We found that the number of HF-related emergency room visits at baseline, fragmented care, age, insurance type (Health Maintenance Organization), and coronary artery disease were the top five predictors of HF-related emergency room use among postmenopausal women. Partial dependence plots suggested positive associations of the top predictors with HF-related emergency room use. However, insurance type was found to be negatively associated with HF-related emergency room use. Findings from this dissertation suggest that machine learning algorithms can achieve comparable and better predictive accuracy compared to traditional statistical models
    • …
    corecore