14 research outputs found

    Deep conv-attention model for diagnosing left bundle branch block from 12-lead electrocardiograms

    Full text link
    Cardiac resynchronization therapy (CRT) is a treatment that is used to compensate for irregularities in the heartbeat. Studies have shown that this treatment is more effective in heart patients with left bundle branch block (LBBB) arrhythmia. Therefore, identifying this arrhythmia is an important initial step in determining whether or not to use CRT. On the other hand, traditional methods for detecting LBBB on electrocardiograms (ECG) are often associated with errors. Thus, there is a need for an accurate method to diagnose this arrhythmia from ECG data. Machine learning, as a new field of study, has helped to increase human systems' performance. Deep learning, as a newer subfield of machine learning, has more power to analyze data and increase systems accuracy. This study presents a deep learning model for the detection of LBBB arrhythmia from 12-lead ECG data. This model consists of 1D dilated convolutional layers. Attention mechanism has also been used to identify important input data features and classify inputs more accurately. The proposed model is trained and validated on a database containing 10344 12-lead ECG samples using the 10-fold cross-validation method. The final results obtained by the model on the 12-lead ECG data are as follows. Accuracy: 98.80+-0.08%, specificity: 99.33+-0.11 %, F1 score: 73.97+-1.8%, and area under the receiver operating characteristics curve (AUC): 0.875+-0.0192. These results indicate that the proposed model in this study can effectively diagnose LBBB with good efficiency and, if used in medical centers, will greatly help diagnose this arrhythmia and early treatment

    Predicting diabetes second-line therapy initiation in the Australian population via time span-guided neural attention network

    Get PDF
    Introduction The first line of treatment for people with Diabetes mellitus is metformin. However, over the course of the disease metformin may fail to achieve appropriate glycemic control, and a second-line therapy may become necessary. In this paper we introduce Tangle, a time span-guided neural attention model that can accurately and timely predict the upcoming need for a second-line diabetes therapy from administrative data in the Australian adult population. The method is suitable for designing automatic therapy review recommendations for patients and their providers without the need to collect clinical measures. Data We analyzed seven years of de-identified records (2008-2014) of the 10% publicly available linked sample of Medicare Benefits Schedule (MBS) and Pharmaceutical Benefits Scheme (PBS) electronic databases of Australia. Methods By design, Tangle inherits the representational power of pre-trained word embedding, such as GloVe, to encode sequences of claims with the related MBS codes. Moreover, the proposed attention mechanism natively exploits the information hidden in the time span between two successive claims (measured in number of days). We compared the proposed method against state-of-the-art sequence classification methods. Results Tangle outperforms state-of-the-art recurrent neural networks, including attention-based models. In particular, when the proposed time span-guided attention strategy is coupled with pre-trained embedding methods, the model performance reaches an Area Under the ROC Curve of 90%, an improvement of almost 10 percentage points over an attentionless recurrent architecture. Implementation Tangle is implemented in Python using Keras and it is hosted on GitHub at https://github. com/samuelefiorini/tangle

    RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation

    Full text link
    Retinal vessel segmentation is generally grounded in image-based datasets collected with bench-top devices. The static images naturally lose the dynamic characteristics of retina fluctuation, resulting in diminished dataset richness, and the usage of bench-top devices further restricts dataset scalability due to its limited accessibility. Considering these limitations, we introduce the first video-based retinal dataset by employing handheld devices for data acquisition. The dataset comprises 635 smartphone-based fundus videos collected from four different clinics, involving 415 patients from 50 to 75 years old. It delivers comprehensive and precise annotations of retinal structures in both spatial and temporal dimensions, aiming to advance the landscape of vasculature segmentation. Specifically, the dataset provides three levels of spatial annotations: binary vessel masks for overall retinal structure delineation, general vein-artery masks for distinguishing the vein and artery, and fine-grained vein-artery masks for further characterizing the granularities of each artery and vein. In addition, the dataset offers temporal annotations that capture the vessel pulsation characteristics, assisting in detecting ocular diseases that require fine-grained recognition of hemodynamic fluctuation. In application, our dataset exhibits a significant domain shift with respect to data captured by bench-top devices, thus posing great challenges to existing methods. In the experiments, we provide evaluation metrics and benchmark results on our dataset, reflecting both the potential and challenges it offers for vessel segmentation tasks. We hope this challenging dataset would significantly contribute to the development of eye disease diagnosis and early prevention

    Analysis of health trajectories using administrative data

    No full text
    This thesis presents studies of health trajectories using administrative data. In the first study, we propose a new method for clustering of Medicare Benefit Schedule (MBS) claim trajectories. In the second study, we examine the extent to which the adult Australian population on lipid-lowering medications receives the level of High-Density Lipoprotein-Cholesterol (HDL-C) testing recommended by national guidelines. Finally, we study mental health services and medications usage in the MBS and Pharmaceutical Benefit Scheme (PBS) administrative data for the purpose of finding groups of patients with similar utilisation patterns. For these studies, we analyse records from seven years (2008-2014) of the 10% publicly available sample of de-identified, individual level, linked MBS and PBS administrative data. In the first study, we apply a Hierarchical Deep Belief Networks (HDBN) to cluster individuals’ health trajectories in four types of services: general practitioner attendances, specialist attendances, pathology tests, and diagnostic imaging. In the second study, the PBS data is used to identify individuals on stable lipid-lowering medications. The MBS data is used to estimate the annual frequency of HDL-C testing. We develop a methodology to address the issue of “episode coning” in the MBS data, which causes an undercounting of pathology tests. We use a published figure on the proportion of unreported HDL-C tests to correct for the undercounting and estimate the probability that an HDL-C test is performed. The rate of HDL-C testing is then compared to national guidelines that people at high-risk for cardiovascular disease undergo annual testing, to determine appropriateness. For mental health study, we create individual level utilisation patterns describing the sequence of mental health services and medications, extracted from the MBS and PBS data, respectively. We propose an Extended Inter-Spike Interval (EISI) metric to estimate the pairwise distances between the individuals’ utilisation patterns. Then, we develop a split-and-merge Partitioning Around Medoids (PAM) algorithm to cluster the study population and discover “interesting” utilisation patterns. In order to better understand the extent to which particular personal characteristics impact an individual utilisation pattern, we perform descriptive and multivariate analyses with gender, age, state of residence, and concessional status as covariates. For the health trajectories clustering study, we applied the proposed HDBN algorithm to cluster one million health trajectories of the New South Wales patients with the age of 45-55 years extracted from the MBS data and detected 31 clusters. For the HDL-C testing study, we estimated that approximately 50% of the population on stable lipid-lowering medications did not receive any HDL-C test in each year. We also found that approximately 19% of the same population received two or more HDL-C tests a year. These levels of underutilisation and overutilisation have been changing at an average rate of 2% and -4% a year, respectively, since 2009. The yearly expenditure associated with test overutilisation was approximately A4.3Mduringthestudyperiod,whilethecostavertedbecauseoftestunderutilisationwasapproximatelyA4.3M during the study period, while the cost averted because of test underutilisation was approximately A11.3M a year. For the mental health study, after having excluded obvious and common utilisation patterns, we find that mental health patients can be grouped into 10 clusters with distinct and interpretable utilisation patterns. We find that patients differ in the composition of mental health services and medications and the length of use of those services. The largest cluster (27.1% of the study population) is composed of individuals who only visit general practitioners and take psycholeptics medications for a short period of time. The smallest cluster (4.4% of the study population) contains individuals that have occasional visits with general practitioners, and regularly utilise both psycholeptics and psychoanaleptics medications over long periods of time

    EISI: extended inter-spike interval for mental health patients clustering based on mental health services and medications utilisation

    No full text
    Mental health is vital in all human life stages, and managing mental healthcare service resources is crucial for providers. This paper presents a new method, called Extended Inter-Spike Interval (EISI), on identifying the patients with a similar utilisation of mental health services and medications. The EISI measures the distance between the utilisation patterns of the patients. Then, the pairwise distances are given to a developed split-and-merge Partitioning Around Medoids (PAM) clustering algorithm to identify the patients with similar utilisation patterns. To evaluate the proposed method, we use two years (2013–2014) of the 10% publicly available sample of the Australian Medicare Benefits Schedule (MBS) and Pharmaceutical Benefits Scheme (PBS) administrative data. Results show that mental health patients can be grouped into ten clusters with distinct and interpretable utilisations patterns. The largest cluster comprises individuals who only visit general practitioners and take psycholeptics medications for a short time. The smallest group contains occasional visits with general practitioners and regularly utilises psycholeptics and psychoanaleptics medications over long periods. The proposed method provides insights on whom to target and how to structure services for different groups of individuals with mental health conditions

    Video classification using deep autoencoder network

    No full text
    Abstract We present a deep learning framework for video classification applicable to face recognition and dynamic texture recognition. A Deep Autoencoder Network Template (DANT) is designed whose weights are initialized by conducting unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines. In order to obtain a class specific network and fine tune the weights for each class, the pre-initialized DANT is trained for each class of video sequences, separately. A majority voting technique based on the reconstruction error is employed for the classification task. The extensive evaluation and comparisons with state-of-the-art approaches on Honda/UCSD, DynTex, and YUPPEN databases demonstrate that the proposed method significantly improves the performance of dynamic texture classification

    Hospital Readmission and Length-of-Stay Prediction Using an Optimized Hybrid Deep Model

    No full text
    Hospital readmission and length-of-stay predictions provide information on how to manage hospital bed capacity and the number of required staff, especially during pandemics. We present a hybrid deep model called the Genetic Algorithm-Optimized Convolutional Neural Network (GAOCNN), with a unique preprocessing method to predict hospital readmission and the length of stay required for patients of various conditions. GAOCNN uses one-dimensional convolutional layers to predict hospital readmission and the length of stay. The parameters of the layers are optimized via a genetic algorithm. To show the performance of the proposed model in patients with various conditions, we evaluate the model under three healthcare datasets: the Diabetes 130-US hospitals dataset, the COVID-19 dataset, and the MIMIC-III dataset. The diabetes 130-US hospitals dataset has information on both readmission and the length of stay, while the COVID-19 and MIMIC-III datasets just include information on the length of stay. Experimental results show that the proposed model’s accuracy for hospital readmission was 97.2% for diabetic patients. Furthermore, the accuracy of the length-of-stay prediction was 89%, 99.4%, and 94.1% for the diabetic, COVID-19, and ICU patients, respectively. These results confirm the superiority of the proposed model compared to existing methods. Our findings offer a platform for managing the healthcare funds and resources for patients with various diseases

    Patterns and trends of potentially inappropriate high-density lipoprotein cholesterol testing in Australian adults at high risk of cardiovascular disease from 2008 to 2014: analysis of linked individual patient data from the Australian Medicare Benefits Schedule and Pharmaceutical Benefits Scheme.

    Get PDF
    Objectives We examine the extent to which the adult Australian population on lipid-lowering medications receives the level of high-density lipoprotein cholesterol (HDL-C) testing recommended by national guidelines. Data We analysed records from 7 years (2008–2014) of the 10% publicly available sample of deidentified, individual level, linked Medicare Benefits Schedule (MBS) and Pharmaceutical Benefits Scheme (PBS) electronic databases of Australia

    Ensemble Learning for Disease Prediction: A Review

    No full text
    Machine learning models are used to create and enhance various disease prediction frameworks. Ensemble learning is a machine learning technique that combines multiple classifiers to improve performance by making more accurate predictions than a single classifier. Although numerous studies have employed ensemble approaches for disease prediction, there is a lack of thorough assessment of commonly used ensemble approaches against highly researched diseases. Consequently, this study aims to identify significant trends in the performance accuracies of ensemble techniques (i.e., bagging, boosting, stacking, and voting) against five hugely researched diseases (i.e., diabetes, skin disease, kidney disease, liver disease, and heart conditions). Using a well-defined search strategy, we first identified 45 articles from the current literature that applied two or more of the four ensemble approaches to any of these five diseases and were published in 2016–2023. Although stacking has been used the fewest number of times (23) compared with bagging (41) and boosting (37), it showed the most accurate performance the most times (19 out of 23). The voting approach is the second-best ensemble approach, as revealed in this review. Stacking always revealed the most accurate performance in the reviewed articles for skin disease and diabetes. Bagging demonstrated the best performance for kidney disease (five out of six times) and boosting for liver and diabetes (four out of six times). The results show that stacking has demonstrated greater accuracy in disease prediction than the other three candidate algorithms. Our study also demonstrates variability in the perceived performance of different ensemble approaches against frequently used disease datasets. The findings of this work will assist researchers in better understanding current trends and hotspots in disease prediction models that employ ensemble learning, as well as in determining a more suitable ensemble model for predictive disease analytics. This article also discusses variability in the perceived performance of different ensemble approaches against frequently used disease datasets

    Fast COVID-19 versus H1N1 screening using Optimized Parallel Inception

    No full text
    COVID-19 and swine-origin influenza A (H1N1) are both pandemics that sparked significant concern worldwide. Since these two diseases have common symptoms, a fast COVID-19 versus H1N1 screening helps better manage patients at healthcare facilities. We present a novel deep model, called Optimized Parallel Inception, for fast screening of COVID-19 and H1N1 patients. We also present a Semi-supervised Generative Adversarial Network (SGAN) to address the problem related to the smaller size of the COVID-19 and H1N1 research data. To evaluate the proposed models, we have merged two separate COVID-19 and H1N1 data from different sources to build a new dataset. The created dataset includes 4,383 positive COVID-19 cases, 989 positive H1N1 cases, and 1,059 negative cases. We applied SGAN on this dataset to remove issues related to unequal class densities. The experimental results show that the proposed model's screening accuracy is 99.2% and 99.6% for COVID-19 and H1N1, respectively. According to our analysis, the most significant symptoms and underlying chronic diseases for COVID-19 versus H1N1 screening are dry cough, breathing problems, diabetes, and gastrointestinal
    corecore