
    A Data Mining Approach To identify Diabetes

    Mounting volumes of data have made traditional data analysis methods impractical. Data mining (DM) tools provide a useful alternative framework that addresses this problem. This study follows a DM technique to identify diabetic patients. We develop a model that clusters the diabetes patients of a large healthcare company into different subpopulations. Consequently, we show the value of applying a DM model to identify diabetic patients.

    A Comparative Analysis on the Evaluation of Classification Algorithms in the Prediction of Diabetes

    Data mining techniques are applied in many domains as a standard procedure for analyzing large volumes of available data and extracting useful information and knowledge to support major decision-making processes. Diabetes mellitus is a chronic, widespread, and deadly syndrome occurring all around the world. It is characterized by hyperglycemia caused by abnormalities in insulin secretion, which in turn result in an irregular rise in glucose level. In recent years, the impact of diabetes mellitus has increased greatly, especially in developing countries such as India, mainly because of irregular food habits and lifestyle. Thus, early diagnosis and classification of this deadly disease has become an active area of research in the last decade. Numerous clustering and classification techniques are available in the literature to visualize temporal data and identify trends for controlling diabetes mellitus. This work presents an experimental study of several algorithms that classify diabetes mellitus data effectively. The existing algorithms are analyzed thoroughly to identify their advantages and limitations, and their performance is assessed to determine the best approach.

    Identifying risk patterns for suicide attempts in individuals with diabetes: a data-driven approach using LASSO regression

    Diabetes is a major health concern in the United States, with 34.2 million Americans affected in 2020. Unfortunately, the risk of suicide is also elevated in individuals with diabetes, with around 90,000 people with diabetes committing suicide each year. People with type 1 diabetes are three to four times more likely to attempt suicide, and those with newly diagnosed type 2 diabetes are twice as likely to attempt suicide, compared to the general population. However, poor mental health comorbidity is still neglected, and more recommendations are needed to support people with diabetes. It is widely acknowledged that comorbid depression in people with diabetes raises the risk of suicide attempts. Previous studies have used logistic regression to identify risk factors for suicide attempts in individuals with diabetes. However, this technique can be prone to overfitting when the number of variables is high. To address this issue, we used the LASSO (Least Absolute Shrinkage and Selection Operator), a regularization technique, to reduce overfitting in a logistic regression model. It works by adding a penalty term, weighted by λ, to the log-likelihood function, which shrinks the coefficient estimates. This allows LASSO to act as a feature selection method, effectively setting the coefficients of the least informative variables to zero. Because few studies have focused on understanding the relationship between suicide attempts and diabetes, we used association rule mining (ARM), an explainable rule-based machine learning technique, for knowledge discovery to reveal previously unknown relationships between suicide attempts and diabetes. This approach has already proved useful in the medical field, where it has been applied to electronic health record (EHR) data to discover associations such as disease co-occurrences, drug-disease associations, and symptomatic patterns of disease.
However, no previous studies have used ARM to determine risk factors and predict suicide attempts in people with diabetes. The aim of this dissertation is to identify patterns of risk factors for suicide attempts in individuals with diabetes, with the long-term goal of developing a clinical decision support system that can be integrated into EHRs. This system would allow healthcare providers to identify patients with diabetes at high risk of suicide attempts and provide appropriate preventive measures during outpatient clinic visits. To achieve this goal, we have three specific aims: (1) to identify potential risk factors for suicide attempts in individuals with diabetes through a literature review; (2) to investigate risk factors for suicide attempts in individuals with diabetes using LASSO regression; (3) to identify risk patterns for suicide attempts in individuals with diabetes using association rule mining. In this dissertation, we reviewed the literature and compiled a list of data elements for suicide attempts in people with diabetes. We then retrieved data on patients with diabetes from Cerner Real-World Data™. LASSO regression was used for feature selection, and ARM was used to investigate the risk patterns. We discovered risk patterns that are understandable and practical for healthcare providers. The findings of this research can inform suicide prevention efforts for people with diabetes and contribute to improved mental health outcomes. Includes bibliographical references.
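
The shrinkage behaviour described in this abstract can be illustrated with a minimal sketch. It is not the dissertation's implementation: it assumes a mean negative log-likelihood loss, no intercept, a fixed step size, and a made-up six-row dataset, and it solves the L1-penalized problem with proximal gradient descent (ISTA). All function names and data values are hypothetical.

```python
import math

def soft_threshold(z, t):
    """Shrink z toward zero by t; anything within [-t, t] becomes exactly 0."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lasso_logistic(X, y, lam, step=1.0, iters=500):
    """L1-penalized logistic regression via proximal gradient descent.

    Minimizes mean negative log-likelihood + lam * sum(|beta_j|).
    No intercept is fitted (an assumption made to keep the sketch short).
    """
    n, d = len(X), len(X[0])
    beta = [0.0] * d
    for _ in range(iters):
        # Predicted probabilities under the current coefficients.
        probs = [sigmoid(sum(b * v for b, v in zip(beta, row))) for row in X]
        # Gradient of the mean negative log-likelihood.
        grad = [
            sum((p - t) * row[j] for p, t, row in zip(probs, y, X)) / n
            for j in range(d)
        ]
        # Gradient step followed by the L1 proximal (soft-threshold) step.
        beta = [soft_threshold(b - step * g, step * lam) for b, g in zip(beta, grad)]
    return beta

# Hypothetical data: feature 0 separates the classes, feature 1 is weak noise.
X = [[1, 0.1], [1, 0.1], [1, -0.1], [-1, -0.1], [-1, -0.1], [-1, 0.1]]
y = [1, 1, 1, 0, 0, 0]

beta = lasso_logistic(X, y, lam=0.2)
print(beta)  # the coefficient of the noise feature is exactly 0.0
```

The soft-threshold step is what turns the penalty into feature selection: a coefficient whose gradient never exceeds λ in magnitude is never moved away from zero, while an informative coefficient is kept but shrunk.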

    Literature-based discovery of diabetes- and ROS-related targets

    Background: Reactive oxygen species (ROS) are known mediators of cellular damage in multiple diseases including diabetic complications. Despite their importance, no comprehensive database is currently available for the genes associated with ROS. Methods: We present ROS- and diabetes-related targets (genes/proteins) collected from the biomedical literature through a text mining technology. A web-based literature mining tool, SciMiner, was applied to 1,154 biomedical papers indexed with diabetes and ROS by PubMed to identify relevant targets. Over-represented targets in the ROS-diabetes literature were obtained through comparisons against randomly selected literature. The expression levels of nine genes, selected from the top ranked ROS-diabetes set, were measured in the dorsal root ganglia (DRG) of diabetic and non-diabetic DBA/2J mice in order to evaluate the biological relevance of literature-derived targets in the pathogenesis of diabetic neuropathy. Results: SciMiner identified 1,026 ROS- and diabetes-related targets from the 1,154 biomedical papers (http://jdrf.neurology.med.umich.edu/ROSDiabetes/). Fifty-three targets were significantly over-represented in the ROS-diabetes literature compared to randomly selected literature. These over-represented targets included well-known members of the oxidative stress response including catalase, the NADPH oxidase family, and the superoxide dismutase family of proteins. Eight of the nine selected genes exhibited significant differential expression between diabetic and non-diabetic mice. For six genes, the direction of expression change in diabetes paralleled enhanced oxidative stress in the DRG.
Conclusions: Literature mining compiled ROS-diabetes related targets from the biomedical literature and led us to evaluate the biological relevance of selected targets in the pathogenesis of diabetic neuropathy.
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/1/1755-8794-3-49.xml
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/2/1755-8794-3-49-S7.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/3/1755-8794-3-49-S10.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/4/1755-8794-3-49-S8.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/5/1755-8794-3-49-S3.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/6/1755-8794-3-49-S1.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/7/1755-8794-3-49-S4.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/8/1755-8794-3-49-S2.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/9/1755-8794-3-49-S12.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/10/1755-8794-3-49-S11.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/11/1755-8794-3-49-S9.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/12/1755-8794-3-49-S5.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/13/1755-8794-3-49-S6.XLS
http://deepblue.lib.umich.edu/bitstream/2027.42/78315/14/1755-8794-3-49.pdf
Peer Reviewed
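
The comparison against randomly selected literature can be sketched as an enrichment test. The abstract does not name the statistical test used, so the one-sided hypergeometric (Fisher-style) tail below is an assumption, and the function name and all paper counts in the usage lines, apart from the 1,154 ROS-diabetes papers, are hypothetical.

```python
from math import comb

def enrichment_p_value(k, n, K, N):
    """One-sided hypergeometric tail P(X >= k).

    N: total papers (topic set plus random background set)
    K: papers in the total that mention the target
    n: papers in the topic set
    k: papers in the topic set that mention the target
    """
    upper = min(n, K)
    total = comb(N, n)
    # Sum the probabilities of seeing k or more mentions in the topic set.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / total

# Hypothetical counts: a target mentioned in 40 of 1,154 topic papers
# and in 5 of 1,154 randomly selected papers.
p = enrichment_p_value(k=40, n=1154, K=45, N=2308)
print(p < 0.05)
```

A small p-value suggests the target appears in the topic literature more often than chance would explain. In practice the p-values for many targets would also need a multiple-testing correction, which this sketch omits.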

    Identifying Patient Groups based on Frequent Patterns of Patient Samples

    Grouping patients meaningfully can give insights into the different types of patients, their needs, and their priorities. Finding meaningful groups is, however, very challenging, as background knowledge is often required to determine what a useful grouping is. In this paper we propose an approach that finds groups of patients based on a small sample of positive examples given by a domain expert, so it requires very limited effort from domain experts. The approach groups patients based on the activities and diagnostic/billing codes within their health pathways. To define such a grouping efficiently from the sample of patients, frequent patterns of activities are discovered and used to measure the similarity between the care pathways of other patients and those of the patients in the sample group. This approach results in an insightful definition of the group. The proposed approach is evaluated using several datasets obtained from a large university medical center. The evaluation shows F1-scores of around 0.7 for grouping kidney injury and around 0.6 for diabetes.
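
The idea of scoring patients against frequent patterns mined from an expert's sample can be sketched as follows. This is an illustration under stated assumptions, not the paper's algorithm: patterns are unordered activity sets of size one or two, similarity is the fraction of patterns a pathway contains, and the activity names, support threshold, and function names are all hypothetical.

```python
from itertools import combinations

def frequent_patterns(sample, min_support=0.6, max_size=2):
    """Activity sets found in at least min_support of the sample pathways."""
    counts = {}
    for pathway in sample:
        items = sorted(set(pathway))
        for size in range(1, max_size + 1):
            for combo in combinations(items, size):
                counts[combo] = counts.get(combo, 0) + 1
    threshold = min_support * len(sample)
    return {frozenset(c) for c, n in counts.items() if n >= threshold}

def similarity(pathway, patterns):
    """Fraction of the sample's frequent patterns contained in a pathway."""
    if not patterns:
        return 0.0
    present = set(pathway)
    return sum(1 for p in patterns if p <= present) / len(patterns)

# Hypothetical expert-provided sample of care pathways.
sample = [
    ["lab_glucose", "insulin_rx", "eye_exam"],
    ["lab_glucose", "insulin_rx"],
    ["lab_glucose", "insulin_rx", "foot_exam"],
]
patterns = frequent_patterns(sample)

# Score two unseen patients against the sample group.
print(similarity(["lab_glucose", "insulin_rx", "x_ray"], patterns))
print(similarity(["x_ray"], patterns))
```

Patients whose score exceeds a chosen cutoff would be assigned to the group. The mined patterns themselves double as a readable definition of the group, which matches the abstract's emphasis on an insightful group definition.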