22 research outputs found

    A novel and dynamic prediction engine for practicing precision medicine to prevent chemotherapy-induced nausea and vomiting

    Get PDF
    Cancer remains the second major cause of death in the United States over the last decade. Chemotherapy is a core component of nearly every cancer treatment plan. Chemotherapy-Induced Nausea and Vomiting (CINV) are the two most dreadful and unpleasant side-effects of chemotherapy for cancer patients. The consequences of CINV include: (1) impaired life quality, (2) poor social life, (3) burden on economy due to loss of workdays, (4) increased healthcare cost, and (5) denial of chemotherapy due to unendurable CINV. There are three clinical guidelines (ASCO, NCCN, and MASCC/ESMO) for the management of CINV. Several patient-specific factors affect the risk of CINV. However, none of the guidelines consider those factors. Not all of patients have the similar emetic risk of CINV. Despite the improvements in CINV management, as many as two-thirds of chemotherapy patients still experience some degree of CINV. As a result, physicians use their personal experiences for CINV treatment, which leads to inconsistent managements of CINV. The overall objective of this study is to improve the prevention of CINV using precise, personalized and evidence-based antiemetic treatment before chemotherapy. Physicians receive feedback about CINV risks of patients from a CINV decision support system based on patient-specific factors. This objective was achieved by accomplishing clinical innovations through the discovery of combined relationships of various patient-specific factors for causing CINV, and informatics innovations through the development of a novel, precise and dynamic prediction engine for practicing precision and personalized medicine in CINV prevention. The approach presented in this dissertation can be applied to any other clinical predictions

    Burden of disease scenarios for 204 countries and territories, 2022–2050: a forecasting analysis for the Global Burden of Disease Study 2021

    Get PDF
    Background: Future trends in disease burden and drivers of health are of great interest to policy makers and the public at large. This information can be used for policy and long-term health investment, planning, and prioritisation. We have expanded and improved upon previous forecasts produced as part of the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) and provide a reference forecast (the most likely future), and alternative scenarios assessing disease burden trajectories if selected sets of risk factors were eliminated from current levels by 2050. Methods: Using forecasts of major drivers of health such as the Socio-demographic Index (SDI; a composite measure of lag-distributed income per capita, mean years of education, and total fertility under 25 years of age) and the full set of risk factor exposures captured by GBD, we provide cause-specific forecasts of mortality, years of life lost (YLLs), years lived with disability (YLDs), and disability-adjusted life-years (DALYs) by age and sex from 2022 to 2050 for 204 countries and territories, 21 GBD regions, seven super-regions, and the world. All analyses were done at the cause-specific level so that only risk factors deemed causal by the GBD comparative risk assessment influenced future trajectories of mortality for each disease. Cause-specific mortality was modelled using mixed-effects models with SDI and time as the main covariates, and the combined impact of causal risk factors as an offset in the model. At the all-cause mortality level, we captured unexplained variation by modelling residuals with an autoregressive integrated moving average model with drift attenuation. These all-cause forecasts constrained the cause-specific forecasts at successively deeper levels of the GBD cause hierarchy using cascading mortality models, thus ensuring a robust estimate of cause-specific mortality. For non-fatal measures (eg, low back pain), incidence and prevalence were forecasted from mixed-effects models with SDI as the main covariate, and YLDs were computed from the resulting prevalence forecasts and average disability weights from GBD. Alternative future scenarios were constructed by replacing appropriate reference trajectories for risk factors with hypothetical trajectories of gradual elimination of risk factor exposure from current levels to 2050. The scenarios were constructed from various sets of risk factors: environmental risks (Safer Environment scenario), risks associated with communicable, maternal, neonatal, and nutritional diseases (CMNNs; Improved Childhood Nutrition and Vaccination scenario), risks associated with major non-communicable diseases (NCDs; Improved Behavioural and Metabolic Risks scenario), and the combined effects of these three scenarios. Using the Shared Socioeconomic Pathways climate scenarios SSP2-4.5 as reference and SSP1-1.9 as an optimistic alternative in the Safer Environment scenario, we accounted for climate change impact on health by using the most recent Intergovernmental Panel on Climate Change temperature forecasts and published trajectories of ambient air pollution for the same two scenarios. Life expectancy and healthy life expectancy were computed using standard methods. The forecasting framework includes computing the age-sex-specific future population for each location and separately for each scenario. 95% uncertainty intervals (UIs) for each individual future estimate were derived from the 2·5th and 97·5th percentiles of distributions generated from propagating 500 draws through the multistage computational pipeline. Findings: In the reference scenario forecast, global and super-regional life expectancy increased from 2022 to 2050, but improvement was at a slower pace than in the three decades preceding the COVID-19 pandemic (beginning in 2020). Gains in future life expectancy were forecasted to be greatest in super-regions with comparatively low life expectancies (such as sub-Saharan Africa) compared with super-regions with higher life expectancies (such as the high-income super-region), leading to a trend towards convergence in life expectancy across locations between now and 2050. At the super-region level, forecasted healthy life expectancy patterns were similar to those of life expectancies. Forecasts for the reference scenario found that health will improve in the coming decades, with all-cause age-standardised DALY rates decreasing in every GBD super-region. The total DALY burden measured in counts, however, will increase in every super-region, largely a function of population ageing and growth. We also forecasted that both DALY counts and age-standardised DALY rates will continue to shift from CMNNs to NCDs, with the most pronounced shifts occurring in sub-Saharan Africa (60·1% [95% UI 56·8–63·1] of DALYs were from CMNNs in 2022 compared with 35·8% [31·0–45·0] in 2050) and south Asia (31·7% [29·2–34·1] to 15·5% [13·7–17·5]). This shift is reflected in the leading global causes of DALYs, with the top four causes in 2050 being ischaemic heart disease, stroke, diabetes, and chronic obstructive pulmonary disease, compared with 2022, with ischaemic heart disease, neonatal disorders, stroke, and lower respiratory infections at the top. The global proportion of DALYs due to YLDs likewise increased from 33·8% (27·4–40·3) to 41·1% (33·9–48·1) from 2022 to 2050, demonstrating an important shift in overall disease burden towards morbidity and away from premature death. The largest shift of this kind was forecasted for sub-Saharan Africa, from 20·1% (15·6–25·3) of DALYs due to YLDs in 2022 to 35·6% (26·5–43·0) in 2050. In the assessment of alternative future scenarios, the combined effects of the scenarios (Safer Environment, Improved Childhood Nutrition and Vaccination, and Improved Behavioural and Metabolic Risks scenarios) demonstrated an important decrease in the global burden of DALYs in 2050 of 15·4% (13·5–17·5) compared with the reference scenario, with decreases across super-regions ranging from 10·4% (9·7–11·3) in the high-income super-region to 23·9% (20·7–27·3) in north Africa and the Middle East. The Safer Environment scenario had its largest decrease in sub-Saharan Africa (5·2% [3·5–6·8]), the Improved Behavioural and Metabolic Risks scenario in north Africa and the Middle East (23·2% [20·2–26·5]), and the Improved Nutrition and Vaccination scenario in sub-Saharan Africa (2·0% [–0·6 to 3·6]). Interpretation: Globally, life expectancy and age-standardised disease burden were forecasted to improve between 2022 and 2050, with the majority of the burden continuing to shift from CMNNs to NCDs. That said, continued progress on reducing the CMNN disease burden will be dependent on maintaining investment in and policy emphasis on CMNN disease prevention and treatment. Mostly due to growth and ageing of populations, the number of deaths and DALYs due to all causes combined will generally increase. By constructing alternative future scenarios wherein certain risk exposures are eliminated by 2050, we have shown that opportunities exist to substantially improve health outcomes in the future through concerted efforts to prevent exposure to well established risk factors and to expand access to key health interventions

    A Study on Pubmed Search Tag Usage Pattern: Association Rule Mining of a Full-day Pubmed Query Log

    Get PDF
    Abstract Background The practice of evidence-based medicine requires efficient biomedical literature search such as PubMed/MEDLINE. Retrieval performance relies highly on the efficient use of search field tags. The purpose of this study was to analyze PubMed log data in order to understand the usage pattern of search tags by the end user in PubMed/MEDLINE search. Methods A PubMed query log file was obtained from the National Library of Medicine containing anonymous user identification, timestamp, and query text. Inconsistent records were removed from the dataset and the search tags were extracted from the query texts. A total of 2,917,159 queries were selected for this study issued by a total of 613,061 users. The analysis of frequent co-occurrences and usage patterns of the search tags was conducted using an association mining algorithm. Results The percentage of search tag usage was low (11.38% of the total queries) and only 2.95% of queries contained two or more tags. Three out of four users used no search tag and about two-third of them issued less than four queries. Among the queries containing at least one tagged search term, the average number of search tags was almost half of the number of total search terms. Navigational search tags are more frequently used than informational search tags. While no strong association was observed between informational and navigational tags, six (out of 19) informational tags and six (out of 29) navigational tags showed strong associations in PubMed searches. Conclusions The low percentage of search tag usage implies that PubMed/MEDLINE users do not utilize the features of PubMed/MEDLINE widely or they are not aware of such features or solely depend on the high recall focused query translation by the PubMed’s Automatic Term Mapping. The users need further education and interactive search application for effective use of the search tags in order to fulfill their biomedical information needs from PubMed/MEDLINE.</p

    Evaluating the benefits of Octree-based indexing for LiDAR data

    No full text
    In recent years the geospatial domain has seen a significant increase in the availability of very large three dimensional (3D) point datasets. These datasets originate from a variety of sources, such as for example Light Detection and Ranging (LiDAR) or meteorological weather recordings. Increasingly, a desire within the geospatial community has been expressed to exploit these types of 3D point data in a meaningful engineering context that goes beyond mere visualization. However, current Spatial Information Systems (SISs) provide only limited support for vast 3D point datasets. Even those systems that advertise their support for in-built 3D data types provide very limited functionality to manipulate such data types. In particular, an effective means of indexing large 3D point datasets is yet missing, however it is crucial for effective analysis. Next to the large size of 3D point datasets they may also be information rich, for example they may contain color information or some other associated semantic. This paper presents an alternative spatial indexing technique, which is based on an octree data structure. We show that it outperforms R-tree index, while being able to group 3D points based on their attribute values at the same time. This paper presents an evaluation employing this octree spatial indexing technique and successfully highlights its advantages for sparse as well as uniformly distributed data on the basis of an extensive LiDAR dataset.Deposited by bulk impor

    Evaluating the Benefits of Octree-based Indexing for LiDAR Data

    No full text
    Very large true three-dimensional (3D) point datasets, as opposed to the previously common 2.5D data sets, are progressively more common nowadays, such as from Light Detection and Ranging (LiDAR). Increasingly, attempts are made to exploit these 3D point data sets beyond mere visualization. However, current Spatial Information Systems provide only limited 3D support. Even commercial systems advertising in-built, 3D data types provide only minimal functionality. Particularly, there is no effective means of indexing large 3D point datasets, which is crucial for efficient analysis and engineering usage. Also many datasets are information rich (e.g. contain color or some other associated semantic information), which has yet to be fully exploited. This paper presents the implementation in a commercial spatial database of a spatial indexing technique using an octree data structure and highlights its advantages for sparse, as well as uniformly distributed, aerial LiDAR data. The implementation outperforms an existing r-tree index within the software, and offers additional functionality of attribute-based 3D grouping

    A Smartphone-Based Decision Support Tool for Predicting Patients at Risk of Chemotherapy-Induced Nausea and Vomiting: Retrospective Study on App Development Using Decision Tree Induction

    No full text
    BackgroundChemotherapy-induced nausea and vomiting (CINV) are the two most frightful and unpleasant side effects of chemotherapy. CINV is accountable for poor treatment outcomes, treatment failure, or even death. It can affect patients' overall quality of life, leading to many social, economic, and clinical consequences. ObjectiveThis study compared the performances of different data mining models for predicting the risk of CINV among the patients and developed a smartphone app for clinical decision support to recommend the risk of CINV at the point of care. MethodsData were collected by retrospective record review from the electronic medical records used at the University of Missouri Ellis Fischel Cancer Center. Patients who received chemotherapy and standard antiemetics at the oncology outpatient service from June 1, 2010, to July 31, 2012, were included in the study. There were six independent data sets of patients based on emetogenicity (low, moderate, and high) and two phases of CINV (acute and delayed). A total of 14 risk factors of CINV were chosen for data mining. For our study, we used five popular data mining algorithms: (1) naive Bayes algorithm, (2) logistic regression classifier, (3) neural network, (4) support vector machine (using sequential minimal optimization), and (5) decision tree. Performance measures, such as accuracy, sensitivity, and specificity with 10-fold cross-validation, were used for model comparisons. A smartphone app called CINV Risk Prediction Application was developed using the ResearchKit in iOS utilizing the decision tree algorithm, which conforms to the criteria of explainable, usable, and actionable artificial intelligence. The app was created using both the bulk questionnaire approach and the adaptive approach. ResultsThe decision tree performed well in both phases of high emetogenic chemotherapies, with a significant margin compared to the other algorithms. The accuracy measure for the six patient groups ranged from 79.3% to 94.8%. The app was developed using the results from the decision tree because of its consistent performance and simple, explainable nature. The bulk questionnaire approach asks 14 questions in the smartphone app, while the adaptive approach can determine questions based on the previous questions' answers. The adaptive approach saves time and can be beneficial when used at the point of care. ConclusionsThis study solved a real clinical problem, and the solution can be used for personalized and precise evidence-based CINV management, leading to a better life quality for patients and reduced health care costs
    corecore