30,289 research outputs found
Preterm Birth Prediction: Deriving Stable and Interpretable Rules from High Dimensional Data
Preterm births occur at an alarming rate of 10-15%. Preemies have a higher
risk of infant mortality, developmental retardation and long-term disabilities.
Predicting preterm birth is difficult, even for the most experienced
clinicians. The most well-designed clinical study thus far reaches a modest
sensitivity of 18.2-24.2% at specificity of 28.6-33.3%. We take a different
approach by exploiting databases of normal hospital operations. We aims are
twofold: (i) to derive an easy-to-use, interpretable prediction rule with
quantified uncertainties, and (ii) to construct accurate classifiers for
preterm birth prediction. Our approach is to automatically generate and select
from hundreds (if not thousands) of possible predictors using stability-aware
techniques. Derived from a large database of 15,814 women, our simplified
prediction rule with only 10 items has sensitivity of 62.3% at specificity of
81.5%.Comment: Presented at 2016 Machine Learning and Healthcare Conference (MLHC
2016), Los Angeles, C
Recommended from our members
Improving Assessment of Drug Safety Through Proteomics: Early Detection and Mechanistic Characterization of the Unforeseen Harmful Effects of Torcetrapib.
BackgroundEarly detection of adverse effects of novel therapies and understanding of their mechanisms could improve the safety and efficiency of drug development. We have retrospectively applied large-scale proteomics to blood samples from ILLUMINATE (Investigation of Lipid Level Management to Understand its Impact in Atherosclerotic Events), a trial of torcetrapib (a cholesterol ester transfer protein inhibitor), that involved 15 067 participants at high cardiovascular risk. ILLUMINATE was terminated at a median of 550 days because of significant absolute increases of 1.2% in cardiovascular events and 0.4% in mortality with torcetrapib. The aims of our analysis were to determine whether a proteomic analysis might reveal biological mechanisms responsible for these harmful effects and whether harmful effects of torcetrapib could have been detected early in the ILLUMINATE trial with proteomics.MethodsA nested case-control analysis of paired plasma samples at baseline and at 3 months was performed in 249 participants assigned to torcetrapib plus atorvastatin and 223 participants assigned to atorvastatin only. Within each treatment arm, cases with events were matched to controls 1:1. Main outcomes were a survey of 1129 proteins for discovery of biological pathways altered by torcetrapib and a 9-protein risk score validated to predict myocardial infarction, stroke, heart failure, or death.ResultsPlasma concentrations of 200 proteins changed significantly with torcetrapib. Their pathway analysis revealed unexpected and widespread changes in immune and inflammatory functions, as well as changes in endocrine systems, including in aldosterone function and glycemic control. At baseline, 9-protein risk scores were similar in the 2 treatment arms and higher in participants with subsequent events. At 3 months, the absolute 9-protein derived risk increased in the torcetrapib plus atorvastatin arm compared with the atorvastatin-only arm by 1.08% (P=0.0004). Thirty-seven proteins changed in the direction of increased risk of 49 proteins previously associated with cardiovascular and mortality risk.ConclusionsHeretofore unknown effects of torcetrapib were revealed in immune and inflammatory functions. A protein-based risk score predicted harm from torcetrapib within just 3 months. A protein-based risk assessment embedded within a large proteomic survey may prove to be useful in the evaluation of therapies to prevent harm to patients.Clinical trial registrationURL: https://www.clinicaltrials.gov. Unique identifier: NCT00134264
Optimal management of urinary tract infections in older people
Urinary tract infections (UTI) occur frequently in older people. Unfortunately, UTI is commonly overdiagnosed and overtreated on the basis of nonspecific clinical signs and symptoms. The diagnosis of a UTI in the older patient requires the presence of new urinary symptoms, with or without systemic symptoms. Urinalysis is commonly used to diagnose infection in this population, however, the evidence for its use is limited. There is overwhelming evidence that asymptomatic bacteriuria should not be treated. Catheter associated urinary tract infection accounts for a significant amount of hospital-associated infection. Indwelling urinary catheters should be avoided where possible and alternatives sought. The use of narrow spectrum antimicrobial agents for urinary tract infection is advocated. Local guidelines are now widely used to reflect local resistance patterns and available agents. Guidelines need to be updated to reflect changes in antimicrobial prescribing and a move from broad to narrow spectrum antimicrobials
The role of risk aversion in non-conscious decision making
To what extent can people choose advantageously without knowing why they are making those choices? This hotly debated question has capitalized on the Iowa Gambling Task (IGT), in which people often learn to choose advantageously without appearing to know why. However, because the IGT is unconstrained in many respects, this finding remains debated and other interpretations are possible (e.g., risk aversion, ambiguity aversion, limits of working memory, or insensitivity to reward/punishment can explain the finding of the IGT). Here we devised an improved variant of the IGT in which the deck-payoff contingency switches after subjects repeatedly choose from a good deck, offering the statistical power of repeated within-subject measures based on learning the reward contingencies associated with each deck. We found that participants exhibited low confidence in their choices, as probed with post-decision wagering, despite high accuracy in selecting advantageous decks in the task, which is putative evidence for non-conscious decision making. However, such a behavioral dissociation could also be explained by risk aversion, a tendency to avoid risky decisions under uncertainty. By explicitly measuring risk aversion for each individual, we predicted subjects’ post-decision wagering using Bayesian modeling. We found that risk aversion indeed does play a role, but that it did not explain the entire effect. Moreover, independently measured risk aversion was uncorrelated with risk aversion exhibited during our version of the IGT, raising the possibility that the latter risk aversion may be non-conscious. Our findings support the idea that people can make optimal choices without being fully aware of the basis of their decision. We suggest that non-conscious decision making may be mediated by emotional feelings of risk that are based on mechanisms distinct from those that support cognitive assessment of risk
Survival ensembles by the sum of pairwise differences with application to lung cancer microarray studies
Lung cancer is among the most common cancers in the United States, in terms
of incidence and mortality. In 2009, it is estimated that more than 150,000
deaths will result from lung cancer alone. Genetic information is an extremely
valuable data source in characterizing the personal nature of cancer. Over the
past several years, investigators have conducted numerous association studies
where intensive genetic data is collected on relatively few patients compared
to the numbers of gene predictors, with one scientific goal being to identify
genetic features associated with cancer recurrence or survival. In this note,
we propose high-dimensional survival analysis through a new application of
boosting, a powerful tool in machine learning. Our approach is based on an
accelerated lifetime model and minimizing the sum of pairwise differences in
residuals. We apply our method to a recent microarray study of lung
adenocarcinoma and find that our ensemble is composed of 19 genes, while a
proportional hazards (PH) ensemble is composed of nine genes, a proper subset
of the 19-gene panel. In one of our simulation scenarios, we demonstrate that
PH boosting in a misspecified model tends to underfit and ignore
moderately-sized covariate effects, on average. Diagnostic analyses suggest
that the PH assumption is not satisfied in the microarray data and may explain,
in part, the discrepancy in the sets of active coefficients. Our simulation
studies and comparative data analyses demonstrate how statistical learning by
PH models alone is insufficient.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS426 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
- …