24 research outputs found
Beyond Volume: The Impact of Complex Healthcare Data on the Machine Learning Pipeline
From medical charts to national census, healthcare has traditionally operated
under a paper-based paradigm. However, the past decade has marked a long and
arduous transformation bringing healthcare into the digital age. Ranging from
electronic health records, to digitized imaging and laboratory reports, to
public health datasets, today, healthcare now generates an incredible amount of
digital information. Such a wealth of data presents an exciting opportunity for
integrated machine learning solutions to address problems across multiple
facets of healthcare practice and administration. Unfortunately, the ability to
derive accurate and informative insights requires more than the ability to
execute machine learning models. Rather, a deeper understanding of the data on
which the models are run is imperative for their success. While a significant
effort has been undertaken to develop models able to process the volume of data
obtained during the analysis of millions of digitalized patient records, it is
important to remember that volume represents only one aspect of the data. In
fact, drawing on data from an increasingly diverse set of sources, healthcare
data presents an incredibly complex set of attributes that must be accounted
for throughout the machine learning pipeline. This chapter focuses on
highlighting such challenges, and is broken down into three distinct
components, each representing a phase of the pipeline. We begin with attributes
of the data accounted for during preprocessing, then move to considerations
during model building, and end with challenges to the interpretation of model
output. For each component, we present a discussion around data as it relates
to the healthcare domain and offer insight into the challenges each may impose
on the efficiency of machine learning techniques.Comment: Healthcare Informatics, Machine Learning, Knowledge Discovery: 20
Pages, 1 Figur
Structural studies of Helicase NS3 variants from Hepatitis C virus genotype 3 in virological sustained responder and non-responder patients
<p>Abstract</p> <p>Background</p> <p>About 130 million people are infected with the hepatitis C virus (HCV) worldwide, but effective treatment options are not yet available. One of the most promising targets for antiviral therapy is nonstructural protein 3 (NS3). To identify possible changes in the structure of NS3 associated with virological sustained response or non-response of patients, a model was constructed for each helicase NS3 protein coding sequence. From this, the goal was to verify the interaction between helicases variants and their ligands.</p> <p>Findings</p> <p>Evidence was found that the NS3 helicase portion of non-responder patients contained substitutions in its ATP and RNA binding sites. K210E substitution can cause an imbalance in the distribution of loads, leading to a decrease in the number of ligations between the essential amino acids required for the hydrolysis of ATP. W501R substitution causes an imbalance in the distribution of loads, leading and forcing the RNA to interact with the amino acid Thr269, but not preventing binding of ribavirin inhibitor.</p> <p>Conclusions</p> <p>Useful information is provided on the genetic profiling of the HCV genotype 3, specifically the coding region of the NS3 protein, improving our understanding of the viral genome and the regions of its protein catalytic site.</p
Evaluation of appendicitis risk prediction models in adults with suspected appendicitis
Background
Appendicitis is the most common general surgical emergency worldwide, but its diagnosis remains challenging. The aim of this study was to determine whether existing risk prediction models can reliably identify patients presenting to hospital in the UK with acute right iliac fossa (RIF) pain who are at low risk of appendicitis.
Methods
A systematic search was completed to identify all existing appendicitis risk prediction models. Models were validated using UK data from an international prospective cohort study that captured consecutive patients aged 16–45 years presenting to hospital with acute RIF in March to June 2017. The main outcome was best achievable model specificity (proportion of patients who did not have appendicitis correctly classified as low risk) whilst maintaining a failure rate below 5 per cent (proportion of patients identified as low risk who actually had appendicitis).
Results
Some 5345 patients across 154 UK hospitals were identified, of which two‐thirds (3613 of 5345, 67·6 per cent) were women. Women were more than twice as likely to undergo surgery with removal of a histologically normal appendix (272 of 964, 28·2 per cent) than men (120 of 993, 12·1 per cent) (relative risk 2·33, 95 per cent c.i. 1·92 to 2·84; P < 0·001). Of 15 validated risk prediction models, the Adult Appendicitis Score performed best (cut‐off score 8 or less, specificity 63·1 per cent, failure rate 3·7 per cent). The Appendicitis Inflammatory Response Score performed best for men (cut‐off score 2 or less, specificity 24·7 per cent, failure rate 2·4 per cent).
Conclusion
Women in the UK had a disproportionate risk of admission without surgical intervention and had high rates of normal appendicectomy. Risk prediction models to support shared decision‐making by identifying adults in the UK at low risk of appendicitis were identified