3,652 research outputs found

    How to Host a Data Competition: Statistical Advice for Design and Analysis of a Data Competition

    Full text link
    Data competitions rely on real-time leaderboards to rank competitor entries and stimulate algorithm improvement. While such competitions have become quite popular and prevalent, particularly in supervised learning formats, their implementations by the host are highly variable. Without careful planning, a supervised learning competition is vulnerable to overfitting, where the winning solutions are so closely tuned to the particular set of provided data that they cannot generalize to the underlying problem of interest to the host. This paper outlines some important considerations for strategically designing relevant and informative data sets to maximize the learning outcome from hosting a competition based on our experience. It also describes a post-competition analysis that enables robust and efficient assessment of the strengths and weaknesses of solutions from different competitors, as well as greater understanding of the regions of the input space that are well-solved. The post-competition analysis, which complements the leaderboard, uses exploratory data analysis and generalized linear models (GLMs). The GLMs not only expand the range of results we can explore, they also provide more detailed analysis of individual sub-questions including similarities and differences between algorithms across different types of scenarios, universally easy or hard regions of the input space, and different learning objectives. When coupled with a strategically planned data generation approach, the methods provide richer and more informative summaries to enhance the interpretation of results beyond just the rankings on the leaderboard. The methods are illustrated with a recently completed competition to evaluate algorithms capable of detecting, identifying, and locating radioactive materials in an urban environment.Comment: 36 page

    Interpretable Machine Learning Model for Clinical Decision Making

    Get PDF
    Despite machine learning models being increasingly used in medical decision-making and meeting classification predictive accuracy standards, they remain untrusted black-boxes due to decision-makers\u27 lack of insight into their complex logic. Therefore, it is necessary to develop interpretable machine learning models that will engender trust in the knowledge they generate and contribute to clinical decision-makers intention to adopt them in the field. The goal of this dissertation was to systematically investigate the applicability of interpretable model-agnostic methods to explain predictions of black-box machine learning models for medical decision-making. As proof of concept, this study addressed the problem of predicting the risk of emergency readmissions within 30 days of being discharged for heart failure patients. Using a benchmark data set, supervised classification models of differing complexity were trained to perform the prediction task. More specifically, Logistic Regression (LR), Random Forests (RF), Decision Trees (DT), and Gradient Boosting Machines (GBM) models were constructed using the Healthcare Cost and Utilization Project (HCUP) Nationwide Readmissions Database (NRD). The precision, recall, area under the ROC curve for each model were used to measure predictive accuracy. Local Interpretable Model-Agnostic Explanations (LIME) was used to generate explanations from the underlying trained models. LIME explanations were empirically evaluated using explanation stability and local fit (R2). The results demonstrated that local explanations generated by LIME created better estimates for Decision Trees (DT) classifiers

    Spatial prediction of rotational landslide using geographically weighted regression, logistic regression, and support vector machine models in Xing Guo area (China)

    Full text link
    © 2017 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. This study evaluated the geographically weighted regression (GWR) model for landslide susceptibility mapping in Xing Guo County, China. In this study, 16 conditioning factors, such as slope, aspect, altitude, topographic wetness index, stream power index, sediment transport index, soil, lithology, normalized difference vegetation index (NDVI), landuse, rainfall, distance to road, distance to river, distance to fault, plan curvature, and profile curvature, were analyzed. Chi-square feature selection method was adopted to compare the significance of each factor with landslide occurence. The GWR model was compared with two well-known models, namely, logistic regression (LR) and support vcector machine (SVM). Results of chi-square feature selection indicated that lithology and slope are the most influencial factors, whereas SPI was found statistically insignificant. Four landslide susceptibility maps were generated by GWR, SGD-LR, SGD-SVM, and SVM models. The GWR model exhibited the highest performance in terms of success rate and prediction accuracy, with values of 0.789 and 0.819, respectively. The SVM model exhibited slightly lower AUC values than that of the GWR model. Validation result of the four models indicates that GWR is a better model than other widely used models

    A novel preoperative model to predict 90-day surgical mortality in patients considered for renal cell carcinoma surgery

    Get PDF
    Introduction Surgical benefits for renal cell carcinoma must be weighed against competing causes of mortality, especially in the elderly patient population. We used a large cancer registry to evaluate the impact of patient and cancer-specific factors on 90-day mortality (90DM). A nomogram to predict the odds of short-term mortality was created. Materials and Methods The National Cancer Database was queried to identify all patients with clinically localized, nonmetastatic disease treated with partial or radical nephrectomy. Using a random sample of 60%, multiple logistic regression with 90DM outcomes were performed to identify preoperative variables associated with mortality. Variables included age, sex, race, co-morbidity score, tumor size, and presence of a thrombus. A nomogram was created and tested on the remaining 40% of patients to predict 90DM. Results 183,407 patients met inclusion criteria. Overall 90DM for the cohort was 1.9%. All preoperative variables significantly influenced the risk of 90DM. Patient age was by far the strongest predictor. Nomogram scores ranged from 0 to 12. Compared to patients with 0 to 1 points, those with 2 to 3 (odds ratio [OR] 2.89, 2.42–3.46; P 6 (OR 12.86, 10.83–15.27; P 80 years of age alone placed patients into the highest risk of surgical mortality. Conclusions Management of localized kidney cancer must consider competing causes of mortality, especially in elderly patients with multiple co-morbidities. We present a preoperative tool to calculate risk of surgical short-term mortality to aid surgeon–patient counseling

    Crowding Perception in a Tourist City: A Question of Preference

    Get PDF
    Two main topics are analysed in this paper: a crowding model for an urban destination is tested by the use of a binary logistic model in order to identify the variables influencing crowding perception; and the inherent negativity of the crowding concept, as is often assumed, is examined through association statistics. The results confirmed that personal and behavioural variables have a larger effect on the perception of crowding than use-level. Furthermore, the relationship between crowding and experience, while significantly negative, could only be found in respondents with a preference for low, and a perception of high, use levels, while for the majority of individuals the perception of a certain crowding level did not lead to a negative evaluation of the conditions. This proves that the concept of crowding cannot be assumed to be implicitly negative, and needs individual preferences to be fully understood.status: publishe
    • …
    corecore