22,288 research outputs found

    Global Hypothesis Test to Compare the Predictive Values of Diagnostic Tests Subject to a Case-Control Design

    Get PDF
    Use of a case-control design to compare the accuracy of two binary diagnostic tests is frequent in clinical practice. This design consists of applying the two diagnostic tests to all of the individuals in a sample of those who have the disease and in another sample of those who do not have the disease. This manuscript studies the comparison of the predictive values of two diagnostic tests subject to a case-control design. A global hypothesis test, based on the chi-square distribution, is proposed to compare the predictive values simultaneously, as well as other alternative methods. The hypothesis tests studied require knowing the prevalence of the disease. Simulation experiments were carried out to study the type I errors and the powers of the hypothesis tests proposed, as well as to study the effect of a misspecification of the prevalence on the asymptotic behavior of the hypothesis tests and on the estimators of the predictive values. The proposed global hypothesis test was extended to the situation in which there are more than two diagnostic tests. The results have been applied to the diagnosis of coronary disease.Spanish Ministry of Economy, Grant Number MTM2016- 76938-PUniversity of Nouakchott Alaasriy

    Compbdt: an R program to compare two binary diagnostic tests subject to a paired design

    Get PDF
    I thank the Editor and the two referees for their helpful comments that improved the quality of the manuscript.Background: The comparison of the performance of two binary diagnostic tests is an important topic in Clinical Medicine. The most frequent type of sample design to compare two binary diagnostic tests is the paired design. This design consists of applying the two binary diagnostic tests to all of the individuals in a random sample, where the disease status of each individual is known through the application of a gold standard. This article presents an R program to compare parameters of two binary tests subject to a paired design. Results: The “compbdt” program estimates the sensitivity and the specificity, the likelihood ratios and the predictive values of each diagnostic test applying the confidence intervals with the best asymptotic performance. The program compares the sensitivities and specificities of the two diagnostic tests simultaneously, as well as the likelihood ratios and the predictive values, applying the global hypothesis tests with the best performance in terms of type I error and power. When the global hypothesis test is significant, the causes of the significance are investigated solving the individual hypothesis tests and applying the multiple comparison method of Holm. The most optimal confidence intervals are also calculated for the difference or ratio between the respective parameters. Based on the data observed in the sample, the program also estimates the probability of making a type II error if the null hypothesis is not rejected, or estimates the power if the if the alternative hypothesis is accepted. The “compbdt” program provides all the necessary results so that the researcher can easily interpret them. The estimation of the probability of making a type II error allows the researcher to decide about the reliability of the null hypothesis when this hypothesis is not rejected. The “compbdt” program has been applied to a real example on the diagnosis of coronary artery disease. Conclusions: The “compbdt” program is one which is easy to use and allows the researcher to compare the most important parameters of two binary tests subject to a paired design. The “compbdt” program is available as supplementary material.This research was supported by the Spanish Ministry of Economy, Grant Number MTM2016–76938-P

    Added predictive value of high-throughput molecular data to clinical data, and its validation

    Get PDF
    Hundreds of ''molecular signatures'' have been proposed in the literature to predict patient outcome in clinical settings from high-dimensional data, many of which eventually failed to get validated. Validation of such molecular research findings is thus becoming an increasingly important branch of clinical bioinformatics. Moreover, in practice well-known clinical predictors are often already available. From a statistical and bioinformatics point of view, poor attention has been given to the evaluation of the added predictive value of a molecular signature given that clinical predictors are available. This article reviews procedures that assess and validate the added predictive value of high-dimensional molecular data. It critically surveys various approaches for the construction of combined prediction models using both clinical and molecular data, for validating added predictive value based on independent data, and for assessing added predictive value using a single data set

    Comparison of Two or More Correlated AUCs in Paired Sample Design

    Get PDF
    Purpose of study Methods of comparing the accuracy of diagnostic tests are of increasing necessity in biomedical science. When a test result is measured on a continuous scale, an assessment of the performance of the overall value of the test can be made using the Receiver Operating Characteristic (ROC) curve. This curve describes the discrimination ability of a diagnosis test in terms of diseased subjects from non-diseased subjects. The area under the ROC curve (AUC) describes the probability that a randomly chosen diseased subject will have higher probability of having disease than a randomly chosen non-diseased subject. For comparing two or more diagnostic test results, the difference between AUCs is often used. This paper proposes a non-parametric alternative method of comparing two or more correlated area under the curve (AUCs) of diagnostic tests for paired sample data. This method is based on Chi-square test statistic. Methods This paper investigated both parametric and non-parametric methods of comparing the equality of two AUCs and proposed a Chi-square test for the comparison of two or more diagnostic test processes. The proposed method does not require the knowledge of true status of subjects or gold standard in evaluating the accuracy of tests unlike other existing methods. The proposed method is most suitable for paired sample design. It also offers reliable statistical inferences even in small sample problems and circumvent the difficulties of deriving the statistical moments of complex summary statistics as seen in the Delong method. The proposed method provides for further analysis to determine the possible reason for rejecting the null hypothesis of equality of AUCs. Results The proposed method when applied on real data, avoids the lengthy and more difficult procedures of estimating the variances of two AUCs as a way of determining if two AUCs differ significantly. The method is validated using the Cochran Q test and was shown to compare favourably. The proposed method recommended for comparing two or more correlated AUCs when the data is paired. It is simple and does not require prior knowledge of true status of subjects unlike other existing methods. Keywords: Chi-square test, Cochran Q test, cut-off value, area under the curve, receiver operating characteristic, Dichotomous data DOI: 10.7176/JNSR/9-9-06 Publication date:May 31st 201

    Confidence Intervals and Sample Size to Compare the Predictive Values of Two Diagnostic Tests

    Get PDF
    A binary diagnostic test is a medical test that is applied to an individual in order to determine the presence or the absence of a certain disease and whose result can be positive or negative. A positive result indicates the presence of the disease, and a negative result indicates the absence. Positive and negative predictive values represent the accuracy of a binary diagnostic test when it is applied to a cohort of individuals, and they are measures of the clinical accuracy of the binary diagnostic test. In this manuscript, we study the comparison of the positive (negative) predictive values of two binary diagnostic tests subject to a paired design through confidence intervals. We have studied confidence intervals for the difference and for the ratio of the two positive (negative) predictive values. Simulation experiments have been carried out to study the asymptotic behavior of the confidence intervals, giving some general rules for application. We also study a method to calculate the sample size to compare the parameters using confidence intervals. We have written a program in R to solve the problems studied in this manuscript. The results have been applied to the diagnosis of colorectal cancer

    Online Updating of Statistical Inference in the Big Data Setting

    Full text link
    We present statistical methods for big data arising from online analytical processing, where large amounts of data arrive in streams and require fast analysis without storage/access to the historical data. In particular, we develop iterative estimating algorithms and statistical inferences for linear models and estimating equations that update as new data arrive. These algorithms are computationally efficient, minimally storage-intensive, and allow for possible rank deficiencies in the subset design matrices due to rare-event covariates. Within the linear model setting, the proposed online-updating framework leads to predictive residual tests that can be used to assess the goodness-of-fit of the hypothesized model. We also propose a new online-updating estimator under the estimating equation setting. Theoretical properties of the goodness-of-fit tests and proposed estimators are examined in detail. In simulation studies and real data applications, our estimator compares favorably with competing approaches under the estimating equation setting.Comment: Submitted to Technometric

    An Interpretable Deep Hierarchical Semantic Convolutional Neural Network for Lung Nodule Malignancy Classification

    Full text link
    While deep learning methods are increasingly being applied to tasks such as computer-aided diagnosis, these models are difficult to interpret, do not incorporate prior domain knowledge, and are often considered as a "black-box." The lack of model interpretability hinders them from being fully understood by target users such as radiologists. In this paper, we present a novel interpretable deep hierarchical semantic convolutional neural network (HSCNN) to predict whether a given pulmonary nodule observed on a computed tomography (CT) scan is malignant. Our network provides two levels of output: 1) low-level radiologist semantic features, and 2) a high-level malignancy prediction score. The low-level semantic outputs quantify the diagnostic features used by radiologists and serve to explain how the model interprets the images in an expert-driven manner. The information from these low-level tasks, along with the representations learned by the convolutional layers, are then combined and used to infer the high-level task of predicting nodule malignancy. This unified architecture is trained by optimizing a global loss function including both low- and high-level tasks, thereby learning all the parameters within a joint framework. Our experimental results using the Lung Image Database Consortium (LIDC) show that the proposed method not only produces interpretable lung cancer predictions but also achieves significantly better results compared to common 3D CNN approaches

    Generalization of Kullback-Leibler Divergence for Multi-Stage Diseases: Application to Diagnostic Test Accuracy and Optimal Cut-Points Selection Criterion

    Get PDF
    The Kullback-Leibler divergence (KL), which captures the disparity between two distributions, has been considered as a measure for determining the diagnostic performance of an ordinal diagnostic test. This study applies KL and further generalizes it to comprehensively measure the diagnostic accuracy test for multi-stage (K \u3e 2) diseases, named generalized total Kullback-Leibler divergence (GTKL). Also, GTKL is proposed as an optimal cut-points selection criterion for discriminating subjects among different disease stages. Moreover, the study investigates a variety of applications of GTKL on measuring the rule-in/out potentials in the single-stage and multi-stage levels. Intensive simulation studies are conducted to compare the performance of GTKL and other diagnostic accuracy measures, such as generalized Youden index (GYI), hypervolume under the manifold (HUM), and maximum absolute determinant (MADET). Furthermore, a comprehensive analysis of a real dataset is performed to illustrate the application of the proposed measure

    Globalization of Corporate Covernance: The American Influence on Dismissal Performance Sensitivity of European CEOs

    Get PDF
    This study examines how globalization of corporate governance practices influence the risk of European CEOs being dismissed. We argue that the harsh monitoring of the American corporate governance system spills over to the rest of the world as a result of this globalization. We focus on direct and indirect American influence on the dismissal performance sensitivity among the 250 largest European publicly listed firms. The indirect influence is assumed to materialize via European firms cross-listing on U.S. exchanges, whereas the direct influence is assumed to appear as a result of European firms hiring of American independent board members. Both sources of influence are hypothesized to result in increased dismissal performance sensitivity. The empirical results show a significant increase in the dismissal sensitivity in poorly performing companies with American board membership whereas no significant increase is found from cross-listing in the U.S.CEO dismissal; Performance sensitivity; Globalization; Corporate governance; Foreign board membership; Institutional contagion
    corecore