13,439 research outputs found

    Application of the matched nested case-control design to the secondary analysis of trial data

    Get PDF
    Background: A nested case-control study is an efficient design that can be embedded within an existing cohort study or randomised trial. It has a number of advantages compared to the conventional case-control design, and has the potential to answer important research questions using untapped prospectively collected data. Methods: We demonstrate the utility of the matched nested case-control design by applying it to a secondary analysis of the Abnormal Doppler Enteral Prescription Trial. We investigated the role of milk feed type and changes in milk feed type in the development of necrotising enterocolitis in a group of 398 high risk growth-restricted preterm infants. Results: Using matching, we were able to generate a comparable sample of controls selected from the same population as the cases. In contrast to the standard case-control design, exposure status was ascertained prior to the outcome event occurring and the comparison between the cases and matched controls could be made at the point at which the event occurred. This enabled us to reliably investigate the temporal relationship between feed type and necrotising enterocolitis. Conclusions: A matched nested case-control study can be used to identify credible associations in a secondary analysis of clinical trial data where the exposure of interest was not randomised, and has several advantages over a standard case-control design. This method offers the potential to make reliable inferences in scenarios where it would be unethical or impractical to perform a randomised clinical trial.</br

    Tests in a Case-Control Design Including Relatives

    Get PDF
    We present a new approach to handle dependencies within the general framework of case-control designs, illustrating our approach by a particular application from the field of genetic epidemiology. The method is derived for parent-offspring trios, which will later be relaxed to more general family structures. For applications in genetic epidemiology we consider tests on equality of allele frequencies among cases and controls utilizing well-known risk measures to test for independence of phenotype and genotype at the observed locus. These test statistics are derived as functions of the entries in the associated contingency table containing the numbers of the alleles under consideration in the case and the control group. We find the joint asymptotic distribution of these entries, which enables us to derive critical values for any test constructed on this basis. A simulation study reveals the finite sample behavior of our test statistics. --association tests,contingency tables,dependent data,risk measures

    Case-control design identifies ecological drivers of endemic coral diseases

    Get PDF
    Endemic disease transmission is an important ecological process that is challenging to study because of low occurrence rates. Here, we investigate the ecological drivers of two coral diseases-growth anomalies and tissue loss-affecting five coral species. We first show that a statistical framework called the case-control study design, commonly used in epidemiology but rarely applied to ecology, provided high predictive accuracy (67-82%) and disease detection rates (60-83%) compared with a traditional statistical approach that yielded high accuracy (98-100%) but low disease detection rates (0-17%). Using this framework, we found evidence that 1) larger corals have higher disease risk; 2) shallow reefs with low herbivorous fish abundance, limited water motion, and located adjacent to watersheds with high fertilizer and pesticide runoff promote low levels of growth anomalies, a chronic coral disease; and 3) wave exposure, stream exposure, depth, and low thermal stress are associated with tissue loss disease risk during interepidemic periods. Variation in risk factors across host-disease pairs suggests that either different pathogens cause the same gross lesions in different species or that the same disease may arise in different species under different ecological conditions

    Advantages of the nested case-control design in diagnostic research

    Get PDF
    Abstract Background Despite its benefits, it is uncommon to apply the nested case-control design in diagnostic research. We aim to show advantages of this design for diagnostic accuracy studies. Methods We used data from a full cross-sectional diagnostic study comprising a cohort of 1295 consecutive patients who were selected on their suspicion of having deep vein thrombosis (DVT). We draw nested case-control samples from the full study population with case:control ratios of 1:1, 1:2, 1:3 and 1:4 (per ratio 100 samples were taken). We calculated diagnostic accuracy estimates for two tests that are used to detect DVT in clinical practice. Results Estimates of diagnostic accuracy in the nested case-control samples were very similar to those in the full study population. For example, for each case:control ratio, the positive predictive value of the D-dimer test was 0.30 in the full study population and 0.30 in the nested case-control samples (median of the 100 samples). As expected, variability of the estimates decreased with increasing sample size. Conclusion Our findings support the view that the nested case-control study is a valid and efficient design for diagnostic studies and should also be (re)appraised in current guidelines on diagnostic accuracy research.</p

    Analysis of case-parent trios at a locus with a deletion allele: association of GSTM1 with autism

    Get PDF
    BACKGROUND: Certain loci on the human genome, such as glutathione S-transferase M1 (GSTM1), do not permit heterozygotes to be reliably determined by commonly used methods. Association of such a locus with a disease is therefore generally tested with a case-control design. When subjects have already been ascertained in a case-parent design however, the question arises as to whether the data can still be used to test disease association at such a locus. RESULTS: A likelihood ratio test was constructed that can be used with a case-parents design but has somewhat less power than a Pearson's chi-squared test that uses a case-control design. The test is illustrated on a novel dataset showing a genotype relative risk near 2 for the homozygous GSTM1 deletion genotype and autism. CONCLUSION: Although the case-control design will remain the mainstay for a locus with a deletion, the likelihood ratio test will be useful for such a locus analyzed as part of a larger case-parent study design. The likelihood ratio test has the advantage that it can incorporate complete and incomplete case-parent trios as well as independent cases and controls. Both analyses support (p = 0.046 for the proposed test, p = 0.028 for the case-control analysis) an association of the homozygous GSTM1 deletion genotype with autism

    Exact Logistic Regression for a Matched Pairs case-Control Design with Polytomous Exposure Variables

    Get PDF
    Logistic regression methods are useful in estimating odds ratios under matched pairs case-control designs when the exposure variable of interest is binary or polytomous in nature. Analysis is typically performed using large sample approximation techniques. When conducting the analysis with polytomous exposure variable, situations where the numbers of discordant pairs in the resulting cells are small or the data structure is sparse can be encountered. In such situations, the asymptotic method of analysis is questionable, thus an exact method of analysis may be more suitable. A method is presented that performs exact inference in the case of pair-wise matched case-control data with more than two unordered exposure categories using a distribution of conditional sufficient statistics of logistic model parameters

    Missing Heritability in the Tails of Quantitative Traits? A Simulation Study on the Impact of Slightly Altered True Genetic Models

    Get PDF
    Objective: Genome-wide association studies have identified robust associations between single nucleotide polymorphisms and complex traits. As the proportion of phenotypic variance explained is still limited for most of the traits, larger and larger meta-analyses are being conducted to detect additional associations. Here we investigate the impact of the study design and the underlying assumption about the true genetic effect in a bimodal mixture situation on the power to detect associations. Methods: We performed simulations of quantitative phenotypes analysed by standard linear regression and dichotomized case-control data sets from the extremes of the quantitative trait analysed by standard logistic regression. Results: Using linear regression, markers with an effect in the extremes of the traits were almost undetectable, whereas analysing extremes by case-control design had superior power even for much smaller sample sizes. Two real data examples are provided to support our theoretical findings and to explore our mixture and parameter assumption. Conclusions: Our findings support the idea to re-analyse the available meta-analysis data sets to detect new loci in the extremes. Moreover, our investigation offers an explanation for discrepant findings when analysing quantitative traits in the general population and in the extremes. Copyright (C) 2011 S. Karger AG, Base

    Bayesian Modeling and MCMC Computation in Linear Logistic Regression for Presence-only Data

    Full text link
    Presence-only data are referred to situations in which, given a censoring mechanism, a binary response can be observed only with respect to on outcome, usually called \textit{presence}. In this work we present a Bayesian approach to the problem of presence-only data based on a two levels scheme. A probability law and a case-control design are combined to handle the double source of uncertainty: one due to the censoring and one due to the sampling. We propose a new formalization for the logistic model with presence-only data that allows further insight into inferential issues related to the model. We concentrate on the case of the linear logistic regression and, in order to make inference on the parameters of interest, we present a Markov Chain Monte Carlo algorithm with data augmentation that does not require the a priori knowledge of the population prevalence. A simulation study concerning 24,000 simulated datasets related to different scenarios is presented comparing our proposal to optimal benchmarks.Comment: Affiliations: Fabio Divino - Division of Physics, Computer Science and Mathematics, University of Molise Giovanna jona Lasinio and Natalia Golini - Department of Statistical Sciences, University of Rome "La Sapienza" Antti Penttinen - Department of Mathematics and Statistics, University of Jyv\"{a}skyl\"{a} CONTACT: [email protected], [email protected]
    corecore