187 research outputs found

    Combining evidence for association from transmission disequilibrium and case-control studies using single-nucleotide polymorphisms

    Get PDF
    The aim of the present analysis is to combine evidence for association from the two most commonly used designs in genetic association analysis, the case-control design and the transmission disequilibrium test (TDT) design. The cases here are affected offspring from nuclear families and are used in both the case-control and TDT designs. As a result, inference from these designs is not independent. We applied a simple logistic regression method for combining evidence for association from case-control and TDT designs to single-nucleotide polymorphism data purchased on a region on chromosome 3, replicate 1 of the Aipotu population. Combining the evidence from the case-control and TDT designs yielded a 5–10% reduction in the standard errors of the relative risk estimates. The authors did not know the results before the analyses were conducted

    Methods to test for association between a disease and a multi-allelic marker applied to a candidate region

    Get PDF
    We report the analysis results of the Genetic Analysis Workshop 14 simulated microsatellite marker dataset, using replicate 50 from the Danacaa population. We applied several methods for association analysis of multi-allelic markers to case-control data to study the association between Kofendrerd Personality Disorder and multi-allelic markers in a candidate region previously identified by the linkage analysis. Evidence for association was found for marker D03S0127 (p < 0.01). The analyses were done without any prior knowledge of the answers

    Discussion on the paper ‘Statistical contributions to bioinformatics: Design, modelling, structure learning and integration’ by Jeffrey S. Morris and Veerabhadran Baladandayuthapani

    Get PDF
    Bioinformatics is an important research area for statisticians. This discussion provides some additional topics to the paper, namely on statistical contributions to detect differential expressed genes, for protein structure prediction, and for the analysis of highly correlated features in Glycomics datasets

    Genomic prediction across populations, using pre-selected markers and differential weight models

    Get PDF
    Genomic prediction (GP) in numerically small breeds is limited due to the requirement for a large reference set. Across breed prediction has not been very successful either. Our objective was to test alternative models for across breed and multi-breed GP in a small Jersey population, utilizing prior information on marker causality. We used data on 596 Jersey bulls from new Zealand and 5503 Holstein bulls from the Netherlands, all of which had deregressed proofs for stature. Two sets of genotype data were used, one containing 357 potential causal markers identified from a multi-breed meta-GWAS on stature (top markers), while the other contained 48,912 markers on the custom 50k chip, excluding the top markers. We used models in which only one GRM (either top markers, 50k, or top plus 50k markers combined) was fitted, and models in which two GRMs (both the top and 50k) were fitted simultaneously, however with different variance components to weight the GRMs differently. Moreover, we estimated the genetic correlation(s) between the breeds (for each GRM) using a multi-trait GP model, which implicitly weights the contribution of one breed’s information to another. Across breed, we observed low accuracies of GP when the 50k markers were fitted alone (0.06) or when the top markers were added to 50k (0.15). Higher accuracy was obtained when only the top markers were fitted (0.21), whereas the highest accuracy was obtained when fitting 50k and top markers simultaneously as two independent GRMs (0.25). Multi-breed prediction outperformed both within and across breed prediction with accuracies ranging from 0.34 to 0.45, with the same trend as in across breed prediction. Based on our results, the best approach for across and multi-breed GP is to fit models that are able to isolate and differentially weight the most important markers for the trait. Keywords: Across breed genomic prediction, marker pre-selection, multi-trait model, sequence data

    Gene analysis for longitudinal family data using random-effects models

    Get PDF
    We have extended our recently developed 2-step approach for gene-based analysis to the family design and to the analysis of rare variants. The goal of this approach is to study the joint effect of multiple single-nucleotide polymorphisms that belong to a gene. First, the information in a gene is summarized by 2 variables, namely the empirical Bayes estimate capturing common variation and the number of rare variants. By using random effects for the common variants, our approach acknowledges the within-gene correlations. In the second step, the 2 summaries were included as covariates in linear mixed models. To test the null hypothesis of no association, a multivariate Wald test was applied. We analyzed the simulated data sets to assess the performance of the method. Then we applied the method to the real data set and identified a significant association between FRMD4B and diastolic blood pressure (p-value = 8.3 × 10(-12))

    Generalizing Terwilliger's likelihood approach: a new score statistic to test for genetic association

    Get PDF
    <p>Abstract</p> <p>Background:</p> <p>In this paper, we propose a one degree of freedom test for association between a candidate gene and a binary trait. This method is a generalization of Terwilliger's likelihood ratio statistic and is especially powerful for the situation of one associated haplotype. As an alternative to the likelihood ratio statistic, we derive a score statistic, which has a tractable expression. For haplotype analysis, we assume that phase is known.</p> <p>Results:</p> <p>By means of a simulation study, we compare the performance of the score statistic to Pearson's chi-square statistic and the likelihood ratio statistic proposed by Terwilliger. We illustrate the method on three candidate genes studied in the Leiden Thrombophilia Study.</p> <p>Conclusion:</p> <p>We conclude that the statistic follows a chi square distribution under the null hypothesis and that the score statistic is more powerful than Terwilliger's likelihood ratio statistic when the associated haplotype has frequency between 0.1 and 0.4 and has a small impact on the studied disorder. With regard to Pearson's chi-square statistic, the score statistic has more power when the associated haplotype has frequency above 0.2 and the number of variants is above five.</p

    Locally weighted transmission/disequilibrium test for genetic association analysis

    Get PDF
    The transmission/disequilibrium test statistic has been used for assessing genetic association in affected-parent trios. In the presence of multiple tightly linked marker loci where local dependency may exist, haplotypes are reconstructed statistically to estimate the joint effects of these markers. In this manuscript, we propose an alternative to the haplotype approach by taking a weighted average of multiple loci, where the weight is proportional to the product of (1-2X recombination fraction) and the linkage disequilibrium between markers. As an illustration, we applied the method to the simulated Aipotu data
    • …
    corecore