43 research outputs found

    SNP interaction pattern identifier (SIPI): an intensive search for SNP-SNP interaction patterns.

    Get PDF
    MOTIVATION: Testing SNP-SNP interactions is considered as a key for overcoming bottlenecks of genetic association studies. However, related statistical methods for testing SNP-SNP interactions are underdeveloped. RESULTS: We propose the SNP Interaction Pattern Identifier (SIPI), which tests 45 biologically meaningful interaction patterns for a binary outcome. SIPI takes non-hierarchical models, inheritance modes and mode coding direction into consideration. The simulation results show that SIPI has higher power than MDR (Multifactor Dimensionality Reduction), AA_Full, Geno_Full (full interaction model with additive or genotypic mode) and SNPassoc in detecting interactions. Applying SIPI to the prostate cancer PRACTICAL consortium data with approximately 21 000 patients, the four SNP pairs in EGFR-EGFR , EGFR-MMP16 and EGFR-CSF1 were found to be associated with prostate cancer aggressiveness with the exact or similar pattern in the discovery and validation sets. A similar match for external validation of SNP-SNP interaction studies is suggested. We demonstrated that SIPI not only searches for more meaningful interaction patterns but can also overcome the unstable nature of interaction patterns. AVAILABILITY AND IMPLEMENTATION: The SIPI software is freely available at http://publichealth.lsuhsc.edu/LinSoftware/ . CONTACT: [email protected]. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.This study was supported by the National Cancer Institute (R01CA128813, PI: Park, JY and R21CA202417, PI: Lin, HY)

    Reducing the environmental impact of surgery on a global scale: systematic review and co-prioritization with healthcare workers in 132 countries

    Get PDF
    Abstract Background Healthcare cannot achieve net-zero carbon without addressing operating theatres. The aim of this study was to prioritize feasible interventions to reduce the environmental impact of operating theatres. Methods This study adopted a four-phase Delphi consensus co-prioritization methodology. In phase 1, a systematic review of published interventions and global consultation of perioperative healthcare professionals were used to longlist interventions. In phase 2, iterative thematic analysis consolidated comparable interventions into a shortlist. In phase 3, the shortlist was co-prioritized based on patient and clinician views on acceptability, feasibility, and safety. In phase 4, ranked lists of interventions were presented by their relevance to high-income countries and low–middle-income countries. Results In phase 1, 43 interventions were identified, which had low uptake in practice according to 3042 professionals globally. In phase 2, a shortlist of 15 intervention domains was generated. In phase 3, interventions were deemed acceptable for more than 90 per cent of patients except for reducing general anaesthesia (84 per cent) and re-sterilization of ‘single-use’ consumables (86 per cent). In phase 4, the top three shortlisted interventions for high-income countries were: introducing recycling; reducing use of anaesthetic gases; and appropriate clinical waste processing. In phase 4, the top three shortlisted interventions for low–middle-income countries were: introducing reusable surgical devices; reducing use of consumables; and reducing the use of general anaesthesia. Conclusion This is a step toward environmentally sustainable operating environments with actionable interventions applicable to both high– and low–middle–income countries

    Longitudinal nominal data analysis using marginalized models

    No full text
    Recently, marginalized transition models have become popular for the analysis of longitudinal data. Heagerty (2002) and Lee and Daniels (2007) proposed marginalized transition models for the analysis of longitudinal binary data and ordinal data, respectively. In this paper, we extend their work to accommodate longitudinal nominal data using a Markovian dependence structure. A Fisher-scoring algorithm is developed for estimation. Methods are illustrated with a real dataset and are compared with other standard methods.

    Bayesian mediation analysis for time-to-event outcome: Investigating racial disparity in breast cancer survival

    No full text
    Mediation analysis is conducted to make inferences on the effects of mediators that intervene in the relationship between an exposure variable and an outcome. Bayesian mediation analysis (BMA) naturally considers the hierarchical structure of the effects from the exposure variable to mediators and then to the outcome. We propose three BMA methods on survival outcomes, where mediation effects are measured in terms of hazard rate, survival time, or log of survival time respectively. In addition, we allow setting a limited survival time in the time-to-event analysis. The methods are validated by comparing the estimation precision at different scenarios through simulations. The three methods all give effective estimates. Finally, the methods are applied to the Surveillance, Epidemiology, and End Results Program (SEER) supported special studies to explore the racial disparity in breast cancer survival. The included variable completely explained the observed racial disparities. We provide visual aids to help with the result interpretations.</p

    An Artificial Functional Family Filter in Homolog Searching in Next-generation Sequencing Metagenomics

    No full text
    <div><p>In functional metagenomics, BLAST homology search is a common method to classify metagenomic reads into protein/domain sequence families such as Clusters of Orthologous Groups of proteins (COGs) in order to quantify the abundance of each COG in the community. The resulting functional profile of the community is then used in downstream analysis to correlate the change in abundance to environmental perturbation, clinical variation, and so on. However, the short read length coupled with next-generation sequencing technologies poses a barrier in this approach, essentially because similarity significance cannot be discerned by searching with short reads. Consequently, artificial functional families are produced, in which those with a large number of reads assigned decreases the accuracy of functional profile dramatically. There is no method available to address this problem. We intended to fill this gap in this paper. We revealed that BLAST similarity scores of homologues for short reads from COG protein members coding sequences are distributed differently from the scores of those derived elsewhere. We showed that, by choosing an appropriate score cut-off, we are able to filter out most artificial families and simultaneously to preserve sufficient information in order to build the functional profile. We also showed that, by incorporated application of BLAST and RPS-BLAST, some artificial families with large read counts can be further identified after the score cutoff filtration. Evaluated on three experimental metagenomic datasets with different coverages, we found that the proposed method is robust against read coverage and consistently outperforms the other E-value cutoff methods currently used in literatures.</p> </div

    Influential artificial COGs in M3_2X detected by Step 1 and Step 2.

    No full text
    <p>Note: Two columns of read counts are obtained before score filtration (Step 1) and after score filtration (Step 2).</p

    Influential artificial COGs identified in <b>Step 2</b>.

    No full text
    <p>Influential artificial COGs identified in <b>Step 2</b>.</p
    corecore