23 research outputs found

    A method for comparing multiple imputation techniques: A case study on the U.S. national COVID cohort collaborative.

    Get PDF
    Healthcare datasets obtained from Electronic Health Records have proven to be extremely useful for assessing associations between patients’ predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases, whose removal may introduce severe bias. Several multiple imputation algorithms have been proposed to attempt to recover the missing information under an assumed missingness mechanism. Each algorithm presents strengths and weaknesses, and there is currently no consensus on which multiple imputation algorithm works best in a given scenario. Furthermore, the selection of each algorithm’s pa- rameters and data-related modeling choices are also both crucial and challenging

    Characterizing Long COVID: Deep Phenotype of a Complex Condition.

    Get PDF
    BACKGROUND: Numerous publications describe the clinical manifestations of post-acute sequelae of SARS-CoV-2 (PASC or long COVID ), but they are difficult to integrate because of heterogeneous methods and the lack of a standard for denoting the many phenotypic manifestations. Patient-led studies are of particular importance for understanding the natural history of COVID-19, but integration is hampered because they often use different terms to describe the same symptom or condition. This significant disparity in patient versus clinical characterization motivated the proposed ontological approach to specifying manifestations, which will improve capture and integration of future long COVID studies. METHODS: The Human Phenotype Ontology (HPO) is a widely used standard for exchange and analysis of phenotypic abnormalities in human disease but has not yet been applied to the analysis of COVID-19. FINDINGS: We identified 303 articles published before April 29, 2021, curated 59 relevant manuscripts that described clinical manifestations in 81 cohorts three weeks or more following acute COVID-19, and mapped 287 unique clinical findings to HPO terms. We present layperson synonyms and definitions that can be used to link patient self-report questionnaires to standard medical terminology. Long COVID clinical manifestations are not assessed consistently across studies, and most manifestations have been reported with a wide range of synonyms by different authors. Across at least 10 cohorts, authors reported 31 unique clinical features corresponding to HPO terms; the most commonly reported feature was Fatigue (median 45.1%) and the least commonly reported was Nausea (median 3.9%), but the reported percentages varied widely between studies. INTERPRETATION: Translating long COVID manifestations into computable HPO terms will improve analysis, data capture, and classification of long COVID patients. If researchers, clinicians, and patients share a common language, then studies can be compared/pooled more effectively. Furthermore, mapping lay terminology to HPO will help patients assist clinicians and researchers in creating phenotypic characterizations that are computationally accessible, thereby improving the stratification, diagnosis, and treatment of long COVID. FUNDING: U24TR002306; UL1TR001439; P30AG024832; GBMF4552; R01HG010067; UL1TR002535; K23HL128909; UL1TR002389; K99GM145411

    Modern American populism: Analyzing the economics behind the Silent Majority, the Tea Party and Trumpism

    Get PDF
    This article researches populism, more specifically, Modern American Populism (MAP), constructed of white, rural, and economically oppressed reactionarianism, which was borne out of the political upheaval of the 1960’s Civil Rights movement. The research looks to explain the causes of populism and what leads voters to support populist movements and politicians. The research focuses on economic anxiety as the main cause but also examines an alternative theory of racial resentment. In an effort to answer the question, what causes populist movements and motivations, I apply a research approach that utilizes qualitative and quantitative methods. There is an examination of literature that defines populism, its causes and a detailed discussion of the case studies, including the 1972 election of Richard Nixon; the Tea Party election of 2010; and the 2016 election of Donald Trump. In addition, statistical data analysis was run using American National Election Studies (ANES) surveys associated with each specific case study. These case studies were chosen because they most represent forms of populist movements in modern American history. While ample qualitative evidence suggested support for the hypothesis that economic anxiety is a necessary condition for populist voting patterns that elected Nixon, the Tea Party and Trump, the statistical data only supported the hypothesis in two cases, 2010 and 2016, with 1972 coming back inconclusive. The data also suggested that both economic anxiety and racial resentment played a role in 2010 and 2016, while having no significant effect in 1972 in either case. This suggests that further research needs to be conducted into additional populist case studies, as well as an examination into the role economic anxiety and economic crises play on racial resentment and racially motivated voting behavior

    Evaluation of vitamin D biosynthesis and pathway target genes reveals UGT2A1/2 and EGFR polymorphisms associated with epithelial ovarian cancer in African American Women.

    Get PDF
    An association between genetic variants in the vitamin D receptor (VDR) gene and epithelial ovarian cancer (EOC) was previously reported in women of African ancestry (AA). We sought to examine associations between genetic variants in VDR and additional genes from vitamin D biosynthesis and pathway targets (EGFR, UGT1A, UGT2A1/2, UGT2B, CYP3A4/5, CYP2R1, CYP27B1, CYP24A1, CYP11A1, and GC). Genotyping was performed using the custom-designed 533,631 SNP Illumina OncoArray with imputation to the 1,000 Genomes Phase 3 v5 reference set in 755 EOC cases, including 537 high-grade serous (HGSOC), and 1,235 controls. All subjects are of African ancestry (AA). Logistic regression was performed to estimate odds ratios (OR) and 95% confidence intervals (CI). We further evaluated statistical significance of selected SNPs using the Bayesian False Discovery Probability (BFDP). A significant association with EOC was identified in the UGT2A1/2 region for the SNP rs10017134 (per allele OR = 1.4, 95% CI = 1.2-1.7, P = 1.2 × 10-6 , BFDP = 0.02); and an association with HGSOC was identified in the EGFR region for the SNP rs114972508 (per allele OR = 2.3, 95% CI = 1.6-3.4, P = 1.6 × 10-5 , BFDP = 0.29) and in the UGT2A1/2 region again for rs1017134 (per allele OR = 1.4, 95% CI = 1.2-1.7, P = 2.3 × 10-5 , BFDP = 0.23). Genetic variants in the EGFR and UGT2A1/2 may increase susceptibility of EOC in AA women. Future studies to validate these findings are warranted. Alterations in EGFR and UGT2A1/2 could perturb enzyme efficacy, proliferation in ovaries, impact and mark susceptibility to EOC.Includes NIHR and CRUK

    A Methodological Framework for the Comparative Evaluation of Multiple Imputation Methods: Multiple Imputation of Race, Ethnicity and Body Mass Index in the U.S. National COVID Cohort Collaborative

    Full text link
    While electronic health records are a rich data source for biomedical research, these systems are not implemented uniformly across healthcare settings and significant data may be missing due to healthcare fragmentation and lack of interoperability between siloed electronic health records. Considering that the deletion of cases with missing data may introduce severe bias in the subsequent analysis, several authors prefer applying a multiple imputation strategy to recover the missing information. Unfortunately, although several literature works have documented promising results by using any of the different multiple imputation algorithms that are now freely available for research, there is no consensus on which MI algorithm works best. Beside the choice of the MI strategy, the choice of the imputation algorithm and its application settings are also both crucial and challenging. In this paper, inspired by the seminal works of Rubin and van Buuren, we propose a methodological framework that may be applied to evaluate and compare several multiple imputation techniques, with the aim to choose the most valid for computing inferences in a clinical research work. Our framework has been applied to validate, and extend on a larger cohort, the results we presented in a previous literature study, where we evaluated the influence of crucial patients' descriptors and COVID-19 severity in patients with type 2 diabetes mellitus whose data is provided by the National COVID Cohort Collaborative Enclave
    corecore