140 research outputs found

    The modENCODE Data Coordination Center: lessons in harvesting comprehensive experimental details.

    The model organism Encyclopedia of DNA Elements (modENCODE) project is a National Human Genome Research Institute (NHGRI) initiative designed to characterize the genomes of Drosophila melanogaster and Caenorhabditis elegans. A Data Coordination Center (DCC) was created to collect, store and catalog modENCODE data. An effective DCC must gather, organize and provide all primary, interpreted and analyzed data, and ensure the community is supplied with the knowledge of the experimental conditions, protocols and verification checks used to generate each primary data set. We present here the design principles of the modENCODE DCC, and describe the ramifications of collecting thorough and deep metadata for describing experiments, including the use of a wiki for capturing protocol and reagent information, and the BIR-TAB specification for linking biological samples to experimental results. modENCODE data can be found at http://www.modencode.org.

    modMine: flexible access to modENCODE data.

    In an effort to comprehensively characterize the functional elements within the genomes of the important model organisms Drosophila melanogaster and Caenorhabditis elegans, the NHGRI model organism Encyclopaedia of DNA Elements (modENCODE) consortium has generated an enormous library of genomic data along with detailed, structured information on all aspects of the experiments. The modMine database (http://intermine.modencode.org) described here has been built by the modENCODE Data Coordination Center to allow the broader research community to (i) search for and download data sets of interest among the thousands generated by modENCODE; (ii) access the data in an integrated form together with non-modENCODE data sets; and (iii) facilitate fine-grained analysis of the above data. The sophisticated search features are possible because of the collection of extensive experimental metadata by the consortium. Interfaces are provided to allow both biologists and bioinformaticians to exploit these rich modENCODE data sets now available via modMine.

    Using symptom-based case predictions to identify host genetic factors that contribute to COVID-19 susceptibility

    Epidemiological and genetic studies on COVID-19 are currently hindered by inconsistent and limited testing policies to confirm SARS-CoV-2 infection. Recently, it was shown that it is possible to predict COVID-19 cases using cross-sectional self-reported disease-related symptoms. Here, we demonstrate that this COVID-19 prediction model has reasonable and consistent performance across multiple independent cohorts and that our attempt to improve upon this model did not result in improved predictions. Using the existing COVID-19 prediction model, we then conducted a GWAS on the predicted phenotype using a total of 1,865 predicted cases and 29,174 controls. While we did not find any common, large-effect variants that reached genome-wide significance, we did observe suggestive genetic associations at two SNPs (rs11844522, p = 1.9 × 10⁻⁷; rs5798227, p = 2.2 × 10⁻⁷). Explorative analyses furthermore suggest that genetic variants associated with other viral infectious diseases do not overlap with COVID-19 susceptibility and that severity of COVID-19 may have a different genetic architecture compared to COVID-19 susceptibility. This study represents a first effort that uses a symptom-based predicted phenotype as a proxy for COVID-19 in our pursuit of understanding the genetic susceptibility of the disease. We conclude that the inclusion of symptom-based predicted cases could be a useful strategy in a scenario of limited testing, either during the current COVID-19 pandemic or any future viral outbreak.

    CD44 Upregulation in E-Cadherin-Negative Esophageal Cancers Results in Cell Invasion

    E-cadherin is frequently lost during epithelial-mesenchymal transition and the progression of epithelial tumorigenesis. We found a marker of epithelial-mesenchymal transition, CD44, upregulated in response to functional loss of E-cadherin in esophageal cell lines and cancer. Loss of E-cadherin expression correlates with increased expression of the CD44 standard isoform. Using an organotypic reconstruct model, we show that increased CD44 expression in areas of cell invasion is associated with MMP-9 at the leading edge. Moreover, Activin A increases cell invasion through CD44 upregulation after E-cadherin loss. Taken together, our results provide functional evidence of CD44 upregulation in esophageal cancer invasion.

    Linking Human Diseases to Animal Models Using Ontology-Based Phenotype Annotation

    A novel method for quantifying the similarity between phenotypes by the use of ontologies can be used to search for candidate genes, pathway members, and human disease models on the basis of phenotypes alone.

    Exome-wide association study to identify rare variants influencing COVID-19 outcomes: Results from the Host Genetics Initiative

    Publisher Copyright: © 2022 Butler-Laporte et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75–10.05, p = 5.41 × 10⁻⁷). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights. Peer reviewed.

    An inclusive Research and Education Community (iREC) model to facilitate undergraduate science education reform

    Funding: This work was supported by Howard Hughes Medical Institute grants GT12052 (to DIH) and GT15338 (to MJG). Over the last two decades, there have been numerous initiatives to improve undergraduate student outcomes in STEM. One model for scalable reform is the inclusive Research Education Community (iREC). In an iREC, STEM faculty from colleges and universities across the nation are supported to adopt and sustainably implement course-based research, a form of science pedagogy that enhances student learning and persistence in science. In this study, we used pathway modeling to develop a qualitative description that explicates the HHMI Science Education Alliance (SEA) iREC as a model for facilitating the successful adoption and continued advancement of new curricular content and pedagogy. In particular, outcomes that faculty realize through their participation in the SEA iREC were identified, organized by time, and functionally linked. The resulting pathway model was then revised and refined based on several rounds of feedback from over 100 faculty members in the SEA iREC who participated in the study. Our results show that in an iREC, STEM faculty organized as a long-standing community of practice leverage one another, outside expertise, and data to adopt, implement, and iteratively advance their pedagogy. The opportunity to collaborate in this manner and, additionally, to be recognized for pedagogical contributions sustainably engages STEM faculty in the advancement of their pedagogy. Here, we present a detailed pathway model of SEA that, together with underpinning features of an iREC identified in this study, offers a framework to facilitate transformations in undergraduate science education. Peer reviewed.

    Genetic effects on gene expression across human tissues

    Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.