10 research outputs found

    MIBiG 3.0 : a community-driven effort to annotate experimentally validated biosynthetic gene clusters

    Get PDF
    With an ever-increasing amount of (meta)genomic data being deposited in sequence databases, (meta)genome mining for natural product biosynthetic pathways occupies a critical role in the discovery of novel pharmaceutical drugs, crop protection agents and biomaterials. The genes that encode these pathways are often organised into biosynthetic gene clusters (BGCs). In 2015, we defined the Minimum Information about a Biosynthetic Gene cluster (MIBiG): a standardised data format that describes the minimally required information to uniquely characterise a BGC. We simultaneously constructed an accompanying online database of BGCs, which has since been widely used by the community as a reference dataset for BGCs and was expanded to 2021 entries in 2019 (MIBiG 2.0). Here, we describe MIBiG 3.0, a database update comprising large-scale validation and re-annotation of existing entries and 661 new entries. Particular attention was paid to the annotation of compound structures and biological activities, as well as protein domain selectivities. Together, these new features keep the database up-to-date, and will provide new opportunities for the scientific community to use its freely available data, e.g. for the training of new machine learning models to predict sequence-structure-function relationships for diverse natural products. MIBiG 3.0 is accessible online at https://mibig.secondarymetabolites.org/

    Whole-genome sequencing reveals host factors underlying critical COVID-19

    Get PDF
    Critical COVID-19 is caused by immune-mediated inflammatory lung injury. Host genetic variation influences the development of illness requiring critical care1 or hospitalization2,3,4 after infection with SARS-CoV-2. The GenOMICC (Genetics of Mortality in Critical Care) study enables the comparison of genomes from individuals who are critically ill with those of population controls to find underlying disease mechanisms. Here we use whole-genome sequencing in 7,491 critically ill individuals compared with 48,400 controls to discover and replicate 23 independent variants that significantly predispose to critical COVID-19. We identify 16 new independent associations, including variants within genes that are involved in interferon signalling (IL10RB and PLSCR1), leucocyte differentiation (BCL11A) and blood-type antigen secretor status (FUT2). Using transcriptome-wide association and colocalization to infer the effect of gene expression on disease severity, we find evidence that implicates multiple genes—including reduced expression of a membrane flippase (ATP11A), and increased expression of a mucin (MUC1)—in critical disease. Mendelian randomization provides evidence in support of causal roles for myeloid cell adhesion molecules (SELE, ICAM5 and CD209) and the coagulation factor F8, all of which are potentially druggable targets. Our results are broadly consistent with a multi-component model of COVID-19 pathophysiology, in which at least two distinct mechanisms can predispose to life-threatening disease: failure to control viral replication; or an enhanced tendency towards pulmonary inflammation and intravascular coagulation. We show that comparison between cases of critical illness and population controls is highly efficient for the detection of therapeutically relevant mechanisms of disease

    The Harderian gland transcriptomes of Caraiba andreae, Cubophis cantherigerus and Tretanorhinus variabilis, three colubroid snakes from Cuba

    No full text
    The Harderian gland is a cephalic structure, widely distributed among vertebrates. In snakes, the Harderian gland is anatomically connected to the vomeronasal organ via the nasolacrimal duct, and in some species can be larger than the eyes. The function of the Harderian gland remains elusive, but it has been proposed to play a role in the production of saliva, pheromones, thermoregulatory lipids and growth factors, among others. Here, we have profiled the transcriptomes of the Harderian glands of three non-front-fanged colubroid snakes from Cuba: Caraiba andreae (Cuban Lesser Racer); Cubophis cantherigerus (Cuban Racer); and Tretanorhinus variabilis (Caribbean Water Snake), using Illumina HiSeq2000 100 bp paired-end. In addition to ribosomal and non-characterized proteins, the most abundant transcripts encode putative transport/binding, lipocalin/lipocalin-like, and bactericidal/permeability-increasing-like proteins. Transcripts coding for putative canonical toxins described in venomous snakes were also identified. This transcriptional profile suggests a more complex function than previously recognized for this enigmatic organ. © 2018DDP was supported by a PhD grant ( SFRH/BD/80592/2011 ) from the Portuguese Foundation for Science and Technology (FCT—Fundação para a Ciência e a Tecnologia, Portugal). GACh was also supported by a Postdoctoral grant ( SFRH/BPD/92978/2013 ) from the FCT . This study was funded in part by the projects PTDC/AAG-GLO/6887/2014 and by the Strategic Funding UID/Multi/04423/2013 through national funds provided by FCT and the European Regional Development Fund (ERDF) in the framework of the program PT2020 , by the European Structural and Investment Funds (ESIF) through the Competitiveness and Internationalization Operational Program — COMPETE 2020 and by National Funds through the FCT under the project PTDC/AAG-GLO/6887/2014 (POCI-01-0124-FEDER-016845), and by the Structured Programs of R&D&I INNOVMAR —Innovation and Sustainability in the Management and Exploitation of Marine Resources ( NORTE-01-0145-FEDER-000035 , Research Line NOVELMAR), and funded by the Northern Regional Operational Program ( NORTE2020 ) through the ERDF. Work performed at the Evolutionary and Translational Venomics Laboratory, Instituto de Biomedicina de Valencia (CSIC) was funded by grant BFU2013-42833-P from the Ministerio de Economía y Competitividad , Madrid, Spain (to JJC). We are grateful to: Tomás Michel Rodríguez Cabrera (Sociedad Cubana de Zoología) and Lázaro Cuellar Yanes (undergraduate Biology student from La Universidad de La Habana, Cuba) for collecting specimens; Bruno Reis (CIIMAR, FCUP, University of Porto) for his help supervising the total RNA extraction; Prof. Alan H. Savitzky (Utah State University, USA), for his useful comments and insights on the Harderian Gland; to Yudermys Moya Chaviano, for helping with the confection of Fig. 1 ; Filipe Silva and Emanuel Maldonado (CIIMAR, FCUP, University of Porto) for helping with the analysis of contig expression performed with the CLC Genomics Worbench 8.5.1, and with the confection of Supplementary Table S1, respectively. Appendix

    Changes in the microbiome composition in Churra sheep with a resistant phenotype to infection

    No full text
    Trabajo presentado al: COMBAR meeting (Combatting Anthelmintic Resistance in Ruminants). Atenas (Grecia). Febrero. 2022

    Evaluation of the Integrated Tuberculosis Research Program Sponsored by the Spanish society of pulmonology and thoracic surgery: 11 years on

    No full text

    Whole-genome sequencing reveals host factors underlying critical COVID-19

    No full text
    Altres ajuts: Department of Health and Social Care (DHSC); Illumina; LifeArc; Medical Research Council (MRC); UKRI; Sepsis Research (the Fiona Elizabeth Agnew Trust); the Intensive Care Society, Wellcome Trust Senior Research Fellowship (223164/Z/21/Z); BBSRC Institute Program Support Grant to the Roslin Institute (BBS/E/D/20002172, BBS/E/D/10002070, BBS/E/D/30002275); UKRI grants (MC_PC_20004, MC_PC_19025, MC_PC_1905, MRNO2995X/1); UK Research and Innovation (MC_PC_20029); the Wellcome PhD training fellowship for clinicians (204979/Z/16/Z); the Edinburgh Clinical Academic Track (ECAT) programme; the National Institute for Health Research, the Wellcome Trust; the MRC; Cancer Research UK; the DHSC; NHS England; the Smilow family; the National Center for Advancing Translational Sciences of the National Institutes of Health (CTSA award number UL1TR001878); the Perelman School of Medicine at the University of Pennsylvania; National Institute on Aging (NIA U01AG009740); the National Institute on Aging (RC2 AG036495, RC4 AG039029); the Common Fund of the Office of the Director of the National Institutes of Health; NCI; NHGRI; NHLBI; NIDA; NIMH; NINDS.Critical COVID-19 is caused by immune-mediated inflammatory lung injury. Host genetic variation influences the development of illness requiring critical care or hospitalization after infection with SARS-CoV-2. The GenOMICC (Genetics of Mortality in Critical Care) study enables the comparison of genomes from individuals who are critically ill with those of population controls to find underlying disease mechanisms. Here we use whole-genome sequencing in 7,491 critically ill individuals compared with 48,400 controls to discover and replicate 23 independent variants that significantly predispose to critical COVID-19. We identify 16 new independent associations, including variants within genes that are involved in interferon signalling (IL10RB and PLSCR1), leucocyte differentiation (BCL11A) and blood-type antigen secretor status (FUT2). Using transcriptome-wide association and colocalization to infer the effect of gene expression on disease severity, we find evidence that implicates multiple genes-including reduced expression of a membrane flippase (ATP11A), and increased expression of a mucin (MUC1)-in critical disease. Mendelian randomization provides evidence in support of causal roles for myeloid cell adhesion molecules (SELE, ICAM5 and CD209) and the coagulation factor F8, all of which are potentially druggable targets. Our results are broadly consistent with a multi-component model of COVID-19 pathophysiology, in which at least two distinct mechanisms can predispose to life-threatening disease: failure to control viral replication; or an enhanced tendency towards pulmonary inflammation and intravascular coagulation. We show that comparison between cases of critical illness and population controls is highly efficient for the detection of therapeutically relevant mechanisms of disease
    corecore