    Comparative genomics of the major parasitic worms

    Parasitic nematodes (roundworms) and platyhelminths (flatworms) cause debilitating chronic infections of humans and animals, decimate crop production and are a major impediment to socioeconomic development. Here we report a broad comparative study of 81 genomes of parasitic and non-parasitic worms. We have identified gene family births and hundreds of expanded gene families at key nodes in the phylogeny that are relevant to parasitism. Examples include gene families that modulate host immune responses, enable parasite migration through host tissues or allow the parasite to feed. We reveal extensive lineage-specific differences in core metabolism and protein families historically targeted for drug development. From an in silico screen, we have identified and prioritized new potential drug targets and compounds for testing. This comparative genomics resource provides a much-needed boost for the research community to understand and combat parasitic worms.

    PDBe-KB: collaboratively defining the biological context of structural data

    The Protein Data Bank in Europe – Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive.

    Solution structure and DNA-binding properties of the C-terminal domain of UvrC from E. coli

    The C-terminal domain of the UvrC protein (UvrC CTD) is essential for 5′ incision in the prokaryotic nucleotide excision repair process. We have determined the three-dimensional structure of the UvrC CTD using heteronuclear NMR techniques. The structure shows two helix–hairpin–helix (HhH) motifs connected by a small connector helix. The UvrC CTD is shown to mediate structure-specific DNA binding. The domain binds to a single-stranded–double-stranded junction DNA, with a strong specificity towards looped duplex DNA that contains at least six unpaired bases per loop (‘bubble DNA’). Using chemical shift perturbation experiments, the DNA-binding surface is mapped to the first hairpin region encompassing the conserved glycine–valine–glycine residues followed by lysine–arginine–arginine, a positively charged surface patch and the second hairpin region consisting of glycine–isoleucine–serine. A model for the protein–DNA complex is proposed that accounts for this specificity.

    Crystal structure of NAD(+)-dependent DNA ligase: modular architecture and functional implications

    DNA ligases catalyze the crucial step of joining the breaks in duplex DNA during DNA replication, repair and recombination, utilizing either ATP or NAD(+) as a cofactor. Despite the difference in cofactor specificity and limited overall sequence similarity, the two classes of DNA ligase share basically the same catalytic mechanism. In this study, the crystal structure of an NAD(+)-dependent DNA ligase from Thermus filiformis, a 667 residue multidomain protein, has been determined by the multiwavelength anomalous diffraction (MAD) method. It reveals highly modular architecture and a unique circular arrangement of its four distinct domains. It also provides clues for protein flexibility and DNA-binding sites. A model for the multidomain ligase action involving large conformational changes is proposed.

    A comparative analysis of KMT2D missense variants in Kabuki syndrome, cancers and the general population

    Determining the clinical significance of germline and somatic KMT2D missense variants (MVs) in Kabuki syndrome (KS) and cancers can be challenging. We analysed 1920 distinct KMT2D MVs that included 1535 germline MVs in controls (Control-MVs), 584 somatic MVs in cancers (Cancer-MVs) and 201 MVs in individuals with KS (KS-MVs). The proportion of MVs likely to affect splicing was significantly higher for Cancer-MVs and KS-MVs than in Control-MVs (p = 0.000018). Our analysis identified significant clustering of Cancer-MVs and KS-MVs in the PHD#3 and #4, RING#4 and SET domains. Areas of enrichment restricted to just Cancer-MVs (FYR-C and between amino acids 3043–3248) or KS-MVs (coiled-coil#5, FYR-N and between amino acids 4995–5090) were also found. Cancer-MVs and KS-MVs tended to affect more conserved residues (lower BLOSUM scores, p < 0.001 and p = 0.007). KS-MVs are more likely to increase the energy for protein folding (higher ELASPIC ∆∆G scores, p = 0.03). Cancer-MVs are more likely to disrupt protein interactions (higher StructMAn scores, p = 0.019). We reclassify several presumed pathogenic MVs as benign or as variants of uncertain significance. We raise the possibility of as yet unrecognised ‘non-KS’ phenotype(s) associated with some germline pathogenic KMT2D MVs. Overall, this work provides insights into the disease mechanism of KMT2D variants and can be extended to other genes, mutations in which also cause developmental syndromes and cancer.