30 research outputs found

    Protein signatures using electrostatic molecular surfaces in harmonic space

    Full text link
    We developed a novel method based on the Fourier analysis of protein molecular surfaces to speed up the analysis of the vast structural data generated in the post-genomic era. This method computes the power spectrum of surfaces of the molecular electrostatic potential, whose three-dimensional coordinates have been either experimentally or theoretically determined. Thus we achieve a reduction of the initial three-dimensional information on the molecular surface to the one-dimensional information on pairs of points at a fixed scale apart. Consequently, the similarity search in our method is computationally less demanding and significantly faster than shape comparison methods. As proof of principle, we applied our method to a training set of viral proteins that are involved in major diseases such as Hepatitis C, Dengue fever, Yellow fever, Bovine viral diarrhea and West Nile fever. The training set contains proteins of four different protein families, as well as a mammalian representative enzyme. We found that the power spectrum successfully assigns a unique signature to each protein included in our training set, thus providing a direct probe of functional similarity among proteins. The results agree with established biological data from conventional structural biochemistry analyses.Comment: 9 pages, 10 figures Published in PeerJ (2013), https://peerj.com/articles/185

    IMGT/mAb-KG: the knowledge graph for therapeutic monoclonal antibodies

    Get PDF
    IntroductionTherapeutic monoclonal antibodies (mAbs) have demonstrated promising outcomes in diverse clinical indications, including but not limited to graft rejection, cancer, and autoimmune diseases lately.Recognizing the crucial need for the scientific community to quickly and easily access dependable information on monoclonal antibodies (mAbs), IMGT®, the international ImMunoGeneTics information system®, provides a unique and invaluable resource: IMGT/mAb-DB, a comprehensive database of therapeutic mAbs, accessible via a user-friendly web interface. However, this approach restricts more sophisticated queries and segregates information from other databases.MethodsTo connect IMGT/mAb-DB with the rest of the IMGT databases, we created IMGT/mAb-KG, a knowledge graph for therapeutic monoclonal antibodies connected to IMGT structures and genomics databases. IMGT/mAb-KG is developed using the most effective methodologies and standards of semantic web and acquires data from IMGT/mAb-DB. Concerning interoperability, IMGT/mAb-KG reuses terms from biomedical resources and is connected to related resources.Results and discussionIn February 2024, IMGT/mAb-KG, encompassing a total of 139,629 triplets, provides access to 1,489 mAbs, approximately 500 targets, and over 500 clinical indications. It offers detailed insights into the mechanisms of action of mAbs, their construction, and their various products and associated studies. Linked to other resources such as Thera-SAbDab (Therapeutic Structural Antibody Database), PharmGKB (a comprehensive resource curating knowledge on the impact of genetic variation on drug response), PubMed, and HGNC (HUGO Gene Nomenclature Committee), IMGT/mAb-KG is an essential resource for mAb development. A user-friendly web interface facilitates the exploration and analyse of the content of IMGT/mAb-KG

    Inferred Allelic Variants of Immunoglobulin Receptor Genes: a system for their evaluation, documentation, and naming

    Get PDF
    Immunoglobulins or antibodies are the main effector molecules of the B-cell lineage and are encoded by hundreds of variable (V), diversity (D), and joining (J) germline genes, which recombine to generate enormous IG diversity. Recently, high-throughput adaptive immune receptor repertoire sequencing (AIRR-seq) of recombined V-(D)-J genes has offered unprecedented insights into the dynamics of IG repertoires in health and disease. Faithful biological interpretation of AIRR-seq studies depends upon the annotation of raw AIRR-seq data, using reference germline gene databases to identify the germline genes within each rearrangement. Existing reference databases are incomplete, as shown by recent AIRR-seq studies that have inferred the existence of many previously unreported polymorphisms. Completing the documentation of genetic variation in germline gene databases is therefore of crucial importance. Lymphocyte receptor genes and alleles are currently assigned by the Immunoglobulins, T cell Receptors and Major Histocompatibility Nomenclature Subcommittee of the International Union of Immunological Societies (IUIS) and managed in IMGT®, the international ImMunoGeneTics information system® (IMGT). In 2017, the IMGT Group reached agreement with a group of AIRR-seq researchers on the principles of a streamlined process for identifying and naming inferred allelic sequences, for their incorporation into IMGT®. These researchers represented the AIRR Community, a network of over 300 researchers whose objective is to promote all aspects of immunoglobulin and T-cell receptor repertoire studies, including the standardization of experimental and computational aspects of AIRR-seq data generation and analysis. The Inferred Allele Review Committee (IARC) was established by the AIRR Community to devise policies, criteria, and procedures to perform this function. Formalized evaluations of novel inferred sequences have now begun and submissions are invited via a new dedicated portal (https://ogrdb.airr-community.org). Here, we summarize recommendations developed by the IARC—focusing, to begin with, on human IGHV genes—with the goal of facilitating the acceptance of inferred allelic variants of germline IGHV genes. We believe that this initiative will improve the quality of AIRR-seq studies by facilitating the description of human IG germline gene variation, and that in time, it will expand to the documentation of TR and IG genes in many vertebrate species

    Functional implications of glycans and their curation:insights from the workshop held at the 16th Annual International Biocuration Conference in Padua, Italy

    Get PDF
    Dynamic changes in protein glycosylation impact human health and disease progression. However, current resources that capture disease and phenotype information focus primarily on the macromolecules within the central dogma of molecular biology (DNA, RNA, proteins). To gain a better understanding of organisms, there is a need to capture the functional impact of glycans and glycosylation on biological processes. A workshop titled "Functional impact of glycans and their curation" was held in conjunction with the 16th Annual International Biocuration Conference to discuss ongoing worldwide activities related to glycan function curation. This workshop brought together subject matter experts, tool developers, and biocurators from over 20 projects and bioinformatics resources. Participants discussed four key topics for each of their resources: (i) how they curate glycan function-related data from publications and other sources, (ii) what type of data they would like to acquire, (iii) what data they currently have, and (iv) what standards they use. Their answers contributed input that provided a comprehensive overview of state-of-the-art glycan function curation and annotations. This report summarizes the outcome of discussions, including potential solutions and areas where curators, data wranglers, and text mining experts can collaborate to address current gaps in glycan and glycosylation annotations, leveraging each other's work to improve their respective resources and encourage impactful data sharing among resources. Database URL: https://wiki.glygen.org/Glycan_Function_Workshop_2023

    A Genetic Approach to Spot Detection in Two-Dimensional Gel

    No full text
    (2D PAGE) is a proteomic technique that allows the analysis of large collections and complex mixtures of proteins. The 2D-PAGE gel images depict protein signals as spots of various intensities and sizes. In this paper, we present a novel approach to unsupervised protein spot detection in 2D-PAGE images based on a genetic algorithm. This approach involves three main steps: a) wavelet-based noise reduction, b) segmentation of the input images into regions around the local maxima of the image intensities, c) detection and model-based quantification of the spots within each region using a genetic algorithm. This algorithm searches within a multidimensional parameter space to determine, in parallel, the parameters of multiple diffusion models that optimally fit the characteristics of possible spots. The detection and quantification of the spots is achieved by superposition of diffusion functions modeling adjacent spots. Experiments with 16-bit 2D-PAGE images show that the proposed method is effective and results in low spurious spot detection rate

    Lipid Droplets Are a Physiological Nucleoporin Reservoir

    No full text
    Lipid Droplets (LD) are dynamic organelles that originate in the Endoplasmic Reticulum and mostly bud off toward the cytoplasm, where they store neutral lipids for energy and protection purposes. LD also have diverse proteins on their surface, many of which are necessary for the their correct homeostasis. However, these organelles also act as reservoirs of proteins that can be made available elsewhere in the cell. In this sense, they act as sinks that titrate key regulators of many cellular processes. Among the specialized factors that reside on cytoplasmic LD are proteins destined for functions in the nucleus, but little is known about them and their impact on nuclear processes. By screening for nuclear proteins in publicly available LD proteomes, we found that they contain a subset of nucleoporins from the Nuclear Pore Complex (NPC). Exploring this, we demonstrate that LD act as a physiological reservoir, for nucleoporins, that impacts the conformation of NPCs and hence their function in nucleo-cytoplasmic transport, chromatin configuration, and genome stability. Furthermore, our in silico modeling predicts a role for LD-released fatty acids in regulating the transit of nucleoporins from LD through the cytoplasm and to nuclear pores

    Correction: IMGT/HighV-QUEST Statistical Significance of IMGT Clonotype (AA) Diversity per Gene for Standardized Comparisons of Next Generation Sequencing Immunoprofiles of Immunoglobulins and T Cell Receptors.

    No full text
    The adaptive immune responses of humans and of other jawed vertebrate species (gnasthostomata) are characterized by the B and T cells and their specific antigen receptors, the immunoglobulins (IG) or antibodies and the T cell receptors (TR) (up to 2.1012 different IG and TR per individual). IMGT, the international ImMunoGeneTics information system (http://www.imgt.org), was created in 1989 by Marie-Paule Lefranc (Montpellier University and CNRS) to manage the huge and complex diversity of these antigen receptors. IMGT built on IMGT-ONTOLOGY concepts of identification (keywords), description (labels), classification (gene and allele nomenclature) and numerotation (IMGT unique numbering), is at the origin of immunoinformatics, a science at the interface between immunogenetics and bioinformatics. IMGT/HighV-QUEST, the first web portal, and so far the only one, for the next generation sequencing (NGS) analysis of IG and TR, is the paradigm for immune repertoire standardized outputs and immunoprofiles of the adaptive immune responses. It provides the identification of the variable (V), diversity (D) and joining (J) genes and alleles, analysis of the V-(D)-J junction and complementarity determining region 3 (CDR3) and the characterization of the 'IMGT clonotype (AA)' (AA for amino acid) diversity and expression. IMGT/HighV-QUEST compares outputs of different batches, up to one million nucleotide sequencesfor the statistical module. These high throughput IG and TR repertoire immunoprofiles are of prime importance in vaccination, cancer, infectious diseases, autoimmunity and lymphoproliferative disorders, however their comparative statistical analysis still remains a challenge. We present a standardized statistical procedure to analyze IMGT/HighV-QUEST outputs for the evaluation of the significance of the IMGT clonotype (AA) diversity differences in proportions, per gene of a given group, between NGS IG and TR repertoire immunoprofiles. The procedure is generic and suitable for evaluating significance of the IMGT clonotype (AA) diversity and expression per gene, and for any IG and TR immunoprofiles of any species

    An updated evolutionary study of the Notch family reveals a new ancient origin and novel invariable motifs as potential pharmacological targets

    No full text
    Notch family proteins play a key role in a variety of developmental processes by controlling cell fate decisions and operating in a great number of biological processes in several organ systems, such as hematopoiesis, somatogenesis, vasculogenesis, neurogenesis and homeostasis. The Notch signaling pathway is crucial for the majority of developmental programs and regulates multiple pathogenic processes. Notch family receptors’ activation has been largely related to its multiple effects in sustaining oncogenesis. The Notch signaling pathway constitutes an ancient and conserved mechanism for cell to cell communication. Much of what is known about Notch family proteins function comes from studies done in Caenorhabditis Elegans and Drosophila Melanogaster. Although, human Notch homologs had also been identified, the molecular mechanisms which modulate the Notch signaling pathway remained substantially unknown. In this study, an updated evolutionary analysis of the Notch family members among 603 different organisms of all kingdoms, from bacteria to humans, was performed in order to discover key regions that have been conserved throughout evolution and play a major role in the Notch signaling pathway. The major goal of this study is the presentation of a novel updated phylogenetic tree for the Notch family as a reliable phylogeny “map”, in order to correlate information of the closely related members and identify new possible pharmacological targets that can be used in pathogenic cases, including cancer
    corecore