20 research outputs found

    Protein Sequence, Structure, and Dynamics Reveal Insights in the Divergence of Protein Functions

    Get PDF
    Proteins participate in every important aspect of known living. The amino acid sequence of which a protein is composed contains information about the physicochemical properties, the three-dimensional structure, and its function. However, connecting protein sequence to function is still an open challenge, particularly for protein families with complex inter-relationships, i.e., heteromeric interactions. Such is the case of the Epidermal Growth Factor (EGF) receptor system, comprising four or more paralogs of the EGF receptor interacting with seven or more paralogs of the peptide ligand.In this thesis, I use the evolutionary history of the EGF receptor system to show how phylogenetic patterns of evolution relate to functional divergence at the protein sequence level. By combining measures of residue conservation and residue co-evolution I developed a method to identify residues responsible of a specific protein function. Mutations on the residues highlighted by this method altered the auto-phosphorylation level of the EGF receptor and affected cellular growth. Next, I studied a fish-specific gene duplication of the EGF receptor and used it to describe and model a rare pattern of sequence evolution. I showed that this pattern could be related to functional divergence, thus providing a way to identify the occurrence of the event and the residues responsible of it. Ultimately, I analyzed whole protein families using protein similarity networks. My results showed how the networks made from structural similarity of predicted 3D-models give a better representation of the protein functions compared to sequence similarity networks, thus supporting a paradigm shift from sequence-based to predicted-structure-based bioinformatics software.Overall, my thesis shows a deep interconnection between functional divergence and protein sequence evolution that can be exploited for prediction of function or identification of evolutionary events. The conceptual foundations of this study could be used in other fields where gene duplication and functional residues play an important part, as for example protein engineering and the study of copy number variation in cancer biology.Okinawa Institute of Science and Technology Graduate Universit

    Bifidobacterium mellis sp. nov., isolated from the honey stomach of the honey bee Apis mellifera

    Get PDF
    : A novel Bifidobacterium strain, Bin7NT, was isolated from the honey stomach of the honey bee Apis mellifera. Cells are Gram-positive, non-motile, non-sporulating, facultative anaerobic and fructose 6-phosphate phosphoketolase-positive. Their optimal growth is at 37 °C in anaerobiosis in MRS (De Man, Rogosa and Sharpe) added with cysteine. The honey bee microbiota was composed of several phylotypes of Bifidobacterium and Lactobacillus. Comparative analysis of 16S rRNA gene sequence similarity revealed that strain Bin7NT grouped with Bifidobacterium species originating from honey bees and was closely related to Bifidobacterium asteroides DSM 20089T (99.67 % similarity). However, the highest average nucleotide identity and digital DNA-DNA hybridization values of 94.88 and 60.6 %, respectively, were obtained with Bifidobacterium choladohabitans JCM 34586T. The DNA G+C content of the type strain is 60.8 mol%. The cell-wall peptidoglycan is of the A4β l-Orn-d-Asp type. The main cellular fatty acids of strain Bin7NT are C18 : 1 ω9c, C16 : 0, C18 : 1 ω7c and C18 : 0. Phenotypic characterization and genotyping based on the genome sequences clearly show that this strain is distinct from the type strains of the so far recognized Bifidobacterium species. Thus, Bifidobacterium mellis sp. nov. (Bin7NT=DSM 29108T=CCUG 66113T) is proposed as novel Bifidobacterium species

    The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

    Get PDF
    Background The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.Peer reviewe

    The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

    Get PDF
    BackgroundThe Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.ResultsHere, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory.ConclusionWe conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.</p

    Inter-paralog amino acid inversion events in large phylogenies of duplicated proteins

    Get PDF
    Connecting protein sequence to function is becoming increasingly relevant since high-throughput sequencing studies accumulate large amounts of genomic data. In order to go beyond the existing database annotation, it is fundamental to understand the mechanisms underlying functional inheritance and divergence. If the homology relationship between proteins is known, can we determine whether the function diverged? In this work, we analyze different possibilities of protein sequence evolution after gene duplication and identify “inter-paralog inversions”, i.e., sites where the relationship between the ancestry and the functional signal is decoupled. The amino acids in these sites are masked from being recognized by other prediction tools. Still, they play a role in functional divergence and could indicate a shift in protein function. We develop a method to specifically recognize inter-paralog amino acid inversions in a phylogeny and test it on real and simulated datasets. In a dataset built from the Epidermal Growth Factor Receptor (EGFR) sequences found in 88 fish species, we identify 19 amino acid sites that went through inversion after gene duplication, mostly located at the ligand-binding extracellular domain. Our work uncovers an outcome of protein duplications with direct implications in protein functional annotation and sequence evolution. The developed method is optimized to work with large protein datasets and can be readily included in a targeted protein analysis pipeline

    Binding of single-mutant epidermal growth factor (EGF) ligands alters the stability of the EGF receptor dimer and promotes growth signaling

    Get PDF
    The epidermal growth factor receptor (EGFR) is a membrane-anchored tyrosine kinase that is able to selectively respond to multiple extracellular stimuli. Previous studies have indicated that the modularity of this system may be caused by ligand-induced differences in the stability of the receptor dimer. However, this hypothesis has not been explored using single-mutant ligands thus far. Herein, we developed a new approach to identify residues responsible for functional divergence by selecting residues in the epidermal growth factor (EGF) ligand that are conserved among orthologs yet divergent between paralogs. Then, we mutated these residues and assessed the mutants\u27 effects on the receptor using a combination of molecular dynamics (MD) and biochemical techniques. Although the EGF mutants had binding affinities for the EGFR comparable with the WT ligand, the EGF mutants showed differential patterns of receptor phosphorylation and cell growth in multiple cell lines. The MD simulations of the EGF mutants indicated that mutations had long-range effects on the receptor dimer interface. This study shows for the first time that a single mutation in the EGF is sufficient to alter the activation of the EGFR signaling pathway at the cellular level. These results also support that biased ligand-receptor signaling in the tyrosine kinase receptor system can lead to differential downstream outcomes and demonstrate a promising new method to study ligand-receptor interactions

    Can hybrid hyaluronic acid represent a valid approach to treat rizoarthrosis? A retrospective comparative study

    No full text
    6Osteoarthritis (OA) of the trapeziometacarpal joint (TMJ) is a disabling condition with a significant impact on quality of life. The optimal management of hand OA requires a combination of non-pharmacological and pharmacological treatments that include intra-articular (i.a.) therapy. EULAR experts recommend corticosteroid injections in TMJ OA and underline the usefulness of hyaluronic acid (HA). The aim of this study was the assessment of the efficacy and tolerability of i.a. injections of a hybrid formulation of HA (Sinovial H-L®) in comparison to triamcinolone in patients with TMJ OA.nonenoneTenti, Sara; Pascarelli, NICOLA ANTONIO; Giannotti, Stefano; Galeazzi, Mauro; Giordano, NICOLA GIUSEPPE; Fioravanti, AntonellaTenti, Sara; Pascarelli, Nicola Antonio; Giannotti, Stefano; Galeazzi, Mauro; Giordano, Nicola; Fioravanti, Antonell
    corecore