4,060 research outputs found

    Spherical polar Fourier assembly of protein complexes with arbitrary point group symmetry

    Get PDF
    International audienceA novel fast Fourier transform-based ab inito docking algorithm called SAM is presented, for building perfectly symmetrical models of protein complexes with arbitrary point group symmetry. The basic approach uses a novel and very fast one-dimensional symmetry-constrained spherical polar Fourier search to assemble cyclic Cn systems from a given protein monomer. Structures with higher-order (Dn, T, O and I) point group symmetries may be built using a subsequent symmetry-constrained Fourier domain search to assemble trimeric sub-units. The results reported here show that the SAM algorithm can correctly assemble monomers of up to around 500 residues to produce a near-native complex structure with the given point group symmetry in 17 out of 18 test cases. The SAM program may be downloaded for academic use at http://sam.loria.fr/

    HexServer: an FFT-based protein docking server powered by graphics processors

    Get PDF
    HexServer (http://hexserver.loria.fr/) is the first Fourier transform (FFT)-based protein docking server to be powered by graphics processors. Using two graphics processors simultaneously, a typical 6D docking run takes ∼15 s, which is up to two orders of magnitude faster than conventional FFT-based docking approaches using comparable resolution and scoring functions. The server requires two protein structures in PDB format to be uploaded, and it produces a ranked list of up to 1000 docking predictions. Knowledge of one or both protein binding sites may be used to focus and shorten the calculation when such information is available. The first 20 predictions may be accessed individually, and a single file of all predicted orientations may be downloaded as a compressed multi-model PDB file. The server is publicly available and does not require any registration or identification by the user

    Unraveling the molecular architecture of a G protein-coupled receptor/β-arrestin/Erk module complex

    Get PDF
    International audienceβ-arrestins serve as signaling scaffolds downstream of G protein-coupled receptors, and thus play a crucial role in a plethora of cellular processes. Although it is largely accepted that the ability of β-arrestins to interact simultaneously with many protein partners is key in G protein-independent signaling of GPCRs, only the precise knowledge of these multimeric arrangements will allow a full understanding of the dynamics of these interactions and their functional consequences. However, current experimental procedures for the determination of the three-dimensional structures of protein-protein complexes are not well adapted to analyze these short-lived, multi-component assemblies. We propose a model of the receptor/β-arrestin/Erk1 signaling module, which is consistent with most of the available experimental data. Moreover, for the β-arrestin/Raf1 and the β-arrestin/ERK interactions, we have used the model to design interfering peptides and shown that they compete with both partners, hereby demonstrating the validity of the predicted interaction regions

    Functional Annotation of Proteins using Domain Embedding based Sequence Classification

    Get PDF
    International audienceDue to the recent advancement in genomic sequencing technologies, the number of protein sequences in public databases is growing exponentially. The UniProt Knowledgebase (UniProtKB) is currently the largest and most comprehensive resource for protein sequence and annotation data. The May 2019 release of the Uniprot Knowledge base (UniprotKB) contains around 158 million protein sequences. For the complete exploitation of this huge knowledge base, protein sequences need to be annotated with functional properties such as Enzyme Commission (EC) numbers and Gene Ontology terms. However, there is only about half a million sequences (UniprotKB/SwissProt) are reviewed and functionally annotated by expert curators using information extracted from the published literature and computational analyses. The manual annotation by experts are expensive, slow and insufficient to fill the gap between the annotated and unannotated protein sequences. In this paper, we present an automatic functional annotation technique using neural network based based word embedding exploiting domain and family information of proteins. Domains are the most conserved regions in protein sequences and constitute the building blocks of 3D protein structures. To do the experiment, we used fastText a , a library for learning of word embeddings and text classification developed by Facebook's AI Research lab. The experimental results show that domain embeddings perform much better than k-mer based word embeddings. a https://github.com/facebookresearch/fasttex

    Using Content-Based Filtering to Infer Direct Associations between the CATH, Pfam, and SCOP Domain Databases

    Get PDF
    International audienceProtein domain structure classification systems such as CATH and SCOP provide a useful way todescribe evolutionary structure-function relationships. Similarly, the Pfam sequence-basedclassification identifies sequence-function relationships. Nonetheless, there is no completedirect mapping from one classification to another. This means that functional annotations thathave been assigned to one classification cannot always be assigned to another. Here, wepresent a novel content-based filtering approach called CAPS (Computing direct Associationsbetween annotations of Protein Sequences and Structures) to systematically analyze multipleprotein-domain relationships in the SIFTS and UniProt databases in order to infer directmappings between CATH superfamilies, Pfam clans or families, and SCOP superfamilies. Wethen compare the result with existing mappings in Pfam, InterPro, and Genome3D

    ECDomainMiner: discovering hidden associations between enzyme commission numbers and Pfam domains

    Get PDF
    International audienceBackgroundMany entries in the protein data bank (PDB) are annotated to show their component protein domains according to the Pfam classification, as well as their biological function through the enzyme commission (EC) numbering scheme. However, despite the fact that the biological activity of many proteins often arises from specific domain-domain and domain-ligand interactions, current on-line resources rarely provide a direct mapping from structure to function at the domain level. Since the PDB now contains many tens of thousands of protein chains, and since protein sequence databases can dwarf such numbers by orders of magnitude, there is a pressing need to develop automatic structure-function annotation tools which can operate at the domain level
    corecore