3 research outputs found

    Predictive Models and Impact of Interfacial Contacts and Amino Acids on Protein–Protein Binding Affinity

    No full text
    Protein–protein interactions (PPIs) play a central role in nearly all cellular processes. The strength of the binding in a PPI is characterized by the binding affinity (BA) and is a key factor in controlling protein–protein complex formation and defining the structure–function relationship. Despite advancements in understanding protein–protein binding, much remains unknown about the interfacial region and its association with BA. New models are needed to predict BA with improved accuracy for therapeutic design. Here, we use machine learning approaches to examine how well different types of interfacial contacts can be used to predict experimentally determined BA and to reveal the impact of the specific amino acids at the binding interface on BA. We create a series of multivariate linear regression models incorporating different contact features at both residue and atomic levels and examine how different methods of identifying and characterizing these properties impact the performance of these models. Particularly, we introduce a new and simple approach to predict BA based on the quantities of specific amino acids at the protein–protein interface. We found that the numbers of specific amino acids at the protein–protein interface were correlated with BA. We show that the interfacial numbers of amino acids can be used to produce models with consistently good performance across different data sets, indicating the importance of the identities of interfacial amino acids in underlying BA. When trained on a diverse set of complexes from two benchmark data sets, the best performing BA model was generated with an explicit linear equation involving six amino acids. Tyrosine, in particular, was identified as the key amino acid in controlling BA, as it had the strongest correlation with BA and was consistently identified as the most important amino acid in feature importance studies. Glycine and serine were identified as the next two most important amino acids in predicting BA. The results from this study further our understanding of PPIs and can be used to make improved predictions of BA, giving them implications for drug design and screening in the pharmaceutical industry

    Predictive Models and Impact of Interfacial Contacts and Amino Acids on Protein–Protein Binding Affinity

    No full text
    Protein–protein interactions (PPIs) play a central role in nearly all cellular processes. The strength of the binding in a PPI is characterized by the binding affinity (BA) and is a key factor in controlling protein–protein complex formation and defining the structure–function relationship. Despite advancements in understanding protein–protein binding, much remains unknown about the interfacial region and its association with BA. New models are needed to predict BA with improved accuracy for therapeutic design. Here, we use machine learning approaches to examine how well different types of interfacial contacts can be used to predict experimentally determined BA and to reveal the impact of the specific amino acids at the binding interface on BA. We create a series of multivariate linear regression models incorporating different contact features at both residue and atomic levels and examine how different methods of identifying and characterizing these properties impact the performance of these models. Particularly, we introduce a new and simple approach to predict BA based on the quantities of specific amino acids at the protein–protein interface. We found that the numbers of specific amino acids at the protein–protein interface were correlated with BA. We show that the interfacial numbers of amino acids can be used to produce models with consistently good performance across different data sets, indicating the importance of the identities of interfacial amino acids in underlying BA. When trained on a diverse set of complexes from two benchmark data sets, the best performing BA model was generated with an explicit linear equation involving six amino acids. Tyrosine, in particular, was identified as the key amino acid in controlling BA, as it had the strongest correlation with BA and was consistently identified as the most important amino acid in feature importance studies. Glycine and serine were identified as the next two most important amino acids in predicting BA. The results from this study further our understanding of PPIs and can be used to make improved predictions of BA, giving them implications for drug design and screening in the pharmaceutical industry

    Dual Imaging Single Vesicle Surface Protein Profiling and Early Cancer Detection

    No full text
    Single vesicle molecular profiling has the potential to transform cancer detection and monitoring by precisely probing cancer-associated extracellular vesicles (EVs) in the presence of normal EVs in body fluids, but it is challenging due to the small EV size, low abundance of antigens on individual vesicles, and a complex biological matrix. Here, we report a facile dual imaging single vesicle technology (DISVT) for surface protein profiling of individual EVs and quantification of target-specific EV subtypes based on direct molecular capture of EVs from diluted biofluids, dual EV-protein fluorescence-light scattering imaging, and fast image analysis using Bash scripts, Python, and ImageJ. Plasmonic gold nanoparticles (AuNPs) were used to label and detect targeted surface protein markers on individual EVs with dark-field light scattering imaging at the single particle level. Monte Carlo calculations estimated that the AuNPs could detect EVs down to 40 nm in diameter. Using the DISVT, we profiled surface protein markers of interest across individual EVs derived from several breast cancer cell lines, which reflected the parental cells. Studies with plasma EVs from healthy donors and breast cancer patients revealed that the DISVT, but not the traditional bulk enzyme-linked immunosorbent assay, detected human epidermal growth factor receptor 2 (HER2)-positive breast cancer at an early stage. The DISVT also precisely differentiated HER2-positive breast cancer from HER2-negative breast cancer. We additionally showed that the amount of tumor-associated EVs was tripled in locally advanced patients compared to that in early-stage patients. These studies suggest that single EV surface protein profiling with DISVT can provide a facile and high-sensitivity method for early cancer detection and quantitative monitoring
    corecore