151 research outputs found
Genome Interpretation Using In Silico Predictors of Variant Impact
Estimating the effects of variants found in disease driver genes opens the door to personalized therapeutic opportunities. Clinical associations and laboratory experiments can only characterize a tiny fraction of all the available variants, leaving the majority as variants of unknown significance (VUS). In silico methods bridge this gap by providing instant estimates on a large scale, most often based on the numerous genetic differences between species. Despite concerns that these methods may lack reliability in individual subjects, their numerous practical applications over cohorts suggest they are already helpful and have a role to play in genome interpretation when used at the proper scale and context. In this review, we aim to gain insights into the training and validation of these variant effect predicting methods and illustrate representative types of experimental and clinical applications. Objective performance assessments using various datasets that are not yet published indicate the strengths and limitations of each method. These show that cautious use of in silico variant impact predictors is essential for addressing genome interpretation challenges
Coevolutionary Signals in Metabotropic Glutamate Receptors Capture Residue Contacts and Long-Range Functional Interactions
Upon ligand binding to a G protein-coupled receptor, extracellular signals are transmitted into a cell through sets of residue interactions that translate ligand binding into structural rearrangements. These interactions needed for functions impose evolutionary constraints so that, on occasion, mutations in one position may be compensated by other mutations at functionally coupled positions. To quantify the impact of amino acid substitutions in the context of major evolutionary divergence in the G protein-coupled receptor subfamily of metabotropic glutamate receptors (mGluRs), we combined two phylogenetic-based algorithms, Evolutionary Trace and covariation Evolutionary Trace, to infer potential structure-function couplings and roles in mGluRs. We found a subset of evolutionarily important residues at known functional sites and evidence of coupling among distinct structural clusters in mGluR. In addition, experimental mutagenesis and functional assays confirmed that some highly covariant residues are coupled, revealing their synergy. Collectively, these findings inform a critical step toward understanding the molecular and structural basis of amino acid variation patterns within mGluRs and provide insight for drug development, protein engineering, and analysis of naturally occurring variants
A Method to Delineate De Novo Missense Variants Across Pathways Prioritizes Genes Linked to Autism
Genotype-phenotype relationships shape health and population fitness but remain difficult to predict and interpret. Here, we apply an evolutionary action method in mutational landscapes to unravel genes and pathways connected to autism spectrum disorder (ASD). Evolutionary action predicts the impact of missense variants on protein function by measuring motions in fitness landscapes, based on phylogenetic distances and substitution odds in homologous sequences. By examining 368 pathways across 2,384 individuals with ASD (probands), we found that 23 pathways, a total of 398 genes, had de novo missense variants biased to higher evolutionary action scores than expected by random chance, including axonogenesis, synaptic transmission, and neurodevelopmental pathways. The predicted fitness impact of de novo and inherited missense variants in candidate genes correlated with the IQ of individuals with ASD, even for using only the new gene candidates. This approach demonstrates how the evolutionary action method can be applied in biology to integrate missense variants over a cohort to identify genes contributing a shared phenotype. Using this approach, we have detected those missense variants most likely to contribute to ASD pathogenesis and have elucidated their phenotypic impact
CovET: A Covariation-Evolutionary Trace Method That Identifies Protein Structure-Function Modules
Measuring the relative effect that any two sequence positions have on each other may improve protein design or help better interpret coding variants. Current approaches use statistics and machine learning but rarely consider phylogenetic divergences which, as shown by Evolutionary Trace studies, provide insight into the functional impact of sequence perturbations. Here, we reframe covariation analyses in the Evolutionary Trace framework to measure the relative tolerance to perturbation of each residue pair during evolution. This approach (CovET) systematically accounts for phylogenetic divergences: at each divergence event, we penalize covariation patterns that belie evolutionary coupling. We find that while CovET approximates the performance of existing methods to predict individual structural contacts, it performs significantly better at finding structural clusters of coupled residues and ligand binding sites. For example, CovET found more functionally critical residues when we examined the RNA recognition motif and WW domains. It correlates better with large-scale epistasis screen data. In the dopamine D2 receptor, top CovET residue pairs recovered accurately the allosteric activation pathway characterized for Class A G protein-coupled receptors. These data suggest that CovET ranks highest the sequence position pairs that play critical functional roles through epistatic and allosteric interactions in evolutionarily relevant structure-function motifs. CovET complements current methods and may shed light on fundamental molecular mechanisms of protein structure and function
Recurrent High-Impact Mutations at Cognate Structural Positions in Class a G Protein-Coupled Receptors Expressed in Tumors
G protein-coupled receptors (GPCRs) are the largest family of human proteins. They have a common structure and, signaling through a much smaller set of G proteins, arrestins, and effectors, activate downstream pathways that often modulate hallmark mechanisms of cancer. Because there are many more GPCRs than effectors, mutations in different receptors could perturb signaling similarly so as to favor a tumor. We hypothesized that somatic mutations in tumor samples may not be enriched within a single gene but rather that cognate mutations with similar effects on GPCR function are distributed across many receptors. To test this possibility, we systematically aggregated somatic cancer mutations across class A GPCRs and found a nonrandom distribution of positions with variant amino acid residues. Individual cancer types were enriched for highly impactful, recurrent mutations at selected cognate positions of known functional motifs. We also discovered that no single receptor drives this pattern, but rather multiple receptors contain amino acid substitutions at a few cognate positions. Phenotypic characterization suggests these mutations induce perturbation of G protein activation and/or β-arrestin recruitment. These data suggest that recurrent impactful oncogenic mutations perturb different GPCRs to subvert signaling and promote tumor growth or survival. The possibility that multiple different GPCRs could moonlight as drivers or enablers of a given cancer through mutations located at cognate positions across GPCR paralogs opens a window into cancer mechanisms and potential approaches to therapeutics
Accounting for epistatic interactions improves the functional analysis of protein structures
Motivation: The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. Methods and Results: We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Conclusions: Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online
Structure and evolutionary trace-assisted screening of a residue swapping the substrate ambiguity and chiral specificity in an esterase
11 pags., 6figs., 3 pags.Our understanding of enzymes with high substrate ambiguity remains limited because their large active sites allow substrate docking freedom to an extent that seems incompatible with stereospecificity. One possibility is that some of these enzymes evolved a set of evolutionarily fitted sequence positions that stringently allow switching substrate ambiguity and chiral specificity. To explore this hypothesis, we targeted for mutation a serine ester hydrolase (EH) that exhibits an impressive 71-substrate repertoire but is not stereospecific (e.e. 50%). We used structural actions and the computational evolutionary trace method to explore specificity-swapping sequence positions and hypothesized that position I244 was critical. Driven by evolutionary action analysis, this position was substituted to leucine, which together with isoleucine appears to be the amino acid most commonly present in the closest homologous sequences (max. identity, ca. 67.1%), and to phenylalanine, which appears in distant homologues. While the I244L mutation did not have any functional consequences, the I244F mutation allowed the esterase to maintain a remarkable 53-substrate range while gaining stereospecificity properties (e.e. 99.99%). These data support the possibility that some enzymes evolve sequence positions that control the substrate scope and stereospecificity. Such residues, which can be evolutionarily screened, may serve as starting points for further designing substrate-ambiguous, yet chiral-specific, enzymes that are greatly appreciated in biotechnology and synthetic chemistry.MF acknowledges the grant ‘INMARE’ from the EuropeanUnion’s Horizon 2020 (grant agreement no. 634486), the grantsPCIN-2017-078 (within the Marine Biotechnology ERA-NET) and BIO2017-85522-R from the Ministerio de EconomÃa, Industria y Competitividad, Agencia Estatal de Investigación (AEI), Fondo Eur-opeo de Desarrollo Regional (FEDER) and the European Union (EU),and the grant 2020AEP061 from the Agencia Estatal CSIC. J.S-A.acknowledges grant PID2019-105838RB-C33 from the Ministeriode Ciencia e Innovación, Agencia Estatal de Investigación (AEI),Fondo Europeo de Desarrollo Regional (FEDER) and the EuropeanUnion (EU). P.N.G. acknowledges the support of the Era-Net IB Pro-ject MetaCat funded through UK Biotechnology and BiologicalSciences Research Council (BBSRC), grant No. BB/M029085/1, andthe Centre for Environmental Biotechnology Project, co-fundedby European Regional Development Fund (ERDF) via the WelshGovernment (WEFO); R.B. acknowledges the Supercomputing Wales project, co-funded by ERDF via WEFO. OL and PK were sup-ported by the National Institutes of Health (NIH) grants 5R01AG061105, 5R01GM066099, and 5R01GM079656. C. CoscolÃnthanks the Ministerio de EconomÃa y Competitividad and FEDER fora PhD fellowship (Grant BES-2015-073829). Staff of the Synchrotron Radiation Source at Alba (Barcelona, Spain) for assistance at the BL13-XALOC beamlin
Evolutionary Action of Mutations Reveals Antimicrobial Resistance Genes in Escherichia coli
Since antibiotic development lags, we search for potential drug targets through directed evolution experiments. A challenge is that many resistance genes hide in a noisy mutational background as mutator clones emerge in the adaptive population. Here, to overcome this noise, we quantify the impact of mutations through evolutionary action (EA). After sequencing ciprofloxacin or colistin resistance strains grown under different mutational regimes, we find that an elevated sum of the evolutionary action of mutations in a gene identifies known resistance drivers. This EA integration approach also suggests new antibiotic resistance genes which are then shown to provide a fitness advantage in competition experiments. Moreover, EA integration analysis of clinical and environmental isolates of antibiotic resistant of E. coli identifies gene drivers of resistance where a standard approach fails. Together these results inform the genetic basis of de novo colistin resistance and support the robust discovery of phenotype-driving genes via the evolutionary action of genetic perturbations in fitness landscapes
Recommended from our members
Event detection using roles and relationships of entities
A method, system, and computer program product for event detection using roles and relationships of entities are provided in the illustrative embodiments. A training event and a set of entities participating in the training event are identified in a training data. For a first entity in the set of entities, a first role occupied by the entity in the event is determined. A behavior attribute is assigned to the first role. A relationship of the first role with a second role corresponding to a second entity in the set of entities is determined. An event rule is constructed to detect an event corresponding to the training event in new data and comprising a plurality of roles, behavior attributes, and the relationship. The plurality of roles includes the first role and the second role, and the plurality of behavior attributes includes the behavior attribute assigned to the first role.Board of Regents, University of Texas Syste
- …