211 research outputs found

    Identifying and Seeing beyond Multiple Sequence Alignment Errors Using Intra-Molecular Protein Covariation

    Get PDF
    BACKGROUND: There is currently no way to verify the quality of a multiple sequence alignment that is independent of the assumptions used to build it. Sequence alignments are typically evaluated by a number of established criteria: sequence conservation, the number of aligned residues, the frequency of gaps, and the probable correct gap placement. Covariation analysis is used to find putatively important residue pairs in a sequence alignment. Different alignments of the same protein family give different results demonstrating that covariation depends on the quality of the sequence alignment. We thus hypothesized that current criteria are insufficient to build alignments for use with covariation analyses. METHODOLOGY/PRINCIPAL FINDINGS: We show that current criteria are insufficient to build alignments for use with covariation analyses as systematic sequence alignment errors are present even in hand-curated structure-based alignment datasets like those from the Conserved Domain Database. We show that current non-parametric covariation statistics are sensitive to sequence misalignments and that this sensitivity can be used to identify systematic alignment errors. We demonstrate that removing alignment errors due to 1) improper structure alignment, 2) the presence of paralogous sequences, and 3) partial or otherwise erroneous sequences, improves contact prediction by covariation analysis. Finally we describe two non-parametric covariation statistics that are less sensitive to sequence alignment errors than those described previously in the literature. CONCLUSIONS/SIGNIFICANCE: Protein alignments with errors lead to false positive and false negative conclusions (incorrect assignment of covariation and conservation, respectively). Covariation analysis can provide a verification step, independent of traditional criteria, to identify systematic misalignments in protein alignments. Two non-parametric statistics are shown to be somewhat insensitive to misalignment errors, providing increased confidence in contact prediction when analyzing alignments with erroneous regions because of an emphasis on they emphasize pairwise covariation over group covariation

    Identification of Coevolving Residues and Coevolution Potentials Emphasizing Structure, Bond Formation and Catalytic Coordination in Protein Evolution

    Get PDF
    The structure and function of a protein is dependent on coordinated interactions between its residues. The selective pressures associated with a mutation at one site should therefore depend on the amino acid identity of interacting sites. Mutual information has previously been applied to multiple sequence alignments as a means of detecting coevolutionary interactions. Here, we introduce a refinement of the mutual information method that: 1) removes a significant, non-coevolutionary bias and 2) accounts for heteroscedasticity. Using a large, non-overlapping database of protein alignments, we demonstrate that predicted coevolving residue-pairs tend to lie in close physical proximity. We introduce coevolution potentials as a novel measure of the propensity for the 20 amino acids to pair amongst predicted coevolutionary interactions. Ionic, hydrogen, and disulfide bond-forming pairs exhibited the highest potentials. Finally, we demonstrate that pairs of catalytic residues have a significantly increased likelihood to be identified as coevolving. These correlations to distinct protein features verify the accuracy of our algorithm and are consistent with a model of coevolution in which selective pressures towards preserving residue interactions act to shape the mutational landscape of a protein by restricting the set of admissible neutral mutations

    Structural and Functional Roles of Coevolved Sites in Proteins

    Get PDF
    Understanding the residue covariations between multiple positions in protein families is very crucial and can be helpful for designing protein engineering experiments. These simultaneous changes or residue coevolution allow protein to maintain its overall structural-functional integrity while enabling it to acquire specific functional modifications. Despite the significant efforts in the field there is still controversy in terms of the preferable locations of coevolved residues on different regions of protein molecules, the strength of coevolutionary signal and role of coevolution in functional diversification.In this paper we study the scale and nature of residue coevolution in maintaining the overall functionality and structural integrity of proteins. We employed a large scale study to investigate the structural and functional aspects of coevolved residues. We found that the networks representing the coevolutionary residue connections within our dataset are in general of 'small-world' type as they have clustering coefficient values higher than random networks and also show smaller mean shortest path lengths similar and/or lower than random and regular networks. We also found that altogether 11% of functionally important sites are coevolved with any other sites. Active sites are found more frequently to coevolve with any other sites (15%) compared to protein (11%) and ligand (9%) binding sites. Metal binding and active sites are also found to be more frequently coevolved with other metal binding and active sites, respectively. Analysis of the coupling between coevolutionary processes and the spatial distribution of coevolved sites reveals that a high fraction of coevolved sites are located close to each other. Moreover, approximately 80% of charge compensatory substitutions within coevolved sites are found at very close spatial proximity (<or= 5A), pointing to the possible preservation of salt bridges in evolution.Our findings show that a noticeable fraction of functionally important sites undergo coevolution and also point towards compensatory substitutions as a probable coevolutionary mechanism within spatially proximal coevolved functional sites

    Prevalence of Epistasis in the Evolution of Influenza A Surface Proteins

    Get PDF
    The surface proteins of human influenza A viruses experience positive selection to escape both human immunity and, more recently, antiviral drug treatments. In bacteria and viruses, immune-escape and drug-resistant phenotypes often appear through a combination of several mutations that have epistatic effects on pathogen fitness. However, the extent and structure of epistasis in influenza viral proteins have not been systematically investigated. Here, we develop a novel statistical method to detect positive epistasis between pairs of sites in a protein, based on the observed temporal patterns of sequence evolution. The method rests on the simple idea that a substitution at one site should rapidly follow a substitution at another site if the sites are positively epistatic. We apply this method to the surface proteins hemagglutinin and neuraminidase of influenza A virus subtypes H3N2 and H1N1. Compared to a non-epistatic null distribution, we detect substantial amounts of epistasis and determine the identities of putatively epistatic pairs of sites. In particular, using sequence data alone, our method identifies epistatic interactions between specific sites in neuraminidase that have recently been demonstrated, in vitro, to confer resistance to the drug oseltamivir; these epistatic interactions are responsible for widespread drug resistance among H1N1 viruses circulating today. This experimental validation demonstrates the predictive power of our method to identify epistatic sites of importance for viral adaptation and public health. We conclude that epistasis plays a large role in shaping the molecular evolution of influenza viruses. In particular, sites with , which would normally not be identified as positively selected, can facilitate viral adaptation through epistatic interactions with their partner sites. The knowledge of specific interactions among sites in influenza proteins may help us to predict the course of antigenic evolution and, consequently, to select more appropriate vaccines and drugs

    Measurement of the W±Z boson pair-production cross section in pp collisions at √s=13TeV with the ATLAS detector

    Get PDF
    published_or_final_versio

    Search for High-Mass Resonances Decaying to τν in pp Collisions at √s=13 TeV with the ATLAS Detector

    Get PDF
    A search for high-mass resonances decaying to τν using proton-proton collisions at √s=13 TeV produced by the Large Hadron Collider is presented. Only τ-lepton decays with hadrons in the final state are considered. The data were recorded with the ATLAS detector and correspond to an integrated luminosity of 36.1 fb−1. No statistically significant excess above the standard model expectation is observed; model-independent upper limits are set on the visible τν production cross section. Heavy W′ bosons with masses less than 3.7 TeV in the sequential standard model and masses less than 2.2–3.8 TeV depending on the coupling in the nonuniversal G(221) model are excluded at the 95% credibility level

    Search for the direct production of charginos and neutralinos in final states with tau leptons in √s=13 TeV collisions with the ATLAS detector

    Get PDF
    A search for the direct production of charginos and neutralinos in final states with at least two hadronically decaying tau leptons is presented. The analysis uses a dataset of pp collisions corresponding to an integrated luminosity of 36.1 fb−1, recorded with the ATLAS detector at the Large Hadron Collider at a centre-of-mass energy of 13TeV.Nosignificant deviation from the expected Standard Model background is observed. Limits are derived in scenarios of ˜χ+1 ˜χ−1 pair production and of ˜χ±1 ˜χ02 and ˜χ+1 ˜χ−1 production in simplified models where the neutralinos and charginos decay solely via intermediate left-handed staus and tau sneutrinos, and the mass of the ˜ τL state is set to be halfway between the masses of the ˜χ±1 and the ˜χ01. Chargino masses up to 630 GeV are excluded at 95% confidence level in the scenario of direct production of ˜χ+1 ˜χ−1 for a massless ˜χ01. Common ˜χ±1 and ˜χ02 masses up to 760 GeV are excluded in the case of production of ˜χ±1 ˜χ02 and ˜χ+1 ˜χ−1 assuming a massless ˜χ01. Exclusion limits for additional benchmark scenarios with large and small mass-splitting between the ˜χ±1 and the ˜χ01 are also studied by varying the ˜ τL mass between the masses of the ˜χ±1 and the ˜χ01

    Combined measurement of differential and total cross sections in the H → γγ and the H → ZZ* → 4ℓ decay channels at s=13 TeV with the ATLAS detector

    Get PDF
    A combined measurement of differential and inclusive total cross sections of Higgs boson production is performed using 36.1 fb−1 of 13 TeV proton–proton collision data produced by the LHC and recorded by the ATLAS detector in 2015 and 2016. Cross sections are obtained from measured H→γγ and H→ZZ*(→4ℓ event yields, which are combined taking into account detector efficiencies, resolution, acceptances and branching fractions. The total Higgs boson production cross section is measured to be 57.0−5.9 +6.0 (stat.) −3.3 +4.0 (syst.) pb, in agreement with the Standard Model prediction. Differential cross-section measurements are presented for the Higgs boson transverse momentum distribution, Higgs boson rapidity, number of jets produced together with the Higgs boson, and the transverse momentum of the leading jet. The results from the two decay channels are found to be compatible, and their combination agrees with the Standard Model predictions

    Search for strong gravity in multijet final states produced in pp collisions at √s=13 TeV using the ATLAS detector at the LHC

    Get PDF
    A search is conducted for new physics in multijet final states using 3.6 inverse femtobarns of data from proton-proton collisions at √s = 13TeV taken at the CERN Large Hadron Collider with the ATLAS detector. Events are selected containing at least three jets with scalar sum of jet transverse momenta (HT) greater than 1TeV. No excess is seen at large HT and limits are presented on new physics: models which produce final states containing at least three jets and having cross sections larger than 1.6 fb with HT > 5.8 TeV are excluded. Limits are also given in terms of new physics models of strong gravity that hypothesize additional space-time dimensions

    Measurement of differential cross sections and W + /W − cross-section ratios for W boson production in association with jets at √s =8 TeV with the ATLAS detector

    Get PDF
    This paper presents a measurement of the W boson production cross section and the W + /W − cross-section ratio, both in association with jets, in proton--proton collisions at s √ =8 TeV with the ATLAS experiment at the Large Hadron Collider. The measurement is performed in final states containing one electron and missing transverse momentum using data corresponding to an integrated luminosity of 20.2 fb −1 . Differential cross sections for events with one or two jets are presented for a range of observables, including jet transverse momenta and rapidities, the scalar sum of transverse momenta of the visible particles and the missing transverse momentum in the event, and the transverse momentum of the W boson. For a subset of the observables, the differential cross sections of positively and negatively charged W bosons are measured separately. In the cross-section ratio of W + /W − the dominant systematic uncertainties cancel out, improving the measurement precision by up to a factor of nine. The observables and ratios selected for this paper provide valuable input for the up quark, down quark, and gluon parton distribution functions of the proto
    corecore