48 research outputs found
Identifying and Seeing beyond Multiple Sequence Alignment Errors Using Intra-Molecular Protein Covariation
BACKGROUND: There is currently no way to verify the quality of a multiple sequence alignment that is independent of the assumptions used to build it. Sequence alignments are typically evaluated by a number of established criteria: sequence conservation, the number of aligned residues, the frequency of gaps, and the probable correct gap placement. Covariation analysis is used to find putatively important residue pairs in a sequence alignment. Different alignments of the same protein family give different results demonstrating that covariation depends on the quality of the sequence alignment. We thus hypothesized that current criteria are insufficient to build alignments for use with covariation analyses.
METHODOLOGY/PRINCIPAL FINDINGS: We show that current criteria are insufficient to build alignments for use with covariation analyses as systematic sequence alignment errors are present even in hand-curated structure-based alignment datasets like those from the Conserved Domain Database. We show that current non-parametric covariation statistics are sensitive to sequence misalignments and that this sensitivity can be used to identify systematic alignment errors. We demonstrate that removing alignment errors due to 1) improper structure alignment, 2) the presence of paralogous sequences, and 3) partial or otherwise erroneous sequences, improves contact prediction by covariation analysis. Finally we describe two non-parametric covariation statistics that are less sensitive to sequence alignment errors than those described previously in the literature.
CONCLUSIONS/SIGNIFICANCE: Protein alignments with errors lead to false positive and false negative conclusions (incorrect assignment of covariation and conservation, respectively). Covariation analysis can provide a verification step, independent of traditional criteria, to identify systematic misalignments in protein alignments. Two non-parametric statistics are shown to be somewhat insensitive to misalignment errors, providing increased confidence in contact prediction when analyzing alignments with erroneous regions because of an emphasis on they emphasize pairwise covariation over group covariation
A Coevolutionary Residue Network at the Site of a Functionally Important Conformational Change in a Phosphohexomutase Enzyme Family
Coevolution analyses identify residues that co-vary with each other during evolution, revealing sequence relationships unobservable from traditional multiple sequence alignments. Here we describe a coevolutionary analysis of phosphomannomutase/phosphoglucomutase (PMM/PGM), a widespread and diverse enzyme family involved in carbohydrate biosynthesis. Mutual information and graph theory were utilized to identify a network of highly connected residues with high significance. An examination of the most tightly connected regions of the coevolutionary network reveals that most of the involved residues are localized near an interdomain interface of this enzyme, known to be the site of a functionally important conformational change. The roles of four interface residues found in this network were examined via site-directed mutagenesis and kinetic characterization. For three of these residues, mutation to alanine reduces enzyme specificity to ∼10% or less of wild-type, while the other has ∼45% activity of wild-type enzyme. An additional mutant of an interface residue that is not densely connected in the coevolutionary network was also characterized, and shows no change in activity relative to wild-type enzyme. The results of these studies are interpreted in the context of structural and functional data on PMM/PGM. Together, they demonstrate that a network of coevolving residues links the highly conserved active site with the interdomain conformational change necessary for the multi-step catalytic reaction. This work adds to our understanding of the functional roles of coevolving residue networks, and has implications for the definition of catalytically important residues
Integrated Analysis of Residue Coevolution and Protein Structure in ABC Transporters
Intraprotein side chain contacts can couple the evolutionary process of amino acid substitution at one position to that at another. This coupling, known as residue coevolution, may vary in strength. Conserved contacts thus not only define 3-dimensional protein structure, but also indicate which residue-residue interactions are crucial to a protein’s function. Therefore, prediction of strongly coevolving residue-pairs helps clarify molecular mechanisms underlying function. Previously, various coevolution detectors have been employed separately to predict these pairs purely from multiple sequence alignments, while disregarding available structural information. This study introduces an integrative framework that improves the accuracy of such predictions, relative to previous approaches, by combining multiple coevolution detectors and incorporating structural contact information. This framework is applied to the ABC-B and ABC-C transporter families, which include the drug exporter P-glycoprotein involved in multidrug resistance of cancer cells, as well as the CFTR chloride channel linked to cystic fibrosis disease. The predicted coevolving pairs are further analyzed based on conformational changes inferred from outward- and inward-facing transporter structures. The analysis suggests that some pairs coevolved to directly regulate conformational changes of the alternating-access transport mechanism, while others to stabilize rigid-body-like components of the protein structure. Moreover, some identified pairs correspond to residues previously implicated in cystic fibrosis
The Role of Tourism and Recreation in the Spread of Non-Native Species: A Systematic Review and Meta-Analysis
Managing the pathways by which non-native species are introduced and spread is considered the most effective way of preventing species invasions. Tourism and outdoor recreation involve the frequent congregation of people, vehicles and vessels from geographically diverse areas. They are therefore perceived to be major pathways for the movement of non-native species, and ones that will become increasingly important with the continued growth of these sectors. However, a global assessment of the relationship between tourism activities and the introduction of non-native species–particularly in freshwater and marine environments–is lacking. We conducted a systematic review and meta-analysis to determine the impact of tourism and outdoor recreation on non-native species in terrestrial, marine and freshwater environments. Our results provide quantitative evidence that the abundance and richness of non-native species are significantly higher in sites where tourist activities take place than in control sites. The pattern was consistent across terrestrial, freshwater and marine environments; across a variety of vectors (e.g. horses, hikers, yachts); and across a range of taxonomic groups. These results highlight the need for widespread biosecurity interventions to prevent the inadvertent introduction of invasive non-native species (INNS) as the tourism and outdoor recreation sectors grow
Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015:a systematic analysis for the Global Burden of Disease Study 2015
Background Non-fatal outcomes of disease and injury increasingly detract from the ability of the world's population to live in full health, a trend largely attributable to an epidemiological transition in many countries from causes affecting children, to non-communicable diseases (NCDs) more common in adults. For the Global Burden of Diseases, Injuries, and Risk Factors Study 2015 (GBD 2015), we estimated the incidence, prevalence, and years lived with disability for diseases and injuries at the global, regional, and national scale over the period of 1990 to 2015.Methods We estimated incidence and prevalence by age, sex, cause, year, and geography with a wide range of updated and standardised analytical procedures. Improvements from GBD 2013 included the addition of new data sources, updates to literature reviews for 85 causes, and the identification and inclusion of additional studies published up to November, 2015, to expand the database used for estimation of non-fatal outcomes to 60 900 unique data sources. Prevalence and incidence by cause and sequelae were determined with DisMod-MR 2.1, an improved version of the DisMod-MR Bayesian meta-regression tool first developed for GBD 2010 and GBD 2013. For some causes, we used alternative modelling strategies where the complexity of the disease was not suited to DisMod-MR 2.1 or where incidence and prevalence needed to be determined from other data. For GBD 2015 we created a summary indicator that combines measures of income per capita, educational attainment, and fertility (the Socio-demographic Index [SDI]) and used it to compare observed patterns of health loss to the expected pattern for countries or locations with similar SDI scores.Findings We generated 9.3 billion estimates from the various combinations of prevalence, incidence, and YLDs for causes, sequelae, and impairments by age, sex, geography, and year. In 2015, two causes had acute incidences in excess of 1 billion: upper respiratory infections (17.2 billion, 95% uncertainty interval [UI] 15.4-19.2 billion) and diarrhoeal diseases (2.39 billion, 2.30-2.50 billion). Eight causes of chronic disease and injury each affected more than 10% of the world's population in 2015: permanent caries, tension-type headache, iron-deficiency anaemia, age-related and other hearing loss, migraine, genital herpes, refraction and accommodation disorders, and ascariasis. The impairment that affected the greatest number of people in 2015 was anaemia, with 2.36 billion (2.35-2.37 billion) individuals affected. The second and third leading impairments by number of individuals affected were hearing loss and vision loss, respectively. Between 2005 and 2015, there was little change in the leading causes of years lived with disability (YLDs) on a global basis. NCDs accounted for 18 of the leading 20 causes of age-standardised YLDs on a global scale. Where rates were decreasing, the rate of decrease for YLDs was slower than that of years of life lost (YLLs) for nearly every cause included in our analysis. For low SDI geographies, Group 1 causes typically accounted for 20-30% of total disability, largely attributable to nutritional deficiencies, malaria, neglected tropical diseases, HIV/AIDS, and tuberculosis. Lower back and neck pain was the leading global cause of disability in 2015 in most countries. The leading cause was sense organ disorders in 22 countries in Asia and Africa and one in central Latin America; diabetes in four countries in Oceania; HIV/AIDS in three southern sub-Saharan African countries; collective violence and legal intervention in two north African and Middle Eastern countries; iron-deficiency anaemia in Somalia and Venezuela; depression in Uganda; onchoceriasis in Liberia; and other neglected tropical diseases in the Democratic Republic of the Congo.Interpretation Ageing of the world's population is increasing the number of people living with sequelae of diseases and injuries. Shifts in the epidemiological profile driven by socioeconomic change also contribute to the continued increase in years lived with disability (YLDs) as well as the rate of increase in YLDs. Despite limitations imposed by gaps in data availability and the variable quality of the data available, the standardised and comprehensive approach of the GBD study provides opportunities to examine broad trends, compare those trends between countries or subnational geographies, benchmark against locations at similar stages of development, and gauge the strength or weakness of the estimates available. Copyright (C) The Author(s). Published by Elsevier Ltd.</p
Five insights from the Global Burden of Disease Study 2019
The Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2019 provides a rules-based synthesis of the available evidence on levels and trends in health outcomes, a diverse set of risk factors, and health system responses. GBD 2019 covered 204 countries and territories, as well as first administrative level disaggregations for 22 countries, from 1990 to 2019. Because GBD is highly standardised and comprehensive, spanning both fatal and non-fatal outcomes, and uses a mutually exclusive and collectively exhaustive list of hierarchical disease and injury causes, the study provides a powerful basis for detailed and broad insights on global health trends and emerging challenges. GBD 2019 incorporates data from 281 586 sources and provides more than 3.5 billion estimates of health outcome and health system measures of interest for global, national, and subnational policy dialogue. All GBD estimates are publicly available and adhere to the Guidelines on Accurate and Transparent Health Estimate Reporting. From this vast amount of information, five key insights that are important for health, social, and economic development strategies have been distilled. These insights are subject to the many limitations outlined in each of the component GBD capstone papers.Peer reviewe