
    Reconciliation Revisited: Handling Multiple Optima when Reconciling with Duplication, Transfer, and Loss

    Phylogenetic tree reconciliation is a powerful approach for inferring evolutionary events like gene duplication, horizontal gene transfer, and gene loss, which are fundamental to our understanding of molecular evolution. While duplication–loss (DL) reconciliation leads to a unique maximum-parsimony solution, duplication-transfer-loss (DTL) reconciliation yields a multitude of optimal solutions, making it difficult to infer the true evolutionary history of the gene family. This problem is further exacerbated by the fact that different event cost assignments yield different sets of optimal reconciliations. Here, we present an effective, efficient, and scalable method for dealing with these fundamental problems in DTL reconciliation. Our approach works by sampling the space of optimal reconciliations uniformly at random and aggregating the results. We show that even gene trees with only a few dozen genes often have millions of optimal reconciliations and present an algorithm to efficiently sample the space of optimal reconciliations uniformly at random in O(mn^{2}) time per sample, where m and n denote the number of genes and species, respectively. We use these samples to understand how different optimal reconciliations vary in their node mappings and event assignments and to investigate the impact of varying event costs. We apply our method to a biological dataset of approximately 4700 gene trees from 100 taxa and observe that 93% of event assignments and 73% of mappings remain consistent across different multiple optima. Our analysis represents the first systematic investigation of the space of optimal DTL reconciliations and has many important implications for the study of gene family evolution. Funding: National Science Foundation (U.S.) (CAREER Award 0644282); National Institutes of Health (U.S.) (Grant RC2 HG005639); National Science Foundation (U.S.), Assembling the Tree of Life (Program) (Grant 0936234).
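    The idea of sampling optimal solutions uniformly at random can be illustrated on a far simpler problem than DTL reconciliation. The sketch below is not the authors' algorithm: it assumes a hypothetical toy problem (minimum-cost paths through a grid of integer costs, moving only right or down) and uses the standard count-then-sample technique, in which a dynamic program counts the optimal completions from every state and each step is then chosen with probability proportional to that count. The function names `count_optimal_paths` and `sample_optimal_path` are illustrative only.

```python
import random


def count_optimal_paths(cost):
    """For every cell, compute the minimum cost to reach the bottom-right
    corner and the number of distinct paths achieving that minimum.
    Costs are assumed to be integers so that equality tests are exact."""
    rows, cols = len(cost), len(cost[0])
    best = [[0] * cols for _ in range(rows)]
    ways = [[0] * cols for _ in range(rows)]
    for r in range(rows - 1, -1, -1):
        for c in range(cols - 1, -1, -1):
            if r == rows - 1 and c == cols - 1:
                best[r][c], ways[r][c] = cost[r][c], 1
                continue
            options = []
            if r + 1 < rows:
                options.append((best[r + 1][c], ways[r + 1][c]))
            if c + 1 < cols:
                options.append((best[r][c + 1], ways[r][c + 1]))
            cheapest = min(b for b, _ in options)
            best[r][c] = cost[r][c] + cheapest
            # Only successors on an optimal path contribute to the count.
            ways[r][c] = sum(w for b, w in options if b == cheapest)
    return best, ways


def sample_optimal_path(cost, rng=random):
    """Draw one minimum-cost path uniformly at random among all optima."""
    best, ways = count_optimal_paths(cost)
    rows, cols = len(cost), len(cost[0])
    r, c = 0, 0
    path = [(r, c)]
    while (r, c) != (rows - 1, cols - 1):
        moves = []
        if r + 1 < rows:
            moves.append((r + 1, c))
        if c + 1 < cols:
            moves.append((r, c + 1))
        cheapest = min(best[nr][nc] for nr, nc in moves)
        optimal = [(nr, nc) for nr, nc in moves if best[nr][nc] == cheapest]
        # Weighting by the number of optimal completions makes every
        # complete optimal path equally likely.
        weights = [ways[nr][nc] for nr, nc in optimal]
        r, c = rng.choices(optimal, weights=weights, k=1)[0]
        path.append((r, c))
    return path
```

    Repeated calls to `sample_optimal_path` give independent draws, which can then be aggregated to see which choices are shared by most optima, in the same spirit as the consistency percentages reported in the abstract.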

    The evolution and appearance of c3 duplications in fish originate an exclusive teleost c3 gene form with anti-inflammatory activity

    12 pages, 6 figures, 3 tables. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. The complement system acts as a first line of defense and promotes organism homeostasis by modulating the fates of diverse physiological processes. Multiple copies of component genes have been previously identified in fish, suggesting a key role for this system in aquatic organisms. Herein, we confirm the presence of three different previously reported complement c3 genes (c3.1, c3.2, c3.3) and identify five additional c3 genes (c3.4, c3.5, c3.6, c3.7, c3.8) in the zebrafish genome. Additionally, we evaluate the mRNA expression levels of the different c3 genes during ontogeny and in different tissues under steady-state and inflammatory conditions. Furthermore, while reconciling the phylogenetic tree with the fish species tree, we uncovered an event of c3 duplication common to all teleost fishes that gave rise to an exclusive c3 paralog (c3.7 and c3.8). These paralogs showed a distinct ability to regulate neutrophil migration in response to injury compared with the other c3 genes and may play a role in maintaining the balance between inflammatory and homeostatic processes in zebrafish. This work has been funded by the project CSD2007-00002 “Aquagenomics” from the Spanish Ministerio de Ciencia e Innovación, the ITN 289209 “FISHFORPHARMA” (EU) and project 201230E057 from the Agencia Estatal Consejo Superior de Investigaciones Científicas (CSIC). Peer reviewed.

    Testing the Ortholog Conjecture with Comparative Functional Genomic Data from Mammals

    A common assumption in comparative genomics is that orthologous genes share greater functional similarity than do paralogous genes (the “ortholog conjecture”). Many methods used to computationally predict protein function are based on this assumption, even though it is largely untested. Here we present the first large-scale test of the ortholog conjecture using comparative functional genomic data from human and mouse. We use the experimentally derived functions of more than 8,900 genes, as well as an independent microarray dataset, to directly assess our ability to predict function using both orthologs and paralogs. Both datasets show that paralogs are often a much better predictor of function than are orthologs, even at lower sequence identities. Among paralogs, those found within the same species are consistently more functionally similar than those found in a different species. We also find that paralogous pairs residing on the same chromosome are more functionally similar than those on different chromosomes, perhaps due to higher levels of interlocus gene conversion between these pairs. In addition to offering implications for the computational prediction of protein function, our results shed light on the relationship between sequence divergence and functional divergence. We conclude that the most important factor in the evolution of function is not amino acid sequence, but rather the cellular context in which proteins act

    Identification of a novel proinsulin-associated SNP and demonstration that proinsulin is unlikely to be a causal factor in subclinical vascular remodelling using Mendelian randomisation

    Background and aims: Increased proinsulin relative to insulin levels have been associated with subclinical atherosclerosis (measured by carotid intima-media thickness (cIMT)) and are predictive of future cardiovascular disease (CVD), independently of established risk factors. The mechanisms linking proinsulin to atherosclerosis and CVD are unclear. A genome-wide meta-analysis has identified nine loci associated with circulating proinsulin levels. Using proinsulin-associated SNPs, we set out to use a Mendelian randomisation approach to test the hypothesis that proinsulin plays a causal role in subclinical vascular remodelling. Methods: We studied the high CVD-risk IMPROVE cohort (n = 3345), which has detailed biochemical phenotyping and repeated, state-of-the-art, high-resolution carotid ultrasound examinations. Genotyping was performed using Illumina Cardio-Metabo and Immuno arrays, which include reported proinsulin-associated loci. Participants with type 2 diabetes (n = 904) were omitted from the analysis. Linear regression was used to identify proinsulin-associated genetic variants. Results: We identified a proinsulin locus on chromosome 15 (rs8029765) and replicated it in data from 20,003 additional individuals. An 11-SNP score, including the previously identified and the chromosome 15 proinsulin-associated loci, was significantly and negatively associated with baseline IMTmean and IMTmax (the primary cIMT phenotypes) but not with progression measures. However, MR-Egger analysis refuted any significant effect of the proinsulin-associated 11-SNP score, and a non-pleiotropic SNP score of three variants (including rs8029765) demonstrated no effect on baseline or progression cIMT measures. Conclusions: We identified a novel proinsulin-associated locus and demonstrated that whilst proinsulin levels are associated with cIMT measures, proinsulin per se is unlikely to have a causative effect on cIMT.

    Genome-Wide Association Study of Circulating Interleukin 6 Levels Identifies Novel Loci

    Interleukin-6 (IL-6) is a multifunctional cytokine with both pro- and anti-inflammatory properties with a heritability estimate of up to 61%. The circulating levels of IL-6 in blood have been associated with an increased risk of complex disease pathogenesis. We conducted a two-staged, discovery, and replication meta genome-wide association study (GWAS) of circulating serum IL-6 levels comprising up to 67 428 (n_{discovery} = 52 654 and n_{replication} = 14 774) individuals of European ancestry. The inverse variance fixed-effects based discovery meta-analysis, followed by replication, led to the identification of two independent loci, IL1F10/IL1RN rs6734238 on Chromosome (Chr) 2q14 (p_{combined} = 1.8 × 10^{−11}) and HLA-DRB1/DRB5 rs660895 on Chr6p21 (p_{combined} = 1.5 × 10^{−10}), in the combined meta-analyses of all samples. We also replicated the IL6R rs4537545 locus on Chr1q21 (p_{combined} = 1.2 × 10^{−122}). Our study identifies novel loci for circulating IL-6 levels uncovering new immunological and inflammatory pathways that may influence IL-6 pathobiology.
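    The inverse variance fixed-effects method named in this abstract can be sketched in a few lines. This is the textbook estimator, not the study's actual pipeline: each study's effect estimate is weighted by the inverse of its squared standard error, and the pooled standard error is the square root of the reciprocal of the summed weights. The function name and the example inputs are hypothetical.

```python
import math


def fixed_effects_meta(betas, ses):
    """Pool per-study effect estimates with inverse-variance weights.

    betas: per-study effect estimates for one variant
    ses:   matching standard errors (all > 0)
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and equal length")
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical example: two studies with equal precision.
beta, se, z, p = fixed_effects_meta([0.1, 0.3], [0.1, 0.1])
```

    With equal standard errors the pooled estimate is the plain average of the two effects, and the pooled standard error shrinks by a factor of the square root of two relative to either study alone.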

    Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci.

    We performed fine mapping of 39 established type 2 diabetes (T2D) loci in 27,206 cases and 57,574 controls of European ancestry. We identified 49 distinct association signals at these loci, including five mapping in or near KCNQ1. 'Credible sets' of the variants most likely to drive each distinct signal mapped predominantly to noncoding sequence, implying that association with T2D is mediated through gene regulation. Credible set variants were enriched for overlap with FOXA2 chromatin immunoprecipitation binding sites in human islet and liver cells, including at MTNR1B, where fine mapping implicated rs10830963 as driving T2D association. We confirmed that the T2D risk allele for this SNP increases FOXA2-bound enhancer activity in islet- and liver-derived cells. We observed allele-specific differences in NEUROD1 binding in islet-derived cells, consistent with evidence that the T2D risk allele increases islet MTNR1B expression. Our study demonstrates how integration of genetic and genomic information can define molecular mechanisms through which variants underlying association signals exert their effects on disease

    HMG-coenzyme A reductase inhibition, type 2 diabetes, and bodyweight: evidence from genetic analysis and randomised trials.

    BACKGROUND: Statins increase the risk of new-onset type 2 diabetes mellitus. We aimed to assess whether this increase in risk is a consequence of inhibition of 3-hydroxy-3-methylglutaryl-CoA reductase (HMGCR), the intended drug target. METHODS: We used single nucleotide polymorphisms in the HMGCR gene, rs17238484 (for the main analysis) and rs12916 (for a subsidiary analysis) as proxies for HMGCR inhibition by statins. We examined associations of these variants with plasma lipid, glucose, and insulin concentrations; bodyweight; waist circumference; and prevalent and incident type 2 diabetes. Study-specific effect estimates per copy of each LDL-lowering allele were pooled by meta-analysis. These findings were compared with a meta-analysis of new-onset type 2 diabetes and bodyweight change data from randomised trials of statin drugs. The effects of statins in each randomised trial were assessed using meta-analysis. FINDINGS: Data were available for up to 223 463 individuals from 43 genetic studies. Each additional rs17238484-G allele was associated with a mean 0·06 mmol/L (95% CI 0·05-0·07) lower LDL cholesterol and higher body weight (0·30 kg, 0·18-0·43), waist circumference (0·32 cm, 0·16-0·47), plasma insulin concentration (1·62%, 0·53-2·72), and plasma glucose concentration (0·23%, 0·02-0·44). The rs12916 SNP had similar effects on LDL cholesterol, bodyweight, and waist circumference. The rs17238484-G allele seemed to be associated with higher risk of type 2 diabetes (odds ratio [OR] per allele 1·02, 95% CI 1·00-1·05); the rs12916-T allele association was consistent (1·06, 1·03-1·09). 
In 129 170 individuals in randomised trials, statins lowered LDL cholesterol by 0·92 mmol/L (95% CI 0·18-1·67) at 1-year of follow-up, increased bodyweight by 0·24 kg (95% CI 0·10-0·38 in all trials; 0·33 kg, 95% CI 0·24-0·42 in placebo or standard care controlled trials and -0·15 kg, 95% CI -0·39 to 0·08 in intensive-dose vs moderate-dose trials) at a mean of 4·2 years (range 1·9-6·7) of follow-up, and increased the odds of new-onset type 2 diabetes (OR 1·12, 95% CI 1·06-1·18 in all trials; 1·11, 95% CI 1·03-1·20 in placebo or standard care controlled trials and 1·12, 95% CI 1·04-1·22 in intensive-dose vs moderate dose trials). INTERPRETATION: The increased risk of type 2 diabetes noted with statins is at least partially explained by HMGCR inhibition. FUNDING: The funding sources are cited at the end of the paper

    Formation of emotional culture as a component of students' innovative culture

    Homozygosity has long been associated with rare, often devastating, Mendelian disorders [1] and Darwin was one of the first to recognise that inbreeding reduces evolutionary fitness [2]. However, the effect of the more distant parental relatedness common in modern human populations is less well understood. Genomic data now allow us to investigate the effects of homozygosity on traits of public health importance by observing contiguous homozygous segments (runs of homozygosity, ROH), which are inferred to be homozygous along their complete length. Given the low levels of genome-wide homozygosity prevalent in most human populations, information is required on very large numbers of people to provide sufficient power [3,4]. Here we use ROH to study 16 health-related quantitative traits in 354,224 individuals from 102 cohorts and find statistically significant associations between summed runs of homozygosity (SROH) and four complex traits: height, forced expiratory lung volume in 1 second (FEV1), general cognitive ability (g) and educational attainment (nominal p < 1 × 10^{−300}, 2.1 × 10^{−6}, 2.5 × 10^{−10}, 1.8 × 10^{−10}). In each case increased homozygosity was associated with decreased trait value, equivalent to the offspring of first cousins being 1.2 cm shorter and having 10 months less education. Similar effect sizes were found across four continental groups and populations with different degrees of genome-wide homozygosity, providing convincing evidence for the first time that homozygosity, rather than confounding, directly contributes to phenotypic variance. Contrary to earlier reports in substantially smaller samples [5,6], no evidence was seen of an influence of genome-wide homozygosity on blood pressure and low density lipoprotein (LDL) cholesterol, or ten other cardio-metabolic traits. 
Since directional dominance is predicted for traits under directional evolutionary selection [7], this study provides evidence that increased stature and cognitive function have been positively selected in human evolution, whereas many important risk factors for late-onset complex diseases may not have been.
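    As a rough illustration of what a summed runs of homozygosity (SROH) value measures, the sketch below scans one individual's genotypes along a chromosome and adds up the physical length of every sufficiently long homozygous run. It is a toy under stated assumptions, not the study's method: genotypes are assumed to be coded as alternate-allele counts (0, 1, 2), any other value (heterozygous or missing) ends a run, and the thresholds `min_snps` and `min_length` are hypothetical. Production ROH callers typically also tolerate occasional heterozygous or missing calls within a run.

```python
def _qualifying_length(positions, first, last, min_snps, min_length):
    """Physical length of the run spanning SNP indices first..last,
    or 0 if the run is too short to count."""
    n_snps = last - first + 1
    length = positions[last] - positions[first]
    if n_snps >= min_snps and length >= min_length:
        return length
    return 0


def summed_roh(positions, genotypes, min_snps=3, min_length=1000):
    """Sum the lengths (in base pairs) of homozygous runs on one chromosome.

    positions: SNP coordinates in ascending order
    genotypes: alternate-allele counts; 0 and 2 are homozygous
    """
    if len(positions) != len(genotypes):
        raise ValueError("positions and genotypes must have equal length")
    total = 0
    start = None  # index of the first SNP in the current run, if any
    for i, g in enumerate(genotypes):
        if g in (0, 2):
            if start is None:
                start = i
        elif start is not None:
            total += _qualifying_length(positions, start, i - 1,
                                        min_snps, min_length)
            start = None
    if start is not None:  # close a run that reaches the last SNP
        total += _qualifying_length(positions, start, len(genotypes) - 1,
                                    min_snps, min_length)
    return total
```

    A per-individual SROH value computed this way across all chromosomes is the kind of quantity that would then be tested for association with each trait.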

    Impact of common genetic determinants of Hemoglobin A1c on type 2 diabetes risk and diagnosis in ancestrally diverse populations : A transethnic genome-wide meta-analysis

    Background: Glycated hemoglobin (HbA1c) is used to diagnose type 2 diabetes (T2D) and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS) have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely biological action as erythrocytic (also associated with erythrocyte traits) or glycemic (associated with other glucose-related traits). In this study, we tested the hypotheses that, in a very large scale GWAS, we would identify more genetic variants associated with HbA1c and that HbA1c variants implicated in erythrocytic biology would affect the diagnostic accuracy of HbA1c. We therefore expanded the number of HbA1c-associated loci and tested the effect of genetic risk-scores comprised of erythrocytic or glycemic variants on incident diabetes prediction and on prevalent diabetes screening performance. Throughout this multiancestry study, we kept a focus on interancestry differences in HbA1c genetics performance that might influence race-ancestry differences in health outcomes. Methods & findings: Using genome-wide association meta-analyses in up to 159,940 individuals from 82 cohorts of European, African, East Asian, and South Asian ancestry, we identified 60 common genetic variants associated with HbA1c. We classified variants as implicated in glycemic, erythrocytic, or unclassified biology and tested whether additive genetic scores of erythrocytic variants (GS-E) or glycemic variants (GS-G) were associated with higher T2D incidence in multiethnic longitudinal cohorts (N = 33,241). Nineteen glycemic and 22 erythrocytic variants were associated with HbA1c at genome-wide significance. GS-G was associated with higher T2D risk (incidence OR = 1.05, 95% CI 1.04-1.06, per HbA1c-raising allele, p = 3 × 10^{−29}); whereas GS-E was not (OR = 1.00, 95% CI 0.99-1.01, p = 0.60). 
In Europeans and Asians, erythrocytic variants in aggregate had only modest effects on the diagnostic accuracy of HbA1c. Yet, in African Americans, the X-linked G6PD G202A variant (T-allele frequency 11%) was associated with an absolute decrease in HbA1c of 0.81%-units (95% CI 0.66-0.96) per allele in hemizygous men, and 0.68%-units (95% CI 0.38-0.97) in homozygous women. The G6PD variant may cause approximately 2% (N = 0.65 million, 95% CI 0.55-0.74) of African American adults with T2D to remain undiagnosed when screened with HbA1c. Limitations include the smaller sample sizes for non-European ancestries and the inability to classify approximately one-third of the variants. Further studies in large multiethnic cohorts with HbA1c, glycemic, and erythrocytic traits are required to better determine the biological action of the unclassified variants. Conclusions: As G6PD deficiency can be clinically silent until illness strikes, we recommend investigation of the possible benefits of screening for the G6PD genotype along with using HbA1c to diagnose T2D in populations of African ancestry or groups where G6PD deficiency is common. Screening with direct glucose measurements, or genetically-informed HbA1c diagnostic thresholds in people with G6PD deficiency, may be required to avoid missed or delayed diagnoses. Peer reviewed.

    Genome-wide meta-analysis of 241,258 adults accounting for smoking behaviour identifies novel loci for obesity traits

    Few genome-wide association studies (GWAS) account for environmental exposures, like smoking, potentially impacting the overall trait variance when investigating the genetic contribution to obesity-related traits. Here, we use GWAS data from 51,080 current smokers and 190,178 nonsmokers (87% European descent) to identify loci influencing BMI and central adiposity, measured as waist circumference and waist-to-hip ratio both adjusted for BMI. We identify 23 novel genetic loci, and 9 loci with convincing evidence of gene-smoking interaction (GxSMK) on obesity-related traits. We show consistent direction of effect for all identified loci and significance for 18 novel and for 5 interaction loci in an independent study sample. These loci highlight novel biological functions, including response to oxidative stress, addictive behaviour, and regulatory functions emphasizing the importance of accounting for environment in genetic analyses. Our results suggest that tobacco smoking may alter the genetic susceptibility to overall adiposity and body fat distribution. Peer reviewed.