376 research outputs found

    The IMEx coronavirus interactome: an evolving map of Coronaviridae-host molecular interactions

    The current coronavirus disease of 2019 (COVID-19) pandemic, caused by the severe acute respiratory syndrome coronavirus (SARS-CoV)-2, has spurred a wave of research of nearly unprecedented scale. Among the different strategies that are being used to understand the disease and develop effective treatments, the study of physical molecular interactions can provide fine-grained resolution of the mechanisms behind the virus biology and the human organism response. We present a curated dataset of physical molecular interactions focused on proteins from SARS-CoV-2, SARS-CoV-1 and other members of the Coronaviridae family that has been manually extracted by International Molecular Exchange (IMEx) Consortium curators. Currently, the dataset comprises over 4400 binarized interactions extracted from 151 publications. The dataset can be accessed in the standard formats recommended by the Proteomics Standards Initiative (HUPO-PSI) at the IntAct database website (https://www.ebi.ac.uk/intact) and will be continuously updated as research on COVID-19 progresses

    Improving the Gene Ontology Resource to Facilitate More Informative Analysis and Interpretation of Alzheimer's Disease Data

    The analysis and interpretation of high-throughput datasets relies on access to high-quality bioinformatics resources, as well as processing pipelines and analysis tools. Gene Ontology (GO, geneontology.org) is a major resource for gene enrichment analysis. The aim of this project, funded by the Alzheimer's Research United Kingdom (ARUK) foundation and led by the University College London (UCL) biocuration team, was to enhance the GO resource by developing new neurological GO terms, and use GO terms to annotate gene products associated with dementia. Specifically, proteins and protein complexes relevant to processes involving amyloid-beta and tau have been annotated and the resulting annotations are denoted in GO databases as 'ARUK-UCL'. Biological knowledge presented in the scientific literature was captured through the association of GO terms with dementia-relevant protein records; GO itself was revised, and new GO terms were added. This literature biocuration increased the number of Alzheimer's-relevant gene products that were being associated with neurological GO terms, such as 'amyloid-beta clearance' or 'learning or memory', as well as neuronal structures and their compartments. Of the total 2055 annotations that we contributed for the prioritised gene products, 526 have associated proteins and complexes with neurological GO terms. To ensure that these descriptive annotations could be provided for Alzheimer's-relevant gene products, over 70 new GO terms were created. Here, we describe how the improvements in ontology development and biocuration resulting from this initiative can benefit the scientific community and enhance the interpretation of dementia data

    Encompassing new use cases - level 3.0 of the HUPO-PSI format for molecular interactions.

    BACKGROUND: Systems biologists study interaction data to understand the behaviour of whole cell systems, and their environment, at a molecular level. In order to effectively achieve this goal, it is critical that researchers have high quality interaction datasets available to them, in a standard data format, and also a suite of tools with which to analyse such data and form experimentally testable hypotheses from them. The PSI-MI XML standard interchange format was initially published in 2004, and expanded in 2007 to enable the download and interchange of molecular interaction data. PSI-XML2.5 was designed to describe experimental data and to date has fulfilled this basic requirement. However, new use cases have arisen that the format cannot properly accommodate. These include data abstracted from more than one publication such as allosteric/cooperative interactions and protein complexes, dynamic interactions and the need to link kinetic and affinity data to specific mutational changes. RESULTS: The Molecular Interaction workgroup of the HUPO-PSI has extended the existing, well-used XML interchange format for molecular interaction data to meet new use cases and enable the capture of new data types, following extensive community consultation. PSI-MI XML3.0 expands the capabilities of the format beyond simple experimental data, with a concomitant update of the tool suite which serves this format. The format has been implemented by key data producers such as the International Molecular Exchange (IMEx) Consortium of protein interaction databases and the Complex Portal. CONCLUSIONS: PSI-MI XML3.0 has been developed by the data producers, data users, tool developers and database providers who constitute the PSI-MI workgroup. This group now actively supports PSI-MI XML2.5 as the main interchange format for experimental data, PSI-MI XML3.0 which additionally handles more complex data types, and the simpler, tab-delimited MITAB2.5, 2.6 and 2.7 for rapid parsing and download.
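
    To illustrate why the tab-delimited MITAB format mentioned above lends itself to rapid parsing, here is a minimal Python sketch that reads MITAB 2.5 rows. It assumes the 15-column MITAB 2.5 layout, with "|" separating multiple values in a field and "-" marking an empty field. The column labels, the function name parse_mitab25 and the sample row (including its identifiers) are hypothetical and chosen only for this illustration; the sketch also ignores the rare case of a "|" inside a quoted name.

```python
import csv
import io

# Informal labels for the 15 MITAB 2.5 columns (names invented for this sketch).
MITAB25_COLUMNS = [
    "id_a", "id_b", "alt_ids_a", "alt_ids_b", "aliases_a", "aliases_b",
    "detection_methods", "first_authors", "publication_ids",
    "taxid_a", "taxid_b", "interaction_types", "source_databases",
    "interaction_ids", "confidence_values",
]


def parse_mitab25(text):
    """Yield one dict per interaction row.

    Each field becomes a list of values: '|' separates multiple values
    and a lone '-' is treated as an empty field.
    """
    # QUOTE_NONE keeps the double quotes used in PSI-MI terms as plain text.
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and the optional header line
        if len(row) < len(MITAB25_COLUMNS):
            raise ValueError(f"expected at least 15 columns, got {len(row)}")
        yield {
            name: [] if value == "-" else value.split("|")
            for name, value in zip(MITAB25_COLUMNS, row)
        }


# Hypothetical sample row: the identifiers below are placeholders, not real records.
SAMPLE = "\t".join([
    "uniprotkb:P00001", "uniprotkb:P00002", "-", "-", "-", "-",
    'psi-mi:"MI:0018"(two hybrid)', "Doe et al. (2020)", "pubmed:00000000",
    "taxid:9606(human)", "taxid:9606(human)",
    'psi-mi:"MI:0915"(physical association)', 'psi-mi:"MI:0469"(IntAct)',
    "intact:EBI-0000000", "-",
]) + "\n"

for interaction in parse_mitab25(SAMPLE):
    print(interaction["id_a"], interaction["id_b"], interaction["detection_methods"])
```

    MITAB 2.6 and 2.7 append further columns to the same layout, so the sketch reads only the first 15 fields of each row and ignores the rest.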

    Sample richness and genetic diversity as drivers of chimera formation in nSSU metagenetic analyses

    Eukaryotic diversity in environmental samples is often assessed via PCR-based amplification of nSSU genes. However, estimates of diversity derived from pyrosequencing environmental data sets are often inflated, mainly because of the formation of chimeric sequences during PCR amplification. Chimeras are hybrid products composed of distinct parental sequences that can lead to the misinterpretation of diversity estimates. We have analyzed the effect of sample richness, evenness and phylogenetic diversity on the formation of chimeras using a nSSU data set derived from 454 Roche pyrosequencing of replicated, large control pools of closely and distantly related nematode mock communities, of known intragenomic identity and richness. To further investigate how chimeric molecules are formed, the nSSU gene secondary structure was analyzed in several individuals. For the first time in eukaryotes, chimera formation proved to be higher in both richer and more genetically diverse samples, thus providing a novel perspective of chimera formation in pyrosequenced environmental data sets. Findings contribute to a better understanding of the nature and mechanisms involved in chimera formation during PCR amplification of environmentally derived DNA. Moreover, given the similarities between biodiversity analyses using amplicon sequencing and those used to assess genomic variation, our findings have potential broad application for identifying genetic variation in homologous loci or multigene families in general

    Capturing variation impact on molecular interactions in the IMEx Consortium mutations data set

    The current wealth of genomic variation data identified at nucleotide level presents the challenge of understanding by which mechanisms amino acid variation affects cellular processes. These effects may manifest as distinct phenotypic differences between individuals or result in the development of disease. Physical interactions between molecules are the linking steps underlying most, if not all, cellular processes. Understanding the effects that sequence variation has on a molecule's interactions is a key step towards connecting mechanistic characterization of nonsynonymous variation to phenotype. We present an open access resource created over 14 years by IMEx database curators, featuring 28,000 annotations describing the effect of small sequence changes on physical protein interactions. We describe how this resource was built, the formats in which the data is provided and offer a descriptive analysis of the data set. The data set is publicly available through the IntAct website and is enhanced with every monthly release

    Incidence trend and risk factors for campylobacter infections in humans in Norway

    BACKGROUND: The objectives of the study were to evaluate whether the increase in incidence of campylobacteriosis observed in humans in Norway from 1995 to 2001 was statistically significant and whether different biologically plausible risk factors were associated with the incidence of campylobacteriosis in the different counties in Norway. METHODS: To model the incidence of domestically acquired campylobacteriosis from 1995 to 2001, a population average random effect Poisson model was applied (the trend model). An equivalent model accounting for geographical clustering (the ecological model) was applied to case data and assumed risk-factor/protective data from 2000 and 2001, such as sale of chicken, receiving treated drinking water, density of dogs and grazing animals, occupation of people in the municipalities and climatic factors. RESULTS: The increase in incidence of campylobacteriosis in humans in Norway from 1995 to 2001 was statistically significant from 1998. Treated water was a protective factor against Campylobacter infections in humans with an IRR of 0.78 per percentage increase in people supplied. The two-level modelling technique showed no evidence of clustering of campylobacteriosis in any particular county. Aggregation of data on municipality level makes interpretation of the results at the individual level difficult. CONCLUSION: The increase in incidence of Campylobacter infections in humans from 1995 to 2001 was statistically significant from 1998. Treated water was a protective factor against Campylobacter infections in humans with an IRR of 0.78 per percentage increase in people supplied. Campylobacter infections did not appear to be clustered in any particular county in Norway.

    Allosteric Modulation of the HIV-1 gp120-gp41 Association Site by Adjacent gp120 Variable Region 1 (V1) N-Glycans Linked to Neutralization Sensitivity

    The HIV-1 gp120-gp41 complex, which mediates viral fusion and cellular entry, undergoes rapid evolution within its external glycan shield to enable escape from neutralizing antibody (NAb). Understanding how conserved protein determinants retain functionality in the context of such evolution is important for their evaluation and exploitation as potential drug and/or vaccine targets. In this study, we examined how the conserved gp120-gp41 association site, formed by the N- and C-terminal segments of gp120 and the disulfide-bonded region (DSR) of gp41, adapts to glycan changes that are linked to neutralization sensitivity. To this end, a DSR mutant virus (K601D) with defective gp120-association was sequentially passaged in peripheral blood mononuclear cells to select suppressor mutations. We reasoned that the locations of suppressors point to structural elements that are functionally linked to the gp120-gp41 association site. In culture 1, gp120 association and viral replication were restored by loss of the conserved glycan at Asn136 in V1 (T138N mutation) in conjunction with the L494I substitution in C5 within the association site. In culture 2, replication was restored with deletion of the N139INN sequence, which ablates the overlapping Asn141-Asn142-Ser-Ser potential N-linked glycosylation sequons in V1, in conjunction with D601N in the DSR. The 136 and 142 glycan mutations appeared to exert their suppressive effects by altering the dependence of gp120-gp41 interactions on the DSR residues, Leu593, Trp596 and Lys601. The 136 and/or 142 glycan mutations increased the sensitivity of HIV-1 pseudovirions to the glycan-dependent NAbs 2G12 and PG16, and also pooled IgG obtained from HIV-1-infected individuals. Thus adjacent V1 glycans allosterically modulate the distal gp120-gp41 association site. We propose that this represents a mechanism for functional adaptation of the gp120-gp41 association site to an evolving glycan shield in a setting of NAb selection.

    Spatial and Temporal Dynamics of Hepatitis B Virus D Genotype in Europe and the Mediterranean Basin

    Hepatitis B virus genotype D can be found in many parts of the world and is the most prevalent strain in south-eastern Europe, the Mediterranean Basin, the Middle East, and the Indian sub-continent. The epidemiological history of the D genotype and its subgenotypes is still obscure because of the scarcity of appropriate studies. We retrieved from public databases a total of 312 gene P sequences of HBV genotype D isolated in various countries throughout the world, and reconstructed the spatio-temporal evolutionary dynamics of the HBV-D epidemic using a Bayesian framework.