8 research outputs found

    Annotation and curation of human genomic variations: an ELIXIR Implementation Study

    Get PDF
    Background: ELIXIR is an intergovernmental organization, primarily based around European countries, established to host life science resources, including databases, software tools, training material and cloud storage for the scientific community under a single infrastructure. Methods: In 2018, ELIXIR commissioned an international survey on the usage of databases and tools for annotating and curating human genomic variants with the aim of improving ELIXIR resources. The 27-question survey was made available on-line between September and December 2018 to rank the importance and explore the usage and limitations of a wide range of databases and tools for annotating and curating human genomic variants, including resources specific for next generation sequencing, research into mitochondria and protein structure. Results: Eighteen countries participated in the survey and a total of 92 questionnaires were collected and analysed. Most respondents (89%, n=82) were from academia or a research environment. 51% (n=47) of respondents gave answers on behalf of a small research group (10 people). The survey showed that the scientific community considers several resources supported by ELIXIR crucial or very important. Moreover, it showed that the work done by ELIXIR is greatly valued. In particular, most respondents acknowledged the importance of key features and benefits promoted by ELIXIR, such as the verified scientific quality and maintenance of ELIXIR-approved resources. Conclusions ELIXIR is a "one-stop-shop" that helps researchers identify the most suitable, robust and well-maintained bioinformatics resources for delivering their research tasks

    SwissGenVar: A platform for clinical grade interpretation of genetic variants to foster personalized health care in Switzerland

    Full text link
    Large-scale next-generation sequencing (NGS) germline testing is technically feasible today, but variant interpretation represents a major bottleneck in analysis workflows including the extensive variant prioritization, annotation, and time-consuming evidence curation. The scale of the interpretation problem is massive, and variants of uncertain significance (VUS) are a challenge to personalized medicine. This challenge is further compounded by the complexity and heterogeneity of standards used to describe genetic variants and associated phenotypes when searching for relevant information to inform clinical decision-making. For this purpose, all five Swiss academic Medical Genetics Institutions joined forces with the Swiss Institute of Bioinformatics (SIB) to create SwissGenVar as a user-friendly nationwide repository and sharing platform for genetic variant data generated during routine diagnostic procedures and research sequencing projects. Its objective is to provide a protected environment for expert evidence sharing about individual variants to harmonize and up-scale their significance interpretation at clinical grade following international standards. To corroborate the clinical assessment, the variant-related data are combined with consented high-quality clinical information. Broader visibility will be gained by interfacing with international databases, thus supporting global initiatives in personalized health care

    SwissGenVar: A Platform for Clinical-Grade Interpretation of Genetic Variants to Foster Personalized Healthcare in Switzerland

    Get PDF
    Large-scale next-generation sequencing (NGS) germline testing is technically feasible today, but variant interpretation represents a major bottleneck in analysis workflows. This includes extensive variant prioritization, annotation, and time-consuming evidence curation. The scale of the interpretation problem is massive, and variants of uncertain significance (VUSs) are a challenge to personalized medicine. This challenge is further compounded by the complexity and heterogeneity of the standards used to describe genetic variants and the associated phenotypes when searching for relevant information to support clinical decision making. To address this, all five Swiss academic institutions for Medical Genetics joined forces with the Swiss Institute of Bioinformatics (SIB) to create SwissGenVar as a user-friendly nationwide repository and sharing platform for genetic variant data generated during routine diagnostic procedures and research sequencing projects. Its aim is to provide a protected environment for expert evidence sharing about individual variants to harmonize and upscale their significance interpretation at the clinical grade according to international standards. To corroborate the clinical assessment, the variant-related data will be combined with consented high-quality clinical information. Broader visibility will be achieved by interfacing with international databases, thus supporting global initiatives in personalized healthcare

    Text-Mining Services of the Swiss Variant Interpretation Platform for Oncology

    No full text
    The Swiss Variant Interpretation Platform for Oncology is a centralized, joint and curated database for clinical somatic variants piloted by a board of Swiss healthcare institutions and operated by the SIB Swiss Institute of Bioinformatics. To support this effort, SIB Text Mining designed a set of text analytics services. This report focuses on three of those services. First, the automatic annotations of the literature with a set of terminologies have been performed, resulting in a large annotated version of MEDLINE and PMC. Second, a generator of variant synonyms for single nucleotide variants has been developed using publicly available data resources, as well as patterns of non-standard formats, often found in the literature. Third, a literature ranking service enables to retrieve a ranked set of MEDLINE abstracts given a variant and optionally a diagnosis. The annotation of MEDLINE and PMC resulted in a total of respectively 785,181,199 and 1,156,060,212 annotations, which means an average of 26 and 425 annotations per abstract and full-text article. The generator of variant synonyms enables to retrieve up to 42 synonyms for a variant. The literature ranking service reaches a precision (P10) of 63%, which means that almost two-thirds of the top-10 returned abstracts are judged relevant. Further services will be implemented to complete this set of services, such as a service to retrieve relevant clinical trials for a patient and a literature ranking service for full-text articles.ISSN:0926-9630ISSN:1879-836

    IMGT, the international ImMunoGeneTics database

    No full text
    IMGT, the international ImMunoGeneTics database (http://imgt.cnusc. fr:8104), is a high-quality integrated database specialising in Immunoglobulins (Ig), T cell Receptors (TcR) and Major Histocompatibility Complex (MHC) molecules of all vertebrate species, created in 1989 by Marie-Paule Lefranc, Université Montpellier II, CNRS, Montpellier, France ([email protected]). IMGT comprises three databases: LIGM-DB, a comprehensive database of Ig and TcR, MHC/HLA-DB, and PRIMER-DB (the last two in development); a tool, IMGT/DNAPLOT, developed for sequence analysis and alignments; and expertised data based on the IMGT scientific chart, the IMGT repertoire. By its high quality and its easy data distribution, IMGT has important implications in medical research (repertoire in autoimmune diseases, AIDS, leukemias, lymphomas), therapeutic approaches (antibody engineering), genome diversity and genome evolution studies. IMGT is freely available at http://imgt.cnusc. fr:810

    Viral Metagenomics in the Clinical Realm: Lessons Learned from a Swiss-Wide Ring Trial.

    Get PDF
    Shotgun metagenomics using next generation sequencing (NGS) is a promising technique to analyze both DNA and RNA microbial material from patient samples. Mostly used in a research setting, it is now increasingly being used in the clinical realm as well, notably to support diagnosis of viral infections, thereby calling for quality control and the implementation of ring trials (RT) to benchmark pipelines and ensure comparable results. The Swiss NGS clinical virology community therefore decided to conduct a RT in 2018, in order to benchmark current metagenomic workflows used at Swiss clinical virology laboratories, and thereby contribute to the definition of common best practices. The RT consisted of two parts (increments), in order to disentangle the variability arising from the experimental compared to the bioinformatics parts of the laboratory pipeline. In addition, the RT was also designed to assess the impact of databases compared to bioinformatics algorithms on the final results, by asking participants to perform the bioinformatics analysis with a common database, in addition to using their own in-house database. Five laboratories participated in the RT (seven pipelines were tested). We observed that the algorithms had a stronger impact on the overall performance than the choice of the reference database. Our results also suggest that differences in sample preparation can lead to significant differences in the performance, and that laboratories should aim for at least 5-10 Mio reads per sample and use depth of coverage in addition to other interpretation metrics such as the percent of coverage. Performance was generally lower when increasing the number of viruses per sample. The lessons learned from this pilot study will be useful for the development of larger-scale RTs to serve as regular quality control tests for laboratories performing NGS analyses of viruses in a clinical setting

    Viral metagenomics in the clinical realm: lessons learned from a Swiss-wide ring trial

    Get PDF
    Shotgun metagenomics using next generation sequencing (NGS) is a promising technique to analyze both DNA and RNA microbial material from patient samples. Mostly used in a research setting, it is now increasingly being used in the clinical realm as well, notably to support diagnosis of viral infections, thereby calling for quality control and the implementation of ring trials (RT) to benchmark pipelines and ensure comparable results. The Swiss NGS clinical virology community therefore decided to conduct a RT in 2018, in order to benchmark current metagenomic workflows used at Swiss clinical virology laboratories, and thereby contribute to the definition of common best practices. The RT consisted of two parts (increments), in order to disentangle the variability arising from the experimental compared to the bioinformatics parts of the laboratory pipeline. In addition, the RT was also designed to assess the impact of databases compared to bioinformatics algorithms on the final results, by asking participants to perform the bioinformatics analysis with a common database, in addition to using their own in-house database. Five laboratories participated in the RT (seven pipelines were tested). We observed that the algorithms had a stronger impact on the overall performance than the choice of the reference database. Our results also suggest that differences in sample preparation can lead to significant differences in the performance, and that laboratories should aim for at least 5-10 Mio reads per sample and use depth of coverage in addition to other interpretation metrics such as the percent of coverage. Performance was generally lower when increasing the number of viruses per sample. The lessons learned from this pilot study will be useful for the development of larger-scale RTs to serve as regular quality control tests for laboratories performing NGS analyses of viruses in a clinical setting

    NGS-Based S. aureus Typing and Outbreak Analysis in Clinical Microbiology Laboratories: Lessons Learned From a Swiss-Wide Proficiency Test

    Get PDF
    Whole genome sequencing (WGS) enables high resolution typing of bacteria up to the single nucleotide polymorphism (SNP) level. WGS is used in clinical microbiology laboratories for infection control, molecular surveillance and outbreak analyses. Given the large palette of WGS reagents and bioinformatics tools, the Swiss clinical bacteriology community decided to conduct a ring trial (RT) to foster harmonization of NGS-based bacterial typing. The RT aimed at assessing methicillin-susceptible Staphylococcus aureus strain relatedness from WGS and epidemiological data. The RT was designed to disentangle the variability arising from differences in sample preparation, SNP calling and phylogenetic methods. Nine laboratories participated. The resulting phylogenetic tree and cluster identification were highly reproducible across the laboratories. Cluster interpretation was, however, more laboratory dependent, suggesting that an increased sharing of expertise across laboratories would contribute to further harmonization of practices. More detailed bioinformatic analyses unveiled that while similar clusters were found across laboratories, these were actually based on different sets of SNPs, differentially retained after sample preparation and SNP calling procedures. Despite this, the observed number of SNP differences between pairs of strains, an important criterion to determine strain relatedness given epidemiological information, was similar across pipelines for closely related strains when restricting SNP calls to a common core genome defined by S. aureus cgMLST schema. The lessons learned from this pilot study will serve the implementation of larger-scale RT, as a mean to have regular external quality assessments for laboratories performing WGS analyses in a clinical setting
    corecore