190 research outputs found

    Validating Paired-End Read Alignments in Sequence Graphs

    Get PDF
    Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs enable incorporation of full genomic diversity within a population. However, transitioning from a simple string-based reference to graphs requires addressing many computational challenges, one of which concerns accurately mapping sequencing read sets to graphs. Paired-end Illumina sequencing is a commonly used sequencing platform in genomics, where the paired-end distance constraints allow disambiguation of repeats. Many recent works have explored provably good index-based and alignment-based strategies for mapping individual reads to graphs. However, validating distance constraints efficiently over graphs is not trivial, and existing sequence to graph mappers rely on heuristics. We introduce a mathematical formulation of the problem, and provide a new algorithm to solve it exactly. We take advantage of the high sparsity of reference graphs, and use sparse matrix-matrix multiplications (SpGEMM) to build an index which can be queried efficiently by a mapping algorithm for validating the distance constraints. Effectiveness of the algorithm is demonstrated using real reference graphs, including a human MHC variation graph, and a pan-genome de-Bruijn graph built using genomes of 20 B. anthracis strains. While the one-time indexing time can vary from a few minutes to a few hours using our algorithm, answering a million distance queries takes less than a second

    The role of microbiome-host interactions in the development of Alzheimer´s disease

    Get PDF
    Alzheimer`s disease (AD) is the most prevalent cause of dementia. It is often assumed that AD is caused by an aggregation of extracellular beta-amyloid and intracellular tau-protein, supported by a recent study showing reduced brain amyloid levels and reduced cognitive decline under treatment with a beta-amyloid-binding antibody. Confirmation of the importance of amyloid as a therapeutic target notwithstanding, the underlying causes of beta-amyloid aggregation in the human brain, however, remain to be elucidated. Multiple lines of evidence point towards an important role of infectious agents and/or inflammatory conditions in the etiology of AD. Various microorganisms have been detected in the cerebrospinal fluid and brains of AD-patients and have thus been hypothesized to be linked to the development of AD, including Porphyromonas gingivalis (PG) and Spirochaetes. Intriguingly, these microorganisms are also found in the oral cavity under normal physiological conditions, which is often affected by multiple pathologies like caries or tooth loss in AD patients. Oral cavity pathologies are mostly accompanied by a compositional shift in the community of oral microbiota, mainly affecting commensal microorganisms and referred to as ‘dysbiosis’. Oral dysbiosis seems to be at least partly mediated by key pathogens such as PG, and it is associated with a pro-inflammatory state that promotes the destruction of connective tissue in the mouth, possibly enabling the translocation of pathogenic microbiota from the oral cavity to the nervous system. It has therefore been hypothesized that dysbiosis of the oral microbiome may contribute to the development of AD. In this review, we discuss the infectious hypothesis of AD in the light of the oral microbiome and microbiome-host interactions, which may contribute to or even cause the development of AD. We discuss technical challenges relating to the detection of microorganisms in relevant body fluids and approaches for avoiding false-positives, and introduce the antibacterial protein lactoferrin as a potential link between the dysbiotic microbiome and the host inflammatory reaction

    Imputation of KIR Types from SNP Variation Data.

    Get PDF
    Large population studies of immune system genes are essential for characterizing their role in diseases, including autoimmune conditions. Of key interest are a group of genes encoding the killer cell immunoglobulin-like receptors (KIRs), which have known and hypothesized roles in autoimmune diseases, resistance to viruses, reproductive conditions, and cancer. These genes are highly polymorphic, which makes typing expensive and time consuming. Consequently, despite their importance, KIRs have been little studied in large cohorts. Statistical imputation methods developed for other complex loci (e.g., human leukocyte antigen [HLA]) on the basis of SNP data provide an inexpensive high-throughput alternative to direct laboratory typing of these loci and have enabled important findings and insights for many diseases. We present KIR∗IMP, a method for imputation of KIR copy number. We show that KIR∗IMP is highly accurate and thus allows the study of KIRs in large cohorts and enables detailed investigation of the role of KIRs in human disease.This work was supported by the Australian National Health and Medical Research Council (NHMRC), Career Development Fellowship ID 1053756 (S.L.); by a Victorian Life Sciences Computation Initiative (VLSCI) grant number VR0240 on its Peak Computing Facility at the University of Melbourne, an initiative of the Victorian Government, Australia (S.L.); by the UK Multiple Sclerosis Society, grant 894/08 (S.S.); and by the Wellcome Trust and the MRC with partial funding from the National Institute of Health Cambridge Biomedical Research Centre (J.T., J.A.T.). Research at the Murdoch Childrens Research Institute was supported by the Victorian Government's Operational Infrastructure Support Program.This is the final version of the article. It first appeared from Elsevier via http://dx.doi.org/10.1016/j.ajhg.2015.09.00

    Hermeneutics and Nature

    Get PDF
    This paper contributes to the on-going research into the ways in which the humanities transformed the natural sciences in the late Eighteenth and early Nineteenth Centuries. By investigating the relationship between hermeneutics -- as developed by Herder -- and natural history, it shows how the methods used for the study of literary and artistic works played a crucial role in the emergence of key natural-scientific fields, including geography and ecology

    NovoGraph: Human genome graph construction from multiple long-read de novo assemblies [version 2; referees: 2 approved]

    Get PDF
    Genome graphs are emerging as an important novel approach to the analysis of high-throughput human sequencing data. By explicitly representing genetic variants and alternative haplotypes in a mappable data structure, they can enable the improved analysis of structurally variable and hyperpolymorphic regions of the genome. In most existing approaches, graphs are constructed from variant call sets derived from short-read sequencing. As long-read sequencing becomes more cost-effective and enables de novo assembly for increasing numbers of whole genomes, a method for the direct construction of a genome graph from sets of assembled human genomes would be desirable. Such assembly-based genome graphs would encompass the wide spectrum of genetic variation accessible to long-read-based de novo assembly, including large structural variants and divergent haplotypes. Here we present NovoGraph, a method for the construction of a human genome graph directly from a set of de novo assemblies. NovoGraph constructs a genome-wide multiple sequence alignment of all input contigs and creates a graph by merging the input sequences at positions that are both homologous and sequence-identical. NovoGraph outputs resulting graphs in VCF format that can be loaded into third-party genome graph toolkits. To demonstrate NovoGraph, we construct a genome graph with 23,478,835 variant sites and 30,582,795 variant alleles from de novo assemblies of seven ethnically diverse human genomes (AK1, CHM1, CHM13, HG003, HG004, HX1, NA19240). Initial evaluations show that mapping against the constructed graph reduces the average mismatch rate of reads from sample NA12878 by approximately 0.2%, albeit at a slightly increased rate of reads that remain unmapped

    Consumer behaviour and the life-course: shopper reactions to self service grocery shops and supermarkets in England c.1947-1975

    Get PDF
    This is the author accepted manuscript. The final version is available from SAGE Publications via the DOI in this recordThe paper examines the development of self-service grocery shopping from a consumer perspective. Using qualitative data constructed through a nationwide biographical survey and oral histories, it is possible to go beyond contemporary market surveys which give insufficient attention to shopping as a socially and culturally embedded practice. The paper uses the conceptual framework of the life-course, to demonstrate how grocery shopping is a complex activity, in which the retail encounter is shaped by the specific interconnection of different retail formats with consumer characteristics and situational influences. Consumer reactions to retail modernization must be understood in relation to the development of consumer practices at points of transition and stability within the life-course. These practices are accessed by examining retrospective consumer narratives about food shopping

    Distinct genetic architectures and environmental factors associate with host response to the γ 2-herpesvirus infections

    Get PDF
    Abstract: Kaposi’s sarcoma-associated herpesvirus (KSHV) and Epstein-Barr Virus (EBV) establish life-long infections and are associated with malignancies. Striking geographic variation in incidence and the fact that virus alone is insufficient to cause disease, suggests other co-factors are involved. Here we present epidemiological analysis and genome-wide association study (GWAS) in 4365 individuals from an African population cohort, to assess the influence of host genetic and non-genetic factors on virus antibody responses. EBV/KSHV co-infection (OR = 5.71(1.58–7.12)), HIV positivity (OR = 2.22(1.32–3.73)) and living in a more rural area (OR = 1.38(1.01–1.89)) are strongly associated with immunogenicity. GWAS reveals associations with KSHV antibody response in the HLA-B/C region (p = 6.64 × 10−09). For EBV, associations are identified for VCA (rs71542439, p = 1.15 × 10−12). Human leucocyte antigen (HLA) and trans-ancestry fine-mapping substantiate that distinct variants in HLA-DQA1 (p = 5.24 × 10−44) are driving associations for EBNA-1 in Africa. This study highlights complex interactions between KSHV and EBV, in addition to distinct genetic architectures resulting in important differences in pathogenesis and transmission

    Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis.

    Get PDF
    Multiple sclerosis is a common disease of the central nervous system in which the interplay between inflammatory and neurodegenerative processes typically results in intermittent neurological disturbance followed by progressive accumulation of disability. Epidemiological studies have shown that genetic factors are primarily responsible for the substantially increased frequency of the disease seen in the relatives of affected individuals, and systematic attempts to identify linkage in multiplex families have confirmed that variation within the major histocompatibility complex (MHC) exerts the greatest individual effect on risk. Modestly powered genome-wide association studies (GWAS) have enabled more than 20 additional risk loci to be identified and have shown that multiple variants exerting modest individual effects have a key role in disease susceptibility. Most of the genetic architecture underlying susceptibility to the disease remains to be defined and is anticipated to require the analysis of sample sizes that are beyond the numbers currently available to individual research groups. In a collaborative GWAS involving 9,772 cases of European descent collected by 23 research groups working in 15 different countries, we have replicated almost all of the previously suggested associations and identified at least a further 29 novel susceptibility loci. Within the MHC we have refined the identity of the HLA-DRB1 risk alleles and confirmed that variation in the HLA-A gene underlies the independent protective effect attributable to the class I region. Immunologically relevant genes are significantly overrepresented among those mapping close to the identified loci and particularly implicate T-helper-cell differentiation in the pathogenesis of multiple sclerosis

    An appeal for strengthening genomic pathogen surveillance to improve pandemic preparedness and infection prevention: the German perspective

    Get PDF
    The SARS-CoV-2 pandemic has highlighted the importance of viable infection surveillance and the relevant infrastructure. From a German perspective, an integral part of this infrastructure, genomic pathogen sequencing, was at best fragmentary and stretched to its limits due to the lack or inefficient use of equipment, human resources, data management and coordination. The experience in other countries has shown that the rate of sequenced positive samples and linkage of genomic and epidemiological data (person, place, time) represent important factors for a successful application of genomic pathogen surveillance. Planning, establishing and consistently supporting adequate structures for genomic pathogen surveillance will be crucial to identify and combat future pandemics as well as other challenges in infectious diseases such as multi-drug resistant bacteria and healthcare-associated infections. Therefore, the authors propose a multifaceted and coordinated process for the definition of procedural, legal and technical standards for comprehensive genomic pathogen surveillance in Germany, covering the areas of genomic sequencing, data collection and data linkage, as well as target pathogens. A comparative analysis of the structures established in Germany and in other countries is applied. This proposal aims to better tackle epi- and pandemics to come and take action from the “lessons learned” from the SARS-CoV-2 pandemic

    Constitutivism

    Get PDF
    A brief explanation and overview of constitutivism
    corecore