421 research outputs found

    A gene-by-gene population genomics platform: de novo assembly, annotation and genealogical analysis of 108 representative Neisseria meningitidis genomes

    Background: Highly parallel, ‘second generation’ sequencing technologies have rapidly expanded the number of bacterial whole genome sequences available for study, permitting the emergence of the discipline of population genomics. Most of these data are publicly available as unassembled short-read sequence files that require extensive processing before they can be used for analysis. The provision of data in a uniform format, which can be easily assessed for quality, linked to provenance and phenotype and used for analysis, is therefore necessary. Results: The performance of de novo short-read assembly followed by automatic annotation using the pubMLST.org Neisseria database was assessed and evaluated for 108 diverse, representative, and well-characterised Neisseria meningitidis isolates. High-quality sequences were obtained for >99% of known meningococcal genes among the de novo assembled genomes and four resequenced genomes, and less than 1% of reassembled genes had sequence discrepancies or misassembled sequences. A core genome of 1600 loci, present in at least 95% of the population, was determined using the Genome Comparator tool. Genealogical relationships compatible with, but at a higher resolution than, those identified by multilocus sequence typing were obtained with core genome comparisons and ribosomal protein gene analysis, which revealed a genomic structure for a number of previously described phenotypes. This unified system for cataloguing Neisseria genetic variation in the genome was implemented and used for multiple analyses, and the data are publicly available in the PubMLST Neisseria database. Conclusions: The de novo assembly, combined with automated gene-by-gene annotation, generates high quality draft genomes in which the majority of protein-encoding genes are present with high accuracy. The approach catalogues diversity efficiently, permits analyses of a single genome or multiple genome comparisons, and is a practical approach to interpreting WGS data for large bacterial population samples. The method generates novel insights into the biology of the meningococcus and improves our understanding of the whole population structure, not just disease-causing lineages.

    Twenty-eight divergent polysaccharide loci specifying within- and amongst-strain capsule diversity in three strains of Bacteroides fragilis

    Comparison of the complete genome sequence of Bacteroides fragilis 638R, originally isolated in the USA, was made with two previously sequenced strains isolated in the UK (NCTC 9343) and Japan (YCH46). The presence of 10 loci containing genes associated with polysaccharide (PS) biosynthesis, each including a putative Wzx flippase and Wzy polymerase, was confirmed in all three strains, despite a lack of cross-reactivity between NCTC 9343 and 638R surface PS-specific antibodies by immunolabelling and microscopy. Genomic comparisons revealed an exceptional level of PS biosynthesis locus diversity. Of the 10 divergent PS-associated loci apparent in each strain, none is similar between NCTC 9343 and 638R. YCH46 shares one locus with NCTC 9343, confirmed by mAb labelling, and a second different locus with 638R, making a total of 28 divergent PS biosynthesis loci amongst the three strains. The lack of expression of the phase-variable large capsule (LC) in strain 638R, observed in NCTC 9343, is likely to be due to a point mutation that generates a stop codon within a putative initiating glycosyltransferase, necessary for the expression of the LC in NCTC 9343. Other major sequence differences were observed to arise from different numbers and variety of inserted extra-chromosomal elements, in particular prophages. Extensive horizontal gene transfer has occurred within these strains, despite the presence of a significant number of divergent DNA restriction and modification systems that act to prevent acquisition of foreign DNA. The level of amongst-strain diversity in PS biosynthesis loci is unprecedented

    Antigenic diversity is generated by distinct evolutionary mechanisms in African trypanosome species

    Antigenic variation enables pathogens to avoid the host immune response by continual switching of surface proteins. The protozoan blood parasite Trypanosoma brucei causes human African trypanosomiasis ("sleeping sickness") across sub-Saharan Africa and is a model system for antigenic variation, surviving by periodically replacing a monolayer of variant surface glycoproteins (VSG) that covers its cell surface. We compared the genome of Trypanosoma brucei with two closely related parasites, Trypanosoma congolense and Trypanosoma vivax, to reveal how the variant antigen repertoire has evolved and how it might affect contemporary antigenic diversity. We reconstruct VSG diversification showing that Trypanosoma congolense uses variant antigens derived from multiple ancestral VSG lineages, whereas in Trypanosoma brucei VSG have recent origins, and ancestral gene lineages have been repeatedly co-opted to novel functions. These historical differences are reflected in fundamental differences between species in the scale and mechanism of recombination. Using phylogenetic incompatibility as a metric for genetic exchange, we show that the frequency of recombination is comparable between Trypanosoma congolense and Trypanosoma brucei but is much lower in Trypanosoma vivax. Furthermore, in showing that the C-terminal domain of Trypanosoma brucei VSG plays a crucial role in facilitating exchange, we reveal substantial species differences in the mechanism of VSG diversification. Our results demonstrate how past VSG evolution indirectly determines the ability of contemporary parasites to generate novel variant antigens through recombination and suggest that the current model for antigenic variation in Trypanosoma brucei is only one means by which these parasites maintain chronic infections.

    Identification of two novel mutations in CDHR1 in consanguineous Spanish families with autosomal recessive retinal dystrophy.

    Inherited retinal dystrophies present extensive phenotypic and genetic heterogeneity, posing a challenge for patients' molecular and clinical diagnoses. In this study, we aimed to clinically characterize and investigate the molecular etiology of an atypical form of autosomal recessive retinal dystrophy in two consanguineous Spanish families. Affected members of the respective families exhibited an array of clinical features including reduced visual acuity, photophobia, defective color vision, reduced or absent ERG responses, macular atrophy and pigmentary deposits in the peripheral retina. Genetic investigation included autozygosity mapping coupled with exome sequencing in the first family, whereas autozygome-guided candidate gene screening was performed by means of Sanger DNA sequencing in the second family. Our approach revealed nucleotide changes in CDHR1: a homozygous missense variant (c.1720C>G, p.P574A) and a homozygous single base transition (c.1485+2T>C) affecting the canonical 5' splice site of intron 13, respectively. Both changes co-segregated with the disease and were absent among cohorts of unrelated control individuals. To date, only five mutations in CDHR1 have been identified, all resulting in premature stop codons leading to mRNA nonsense-mediated decay. Our work reports two previously unidentified homozygous mutations in CDHR1, further expanding the mutational spectrum of this gene.

    The genome sequence of the European golden eagle, Aquila chrysaetos chrysaetos Linnaeus 1758.

    We present a genome assembly from an individual female Aquila chrysaetos chrysaetos (the European golden eagle; Chordata; Aves; Accipitridae). The genome sequence is 1.23 gigabases in span. The majority of the assembly is scaffolded into 28 chromosomal pseudomolecules, including the W and Z sex chromosomes.

    The genome sequence of the Norway rat, Rattus norvegicus Berkenhout 1769.

    We present a genome assembly from an individual male Rattus norvegicus (the Norway rat; Chordata; Mammalia; Rodentia; Muridae). The genome sequence is 2.44 gigabases in span. The majority of the assembly is scaffolded into 20 chromosomal pseudomolecules, with both X and Y sex chromosomes assembled. This genome assembly, mRatBN7.2, represents the new reference genome for R. norvegicus and has been adopted by the Genome Reference Consortium.

    A Comprehensive Analysis of Choroideremia: From Genetic Characterization to Clinical Practice.

    Choroideremia (CHM) is a rare X-linked disease leading to progressive retinal degeneration resulting in blindness. The disorder is caused by mutations in the CHM gene encoding REP-1 protein, an essential component of the Rab geranylgeranyltransferase (GGTase) complex. In the present study, we evaluated a multi-technique analysis algorithm to describe the mutational spectrum identified in a large cohort of cases and further correlate CHM variants with phenotypic characteristics and biochemical defects of choroideremia patients. Molecular genetic testing led to the characterization of 36 out of 45 unrelated CHM families (80%), allowing the clinical reclassification of four CHM families. Haplotype reconstruction showed independent origins for the recurrent p.Arg293* and p.Lys178Argfs*5 mutations, suggesting the presence of hotspots in CHM, as well as the identification of two different unrelated events involving exon 9 deletion. No certain genotype-phenotype correlation could be established. Furthermore, all the patients' fibroblasts analyzed presented significantly increased levels of unprenylated Rab proteins compared to control cells; however, this was not related to the genotype. This research demonstrates the major potential of the proposed algorithm for diagnosis. Our data underscore the importance of establishing a differential diagnosis with other retinal dystrophies, supporting the idea of an underestimated prevalence of choroideremia. Moreover, they suggest that the severity of the disorder cannot be exclusively explained by the genotype.