36 research outputs found

    Whole genome sequencing to investigate the emergence of clonal complex 23 Neisseria meningitidis serogroup Y disease in the United States

    Get PDF
    In the United States, serogroup Y, ST-23 clonal complex Neisseria meningitidis was responsible for an increase in meningococcal disease incidence during the 1990s. This increase was accompanied by antigenic shift of three outer membrane proteins, with a decrease in the population that predominated in the early 1990s as a different population emerged later in that decade. To understand factors that may have been responsible for the emergence of serogroup Y disease, we used whole genome pyrosequencing to investigate genetic differences between isolates from early and late N. meningitidis populations, obtained from meningococcal disease cases in Maryland in the 1990s. The genomes of isolates from the early and late populations were highly similar, with 1231 of 1776 shared genes exhibiting 100% amino acid identity and an average πN = 0.0033 and average πS = 0.0216. However, differences were found in predicted proteins that affect pilin structure and antigen profile and in predicted proteins involved in iron acquisition and uptake. The observed changes are consistent with acquisition of new alleles through horizontal gene transfer. Changes in antigen profile due to the genetic differences found in this study likely allowed the late population to emerge due to escape from population immunity. These findings may predict which antigenic factors are important in the cyclic epidemiology of meningococcal disease

    Exopolysaccharide-associated protein sorting in environmental organisms: the PEP-CTERM/EpsH system. Application of a novel phylogenetic profiling heuristic

    Get PDF
    BACKGROUND: Protein translocation to the proper cellular destination may be guided by various classes of sorting signals recognizable in the primary sequence. Detection in some genomes, but not others, may reveal sorting system components by comparison of the phylogenetic profile of the class of sorting signal to that of various protein families. RESULTS: We describe a short C-terminal homology domain, sporadically distributed in bacteria, with several key characteristics of protein sorting signals. The domain includes a near-invariant motif Pro-Glu-Pro (PEP). This possible recognition or processing site is followed by a predicted transmembrane helix and a cluster rich in basic amino acids. We designate this domain PEP-CTERM. It tends to occur multiple times in a genome if it occurs at all, with a median count of eight instances; Verrucomicrobium spinosum has sixty-five. PEP-CTERM-containing proteins generally contain an N-terminal signal peptide and exhibit high diversity and little homology to known proteins. All bacteria with PEP-CTERM have both an outer membrane and exopolysaccharide (EPS) production genes. By a simple heuristic for screening phylogenetic profiles in the absence of pre-formed protein families, we discovered that a homolog of the membrane protein EpsH (exopolysaccharide locus protein H) occurs in a species when PEP-CTERM domains are found. The EpsH family contains invariant residues consistent with a transpeptidase function. Most PEP-CTERM proteins are encoded by single-gene operons preceded by large intergenic regions. In the Proteobacteria, most of these upstream regions share a DNA sequence, a probable cis-regulatory site that contains a sigma-54 binding motif. The phylogenetic profile for this DNA sequence exactly matches that of three proteins: a sigma-54-interacting response regulator (PrsR), a transmembrane histidine kinase (PrsK), and a TPR protein (PrsT). CONCLUSION: These findings are consistent with the hypothesis that PEP-CTERM and EpsH form a protein export sorting system, analogous to the LPXTG/sortase system of Gram-positive bacteria, and correlated to EPS expression. It occurs preferentially in bacteria from sediments, soils, and biofilms. The novel method that led to these findings, partial phylogenetic profiling, requires neither global sequence clustering nor arbitrary similarity cutoffs and appears to be a rapid, effective alternative to other profiling methods

    In Vitro Identification of Novel Plasminogen-Binding Receptors of the Pathogen Leptospira interrogans

    Get PDF
    Background: Leptospirosis is a multisystem disease caused by pathogenic strains of the genus Leptospira. We have reported that Leptospira are able to bind plasminogen (PLG), to generate active plasmin in the presence of activator, and to degrade purified extracellular matrix fibronectin. Methodology/Principal Findings: We have now cloned, expressed and purified 14 leptospiral recombinant proteins. The proteins were confirmed to be surface exposed by immunofluorescence microscopy and were evaluated for their ability to bind plasminogen (PLG). We identified eight as PLG-binding proteins, including the major outer membrane protein LipL32, the previously published rLIC12730, rLIC10494, Lp29, Lp49, LipL40 and MPL36, and one novel leptospiral protein, rLIC12238. Bound PLG could be converted to plasmin by the addition of urokinase-type PLG activator (uPA), showing specific proteolytic activity, as assessed by its reaction with the chromogenic plasmin substrate, D-Val-Leu-Lys 4-nitroanilide dihydrochloride. The addition of the lysine analog 6-aminocaproic acid (ACA) inhibited the protein-PLG interaction, thus strongly suggesting the involvement of lysine residues in plasminogen binding. The binding of leptospiral surface proteins to PLG was specific, dose-dependent and saturable. PLG and collagen type IV competed with LipL32 protein for the same binding site, whereas separate binding sites were observed for plasma fibronectin. Conclusions/Significance: PLG-binding/activation through the proteins/receptors on the surface of Leptospira could help the bacteria to specifically overcome tissue barriers, facilitating its spread throughout the host.FAPESP (Fundacao de Amparo a Pesquisa do Estado de Sao Paulo)CNPq (Conselho Nacional de Desenvolvimento Cientifico e Tecnologico)Fundacao Butantan, BrazilFAPESP (Brazil

    Predicting DNA-Binding Specificities of Eukaryotic Transcription Factors

    Get PDF
    Today, annotated amino acid sequences of more and more transcription factors (TFs) are readily available. Quantitative information about their DNA-binding specificities, however, are hard to obtain. Position frequency matrices (PFMs), the most widely used models to represent binding specificities, are experimentally characterized only for a small fraction of all TFs. Even for some of the most intensively studied eukaryotic organisms (i.e., human, rat and mouse), roughly one-sixth of all proteins with annotated DNA-binding domain have been characterized experimentally. Here, we present a new method based on support vector regression for predicting quantitative DNA-binding specificities of TFs in different eukaryotic species. This approach estimates a quantitative measure for the PFM similarity of two proteins, based on various features derived from their protein sequences. The method is trained and tested on a dataset containing 1 239 TFs with known DNA-binding specificity, and used to predict specific DNA target motifs for 645 TFs with high accuracy

    Complete Genome Sequence of Mycoplasma suis and Insights into Its Biology and Adaption to an Erythrocyte Niche

    Get PDF
    Mycoplasma suis, the causative agent of porcine infectious anemia, has never been cultured in vitro and mechanisms by which it causes disease are poorly understood. Thus, the objective herein was to use whole genome sequencing and analysis of M. suis to define pathogenicity mechanisms and biochemical pathways. M. suis was harvested from the blood of an experimentally infected pig. Following DNA extraction and construction of a paired end library, whole-genome sequencing was performed using GS-FLX (454) and Titanium chemistry. Reads on paired-end constructs were assembled using GS De Novo Assembler and gaps closed by primer walking; assembly was validated by PFGE. Glimmer and Manatee Annotation Engine were used to predict and annotate protein-coding sequences (CDS). The M. suis genome consists of a single, 742,431 bp chromosome with low G+C content of 31.1%. A total of 844 CDS, 3 single copies, unlinked rRNA genes and 32 tRNAs were identified. Gene homologies and GC skew graph show that M. suis has a typical Mollicutes oriC. The predicted metabolic pathway is concise, showing evidence of adaptation to blood environment. M. suis is a glycolytic species, obtaining energy through sugars fermentation and ATP-synthase. The pentose-phosphate pathway, metabolism of cofactors and vitamins, pyruvate dehydrogenase and NAD+ kinase are missing. Thus, ribose, NADH, NADPH and coenzyme A are possibly essential for its growth. M. suis can generate purines from hypoxanthine, which is secreted by RBCs, and cytidine nucleotides from uracil. Toxins orthologs were not identified. We suggest that M. suis may cause disease by scavenging and competing for host' nutrients, leading to decreased life-span of RBCs. In summary, genome analysis shows that M. suis is dependent on host cell metabolism and this characteristic is likely to be linked to its pathogenicity. The prediction of essential nutrients will aid the development of in vitro cultivation systems

    Genome Sequence of a Lancefield Group C Streptococcus zooepidemicus Strain Causing Epidemic Nephritis: New Information about an Old Disease

    Get PDF
    Outbreaks of disease attributable to human error or natural causes can provide unique opportunities to gain new information about host-pathogen interactions and new leads for pathogenesis research. Poststreptococcal glomerulonephritis (PSGN), a sequela of infection with pathogenic streptococci, is a common cause of preventable kidney disease worldwide. Although PSGN usually occurs after infection with group A streptococci, organisms of Lancefield group C and G also can be responsible. Despite decades of study, the molecular pathogenesis of PSGN is poorly understood. As a first step toward gaining new information about PSGN pathogenesis, we sequenced the genome of Streptococcus equi subsp. zooepidemicus strain MGCS10565, a group C organism that caused a very large and unusually severe epidemic of nephritis in Brazil. The genome is a circular chromosome of 2,024,171 bp. The genome shares extensive gene content, including many virulence factors, with genetically related group A streptococci, but unexpectedly lacks prophages. The genome contains many apparently foreign genes interspersed around the chromosome, consistent with the presence of a full array of genes required for natural competence. An inordinately large family of genes encodes secreted extracellular collagen-like proteins with multiple integrin-binding motifs. The absence of a gene related to speB rules out the long-held belief that streptococcal pyrogenic exotoxin B or antibodies reacting with it singularly cause PSGN. Many proteins previously implicated in GAS PSGN, such as streptokinase, are either highly divergent in strain MGCS10565 or are not more closely related between these species than to orthologs present in other streptococci that do not commonly cause PSGN. Our analysis provides a comparative genomics framework for renewed appraisal of molecular events underlying APSGN pathogenesis

    An Anomalous Type IV Secretion System in Rickettsia Is Evolutionarily Conserved

    Get PDF
    Bacterial type IV secretion systems (T4SSs) comprise a diverse transporter family functioning in conjugation, competence, and effector molecule (DNA and/or protein) translocation. Thirteen genome sequences from Rickettsia, obligate intracellular symbionts/pathogens of a wide range of eukaryotes, have revealed a reduced T4SS relative to the Agrobacterium tumefaciens archetype (vir). However, the Rickettsia T4SS has not been functionally characterized for its role in symbiosis/virulence, and none of its substrates are known.Superimposition of T4SS structural/functional information over previously identified Rickettsia components implicate a functional Rickettsia T4SS. virB4, virB8 and virB9 are duplicated, yet only one copy of each has the conserved features of similar genes in other T4SSs. An extraordinarily duplicated VirB6 gene encodes five hydrophobic proteins conserved only in a short region known to be involved in DNA transfer in A. tumefaciens. virB1, virB2 and virB7 are newly identified, revealing a Rickettsia T4SS lacking only virB5 relative to the vir archetype. Phylogeny estimation suggests vertical inheritance of all components, despite gene rearrangements into an archipelago of five islets. Similarities of Rickettsia VirB7/VirB9 to ComB7/ComB9 proteins of epsilon-proteobacteria, as well as phylogenetic affinities to the Legionella lvh T4SS, imply the Rickettsiales ancestor acquired a vir-like locus from distantly related bacteria, perhaps while residing in a protozoan host. Modern modifications of these systems likely reflect diversification with various eukaryotic host cells.We present the rvh (Rickettsiales vir homolog) T4SS, an evolutionary conserved transporter with an unknown role in rickettsial biology. This work lays the foundation for future laboratory characterization of this system, and also identifies the Legionella lvh T4SS as a suitable genetic model

    Characterizing the Syphilis-Causing Treponema pallidum ssp. pallidum Proteome Using Complementary Mass Spectrometry

    Get PDF
    YesBackground. The spirochete bacterium Treponema pallidum ssp. pallidum is the etiological agent of syphilis, a chronic multistage disease. Little is known about the global T. pallidum proteome, therefore mass spectrometry studies are needed to bring insights into pathogenicity and protein expression profiles during infection. Methodology/Principal Findings. To better understand the T. pallidum proteome profile during infection, we studied T. pallidum ssp. pallidum DAL-1 strain bacteria isolated from rabbits using complementary mass spectrometry techniques, including multidimensional peptide separation and protein identification via matrix-assisted laser desorption ionization-time of flight (MALDI-TOF/TOF) and electrospray ionization (ESI-LTQ-Orbitrap) tandem mass spectrometry. A total of 6033 peptides were detected, corresponding to 557 unique T. pallidum proteins at a high level of confidence, representing 54% of the predicted proteome. A previous gel-based T. pallidum MS proteome study detected 58 of these proteins. One hundred fourteen of the detected proteins were previously annotated as hypothetical or uncharacterized proteins; this is the first account of 106 of these proteins at the protein level. Detected proteins were characterized according to their predicted biological function and localization; half were allocated into a wide range of functional categories. Proteins annotated as potential membrane proteins and proteins with unclear functional annotations were subjected to an additional bioinformatics pipeline analysis to facilitate further characterization. A total of 116 potential membrane proteins were identified, of which 16 have evidence supporting outer membrane localization. We found 8/12 proteins related to the paralogous tpr gene family: TprB, TprC/D, TprE, TprG, TprH, TprI and TprJ. Protein abundance was semi-quantified using label-free spectral counting methods. A low correlation (r = 0.26) was found between previous microarray signal data and protein abundance. Conclusions. This is the most comprehensive description of the global T. pallidum proteome to date. These data provide valuable insights into in vivo T. pallidum protein expression, paving the way for improved understanding of the pathogenicity of this enigmatic organism.This work was supported by the grants from the Flanders Research Foundation, SOFI-B Grant to CRK, http://www.fwo.be/, a Public Health Service Grant from the National Institutes of Health to CEC, (grant # AI-051334), https://www.nih.gov/ and a grant from the Grant Agency of the Czech Republic to DS and MS (P302/12/0574, GP14-29596P), https:// gacr.cz/
    corecore