56 research outputs found

    The man, the plant, and the insect: shooting host specificity determinants in Serratia marcescens pangenome

    Introduction: Serratia marcescens is most commonly known as an opportunistic pathogen causing nosocomial infections. However, it has been shown to infect a wide range of hosts besides vertebrates, such as insects and plants, being either pathogenic or growth-promoting for the latter. Although its virulence mechanisms during human infections have been studied extensively, there has been little evidence of which factors determine S. marcescens host specificity. We therefore analyzed the S. marcescens pangenome to reveal possible specificity factors. Methods: We selected 73 high-quality genome assemblies at the complete level and reconstructed the respective pangenome and a reference phylogeny based on the core gene alignment. To find an optimal pipeline, we tested current pangenomic tools and obtained several phylogenetic inferences. The pangenome was rich in its accessory component and was considered open according to Heaps' law. We then applied a pangenome-wide association method (pan-GWAS) and predicted positively associated gene clusters attributed to three host groups, namely, humans, insects, and plants. Results: Significant factors relating to human infections included transcriptional regulators, lipoproteins, ABC transporters, and membrane proteins. Host preference toward insects, in turn, was associated with diverse enzymes, such as hydrolases, isochorismatase, and N-acetyltransferase, with the latter possibly exerting a neurotoxic effect. Finally, plant infection may be conducted through type VI secretion systems and modulation of plant cell wall synthesis. Interestingly, factors associated with plants also included putative growth-promoting proteins, such as enzymes performing xenobiotic degradation and releasing ammonium ions. We also identified overrepresented functional annotations within the sets of specificity factors and found that their functional characteristics fell into separate clusters, implying that host adaptation is represented by diverse functional pathways. Finally, we found that mobile genetic elements bore specificity determinants. In particular, prophages were mainly associated with factors related to humans, while genetic islands were associated with insects and plants. Discussion: In summary, functional enrichments coupled with pangenomic inferences allowed us to hypothesize that the respective host preference is carried out through distinct molecular mechanisms of virulence. To the best of our knowledge, the presented research is the first to identify specific genomic features of S. marcescens assemblies isolated from different hosts at the pangenomic level.
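    The openness criterion mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the common power-law formulation in which the number of new genes found when the N-th genome is added follows n(N) = kappa * N^(-alpha), with the pangenome regarded as open when alpha <= 1. The abstract does not say which formulation or tool the authors used, so the function names and the synthetic data below are hypothetical and serve only as an illustration.

```python
import numpy as np


def fit_heaps_law(genomes_sampled, new_genes):
    """Fit n(N) = kappa * N**(-alpha) by least squares in log-log space.

    genomes_sampled: number of genomes N at each sampling step (N >= 1)
    new_genes: number of previously unseen genes found at that step (> 0)
    Returns (kappa, alpha).
    """
    n = np.asarray(genomes_sampled, dtype=float)
    g = np.asarray(new_genes, dtype=float)
    # log n(N) = log(kappa) - alpha * log(N), i.e. a straight line
    slope, intercept = np.polyfit(np.log(n), np.log(g), 1)
    return float(np.exp(intercept)), float(-slope)


def pangenome_is_open(alpha):
    """Assumed criterion: alpha <= 1 means new genes keep accumulating."""
    return alpha <= 1.0


if __name__ == "__main__":
    # Synthetic, noise-free data (NOT taken from the study).
    n = np.arange(2, 74, dtype=float)
    kappa, alpha = fit_heaps_law(n, 500.0 * n ** -0.6)
    print(f"kappa={kappa:.1f}, alpha={alpha:.2f}, open={pangenome_is_open(alpha)}")
```

    In practice the new-gene counts would be averaged over many random orderings of the genomes before fitting, since a single ordering is noisy.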

    Predicting Amyloidogenic Proteins in the Proteomes of Plants

    Amyloids are protein fibrils with characteristic spatial structure. Though amyloids were long perceived to be pathogens that cause dozens of incurable pathologies in humans and mammals, it is currently clear that amyloids also represent a functionally important form of protein structure implicated in a variety of biological processes in organisms ranging from archaea and bacteria to fungi and animals. Despite their social significance, plants remain the most poorly studied group of organisms in the field of amyloid biology. To date, amyloid properties have only been demonstrated in vitro or in heterologous systems for a small number of plant proteins. Here, for the first time, we performed a comprehensive analysis of the distribution of potentially amyloidogenic proteins in the proteomes of approximately 70 species of land plants using the Waltz and SARP (Sequence Analysis based on the Ranking of Probabilities) bioinformatic algorithms. We analyzed more than 2.9 million protein sequences and found that potentially amyloidogenic proteins are abundant in plant proteomes. We found that such proteins are overrepresented among membrane as well as DNA- and RNA-binding proteins of plants. Moreover, seed storage and defense proteins of most plant species are rich in amyloidogenic regions. Taken together, our data demonstrate the diversity of potentially amyloidogenic proteins in plant proteomes and suggest biological processes where formation of amyloids might be functionally important

    SARP: A Novel Algorithm to Assess Compositional Biases in Protein Sequences

    The composition of a defined set of subunits (nucleotides, amino acids) is one of the key features of biological sequences. Compositional biases are local shifts in amino acid or nucleotide frequencies that can occur as an adaptation of an organism to an extreme ecological niche, or as the signature of a specific function or localization of the corresponding protein. Calculating probabilities is one way to annotate compositional bias and to detect biased subsequences accurately. Here, we present Sequence Analysis based on the Ranking of Probabilities (SARP), a novel algorithm for the annotation of compositional biases based on ranking subsequences by their probabilities. SARP provides the same accuracy as the previously published Lower Probability Subsequences (LPS) algorithm but runs approximately 230-fold faster. It can be recommended for work with large datasets to reduce the time and resources required.
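    The general idea of ranking subsequences by probability can be sketched as follows. This is not the SARP algorithm itself, whose details the abstract does not give. It is a simplified illustration that assumes a fixed-length sliding window, a single residue of interest, and a binomial background model. The function names, the window length, and the background frequency are all hypothetical.

```python
from math import comb


def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p): chance of seeing at least k
    copies of a residue in a window of length n under the background."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def rank_biased_windows(seq, residue, background_freq, window=10):
    """Score every window by its binomial tail probability and return
    (probability, start, subsequence) tuples, least probable first."""
    scored = []
    for start in range(len(seq) - window + 1):
        sub = seq[start:start + window]
        k = sub.count(residue)
        scored.append((binomial_tail(k, window, background_freq), start, sub))
    return sorted(scored)


if __name__ == "__main__":
    # Toy sequence with a glutamine-rich stretch; 0.04 is an assumed
    # background frequency for Q, not a value from the paper.
    toy = "MKTAYQQQQQQQQQQLLGA"
    for prob, start, sub in rank_biased_windows(toy, "Q", 0.04)[:3]:
        print(f"{start:2d} {sub} {prob:.3e}")
```

    Windows with the lowest probability are the strongest candidates for a compositional bias. A real tool would also need to handle variable window lengths and multiple residue types.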

    Recombination in Bacterial Genomes: Evolutionary Trends

    Bacteria have undergone homologous recombination (HR) and horizontal gene transfer (HGT) multiple times during their history. These processes could increase fitness in new environments and cause specialization, the emergence of new species, and changes in virulence. Therefore, comprehensive knowledge of the impact and intensity of genetic exchanges and of the location of recombination hotspots in the genome is necessary for understanding the dynamics of adaptation to various conditions. To this end, we aimed to characterize the functional impact and genomic context of computationally detected recombination events by analyzing genomic studies, published in the last 30 years, of any bacterial species in which such events have been detected. Genomic loci where the transfer of DNA was detected pertained to mobile genetic elements (MGEs) housing genes that code for proteins engaged in distinct cellular processes, such as secretion systems, toxins, infection effectors, and biosynthesis enzymes. We found that all inferences fall into three main lifestyle categories, namely, ecological diversification, pathogenesis, and symbiosis. The latter primarily exhibits ancestral events, possibly indicating that adaptation is governed by similar recombination-dependent mechanisms.

    Exploring Proteins Containing Amyloidogenic Regions in the Proteomes of Bacteria of the Order Rhizobiales

    Amyloids are protein fibrils with a highly ordered spatial structure called cross-β. To date, amyloids have been shown to be implicated in a wide range of biological processes, both pathogenic and functional. In bacteria, functional amyloids are involved in forming biofilms, storing toxins, overcoming surface tension, and other functions. Rhizobiales represent an economically important group of Alphaproteobacteria, various species of which are not only capable of fixing nitrogen in symbiosis with leguminous plants but also act as the causative agents of infectious diseases in animals and plants. Here, we implemented bioinformatic screening for potentially amyloidogenic proteins in the proteomes of more than 80 species belonging to the order Rhizobiales. Using the SARP (Sequence Analysis based on the Ranking of Probabilities) and Waltz bioinformatic algorithms, we identified the biological processes in which potentially amyloidogenic proteins are overrepresented. We detected protein domains and regions associated with amyloidogenic sequences in the proteomes of various Rhizobiales species. We demonstrated that amyloidogenic regions tend to occur in membrane or extracellular proteins, many of which are involved in pathogenesis-related processes, including adhesion, assembly of the flagellum, and transport of siderophores and lipopolysaccharides, and contain domains typical of virulence factors (hemolysin, RTX, YadA, LptD); some of them (rhizobiocins, LptD) are also related to symbiosis.

    For Someone, You Are the Whole World: Host-Specificity of Salmonella enterica

    Salmonella enterica is a bacterial pathogen known to cause gastrointestinal infections in diverse hosts, including humans and animals. Despite extensive knowledge of virulence mechanisms, understanding the factors driving host specificity remains limited. In this study, we performed a comprehensive pangenome-wide analysis of S. enterica to identify potential loci determining preference towards certain hosts. We used a dataset of high-quality genome assemblies grouped into 300 reference clusters with a special focus on four host groups: humans, pigs, cattle, and birds. The reconstructed pangenome was shown to be open and enriched with the accessory component implying high genetic diversity. Notably, phylogenetic inferences did not correspond to the distribution of affected hosts, as large compact phylogenetic groups were absent. By performing a pangenome-wide association study, we identified potential host specificity determinants. These included multiple genes encoding proteins involved in distinct infection stages, e.g., secretion systems, surface structures, transporters, transcription regulators, etc. We also identified antibiotic resistance loci in host-adapted strains. Functional annotation corroborated the results obtained with significant enrichments related to stress response, antibiotic resistance, ion transport, and surface or extracellular localization. We suggested categorizing the revealed specificity factors into three main groups: pathogenesis, resistance to antibiotics, and propagation of mobile genetic elements (MGEs)

    Current Methods for Recombination Detection in Bacteria

    The role of genetic exchanges, i.e., homologous recombination (HR) and horizontal gene transfer (HGT), in bacteria cannot be overestimated, for it is a pivotal mechanism leading to their evolution and adaptation. Tracking the signs of recombination and HGT events is therefore important for both fundamental and applied science. To date, dozens of bioinformatics tools for revealing recombination signals are available; however, their pros and cons, as well as the range of tasks they can solve, have not yet been systematically reviewed. Moreover, there are two major groups of software: one aims to infer evidence of HR, while the other deals only with HGT. Despite these seemingly different goals, all the methods use similar algorithmic approaches, and the two processes are interconnected, influencing each other in the course of genomic evolution. In this review, we propose a classification of novel instruments for both HR and HGT detection based on the genomic consequences of recombination. In this context, we summarize available methodologies, paying particular attention to the type of traceable events for which a given program has been designed.