
    Predicting promoters in phage genomes using machine learning models

    The renewed interest in phages as antibacterial agents has led to an exponentially growing number of sequenced phage genomes. The development of novel bioinformatics methods to automate and facilitate phage genome annotation is therefore of utmost importance. The most difficult step of phage genome annotation is the identification of promoters. As the existing methods for predicting promoters are not well suited to phages, we used machine learning models to locate promoters in phage genomes. Several models were created using different algorithms and datasets, which consisted of known phage promoter and non-promoter sequences. All models performed well, but the ANN model gave better results on the smaller dataset (92% accuracy, 89% precision and 87% recall) and the SVM model gave better results on the larger dataset (93% accuracy, 91% precision and 80% recall). Both models were applied to the genome of Pseudomonas phage phiPsa17 and were able to identify both types of promoters, host and phage, found in phage genomes. This study was supported by the Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of UID/BIO/04469/2019 unit and the Project POCI-01-0145-FEDER-029628. This work was also supported by BioTecNorte operation (NORTE-01-0145-FEDER-000004) funded by the European Regional Development Fund under the scope of Norte2020 - Programa Operacional Regional do Norte.

    In silico exploration of Red Sea Bacillus genomes for natural product biosynthetic gene clusters

    Background: The increasing spectrum of multidrug-resistant bacteria is a major global public health concern, necessitating the discovery of novel antimicrobial agents. Here, members of the genus Bacillus are investigated as a potentially attractive source of novel antibiotics due to their broad spectrum of antimicrobial activities. We specifically focus on a computational analysis of the distinctive biosynthetic potential of Bacillus paralicheniformis strains isolated from the Red Sea, an ecosystem exposed to adverse, highly saline and hot conditions. Results: We report the complete circular and annotated genomes of two Red Sea strains, B. paralicheniformis Bac48 isolated from mangrove mud and B. paralicheniformis Bac84 isolated from a microbial mat collected from Rabigh Harbor Lagoon in Saudi Arabia. Comparing the genomes of B. paralicheniformis Bac48 and B. paralicheniformis Bac84 with nine publicly available complete genomes of B. licheniformis and three genomes of B. paralicheniformis revealed that all of the B. paralicheniformis strains in this study are more enriched in nonribosomal peptides (NRPs). We further report the first computationally identified trans-acyltransferase (trans-AT) nonribosomal peptide synthetase/polyketide synthase (NRPS/PKS) cluster in strains of this species. Conclusions: B. paralicheniformis species have more genes associated with biosynthesis of antimicrobial bioactive compounds than other previously characterized species of B. licheniformis, which suggests that these species are better potential sources for novel antibiotics. Moreover, the genome of the Red Sea strain B. paralicheniformis Bac48 is more enriched in modular PKS genes than B. licheniformis strains and other B. paralicheniformis strains. This may be linked to adaptations that Red Sea strains underwent to survive in the relatively hot and saline ecosystem.