222 research outputs found

    The development and use of bioinformatic web applications for infectious disease microbiology

    No full text
    The ever-increasing generation and submission of DNA sequences, and associated biological data, to publicly available databases demands software for the analysis of the biological meanings held within. The web provides a common platform for the provision of tools enabling the concurrent deposition, visualisation and analysis of data collected by many users from many different locations. Open programmatic web standards allow the development of applications addressing diverse biological questions and, more recently, are providing methods enabling functionality more traditionally associated with desktop software to be provided via the internet. In this work I detail the development and use of web applications addressing different, but not exclusive, areas of infectious disease microbiology. Firstly, an application utilised by a group of researchers (including myself) to undertake a comparative genetic analysis of the capsular biosynthetic locus from serotypes of the pathogen Streptococcus pneumoniae is detailed. Secondly, an application widely used by communities of researchers and public health laboratories for the assignment of microbial isolates to strains via the internet: mlst.net is described. Thirdly, I describe a generic electronic taxonomy application for assigning strains to bacterial species, exemplified using sequences from the viridans group streptococci, the taxonomy of which is notoriously difficult to define. Lastly, I describe the use of web mapping tools for molecular epidemiological databases, such as mlst.net, and further detail their application to the European distribution of genotypes of Staphylococcus aureus, to catalogue the global distribution of the emerging amphibian fungal pathogen Batrachochytrium dendrobatidis and to provide geocoding tools to encourage users to submit and explore their own data in the geographical context they were collected. Each application has been designed with generality in mind and through reference to user workflows and biological examples, I demonstrate the extensibility and general applicability of current web development methodologies to enable the provision of applications addressing a diversity of biological questions

    The multilocus sequence typing network: mlst.net

    Get PDF
    The unambiguous characterization of strains of a pathogen is crucial for addressing questions relating to its epidemiology, population and evolutionary biology. Multilocus sequence typing (MLST), which defines strains from the sequences at seven house-keeping loci, has become the method of choice for molecular typing of many bacterial and fungal pathogens (and non-pathogens), and MLST schemes and strain databases are available for a growing number of prokaryotic and eukaryotic organisms. Sequence data are ideal for strain characterization as they are unambiguous, meaning strains can readily be compared between laboratories via the Internet. Laboratories undertaking MLST can quickly progress from sequencing the seven gene fragments to characterizing their strains and relating them to those submitted by others and to the population as a whole. We provide the gateway to a number of MLST schemes, each of which contain a set of tools for the initial characterization of strains, and methods for relating query strains to other strains of the species, including clustering based on differences in allelic profiles, phylogenetic trees based on concatenated sequences, and a recently developed method (eBURST) for identifying clonal complexes within a species and displaying the overall structure of the population. This network of MLST websites is available a

    EpiCollect: Linking Smartphones to Web Applications for Epidemiology, Ecology and Community Data Collection

    Get PDF
    Epidemiologists and ecologists often collect data in the field and, on returning to their laboratory, enter their data into a database for further analysis. The recent introduction of mobile phones that utilise the open source Android operating system, and which include (among other features) both GPS and Google Maps, provide new opportunities for developing mobile phone applications, which in conjunction with web applications, allow two-way communication between field workers and their project databases.Here we describe a generic framework, consisting of mobile phone software, EpiCollect, and a web application located within www.spatialepidemiology.net. Data collected by multiple field workers can be submitted by phone, together with GPS data, to a common web database and can be displayed and analysed, along with previously collected data, using Google Maps (or Google Earth). Similarly, data from the web database can be requested and displayed on the mobile phone, again using Google Maps. Data filtering options allow the display of data submitted by the individual field workers or, for example, those data within certain values of a measured variable or a time period.Data collection frameworks utilising mobile phones with data submission to and from central databases are widely applicable and can give a field worker similar display and analysis tools on their mobile phone that they would have if viewing the data in their laboratory via the web. We demonstrate their utility for epidemiological data collection and display, and briefly discuss their application in ecological and community data collection. Furthermore, such frameworks offer great potential for recruiting 'citizen scientists' to contribute data easily to central databases through their mobile phone

    The promise of whole genome pathogen sequencing for the molecular epidemiology of emerging aquaculture pathogens

    Get PDF
    Aquaculture is the fastest growing food-producing sector, and the sustainability of this industry is critical both for global food security and economic welfare. The management of infectious disease represents a key challenge. Here, we discuss the opportunities afforded by whole genome sequencing of bacterial and viral pathogens of aquaculture to mitigate disease emergence and spread. We outline, by way of comparison, how sequencing technology is transforming the molecular epidemiology of pathogens of public health importance, emphasizing the importance of community-oriented databases and analysis tools

    PANINI : Pangenome Neighbour Identification for Bacterial Populations

    Get PDF
    The standard workhorse for genomic analysis of the evolution of bacterial populations is phylogenetic modelling of mutations in the core genome. However, a notable amount of information about evolutionary and transmission processes in diverse populations can be lost unless the accessory genome is also taken into consideration. Here, we introduce PANINI (Pangenome Neighbour Identification for Bacterial Populations), a computationally scalable method for identifying the neighbours for each isolate in a data set using unsupervised machine learning with stochastic neighbour embedding based on the t-SNE (t-distributed stochastic neighbour embedding) algorithm. PANINI is browser-based and integrates with the Microreact platform for rapid online visualization and exploration of both core and accessory genome evolutionary signals, together with relevant epidemiological, geographical, temporal and other metadata. Several case studies with single- and multi-clone pneumococcal populations are presented to demonstrate the ability to identify biologically important signals from gene content data. PANINI is available at http://panini.pathogen.watch and code at http://gitlab.com/cgps/panini.Peer reviewe

    Novel Subclone of Carbapenem-Resistant Klebsiella pneumoniae Sequence Type 11 with Enhanced Virulence and Transmissibility, China.

    Get PDF
    We aimed to clarify the epidemiologic and clinical importance of evolutionary events that occurred in carbapenem-resistant Klebsiella pneumoniae (CRKP). We collected 203 CRKP causing bloodstream infections in a tertiary hospital in China during 2013-2017. We detected a subclonal shift in the dominant clone sequence type (ST) 11 CRKP in which the previously prevalent capsular loci (KL) 47 had been replaced by KL64 since 2016. Patients infected with ST11-KL64 CRKP had a significantly higher 30-day mortality rate than other CRKP-infected patients. Enhanced virulence was further evidenced by phenotypic tests. Phylogenetic reconstruction demonstrated that ST11-KL64 is derived from an ST11-KL47-like ancestor through recombination. We identified a pLVPK-like virulence plasmid carrying rmpA and peg-344 in ST11-KL64 exclusively from 2016 onward. The pLVPK-like-positive ST11-KL64 isolates exhibited enhanced environmental survival. Retrospective screening of a national collection identified ST11-KL64 in multiple regions. Targeted surveillance of this high-risk CRKP clone is urgently needed

    Diversification of Colonization Factors in a Multidrug-Resistant Escherichia coli Lineage Evolving under Negative Frequency-Dependent Selection

    Get PDF
    Escherichia coli is a major cause of bloodstream and urinary tract infections globally. The wide dissemination of multidrug-resistant (MDR) strains of extraintestinal pathogenic E. coli (ExPEC) poses a rapidly increasing public health burden due to narrowed treatment options and increased risk of failure to clear an infection. Here, we present a detailed population genomic analysis of the ExPEC ST131 clone, in which we seek explanations for its success as an emerging pathogenic strain beyond the acquisition of antimicrobial resistance (AMR) genes. We show evidence for evolution toward separate ecological niches for the main clades of ST131 and differential evolution of anaerobic metabolism, key colonization, and virulence factors. We further demonstrate that negative frequency-dependent selection acting across accessory loci is a major mechanism that has shaped the population evolution of this pathogen.IMPORTANCE Infections with multidrug-resistant (MDR) strains of Escherichia coli are a significant global public health concern. To combat these pathogens, we need a deeper understanding of how they evolved from their background populations. By understanding the processes that underpin their emergence, we can design new strategies to limit evolution of new clones and combat existing clones. By combining population genomics with modelling approaches, we show that dominant MDR clones of E. coli are under the influence of negative frequency-dependent selection, preventing them from rising to fixation in a population. Furthermore, we show that this selection acts on genes involved in anaerobic metabolism, suggesting that this key trait, and the ability to colonize human intestinal tracts, is a key step in the evolution of MDR clones of E. coli.Peer reviewe
    corecore