69 research outputs found

    Comparative analysis of gene prediction tools for viral genome annotation

    Get PDF
    The number of newly available viral genomes and metagenomes has increased exponentially since the development of high throughput sequencing platforms and genome analysis tools. Bioinformatic annotation pipelines are largely based on open reading frame (ORF) calling software, which identifies genes independently of the sequence taxonomical background. Although ORF-calling programs provide a rapid genome annotation, they can misidentify ORFs and start codons; errors that might be perpetuated and propagated over time. This study evaluated the performance of multiple ORF-calling programs for viral genome annotation against the complete RefSeq viral database. Programs outputs varied when considering the viral nucleic acid type versus the viral host. According to the number of ORFs, Prodigal and Metaprodigal were the most accurate programs for DNA viruses, while FragGeneScan and Prodigal generated the most accurate outputs for RNA viruses. Similarly, Prodigal outperformed the benchmark for viruses infecting prokaryotes, and GLIMMER and GeneMarkS produced the most accurate annotations for viruses infecting eukaryotes. When the coordinates of the ORFs were considered, Prodigal scored high for all scenarios except for RNA viruses, where GeneMarkS generated the most reliable results. Overall, the quality of the coordinates predicted for RNA viruses was poorer than for DNA viruses, suggesting the need for improved ORF-calling programs to deal with RNA viruses. Moreover, none of the ORF-calling programs reached 90% accuracy for annotation of DNA viruses. Any automatic annotation can still be improved by manual curation, especially when the presence of ORFs is validated with wet-lab experiments. However, our evaluation of the current ORF-calling programs is expected to be useful for the improvement of viral genome annotation pipelines and highlights the need for more expression data to improve the rigor of reference genomes

    A Novel Linear Plasmid Mediates Flagellar Variation in Salmonella Typhi

    Get PDF
    Unlike the majority of Salmonella enterica serovars, Salmonella Typhi (S. Typhi), the etiological agent of human typhoid, is monophasic. S. Typhi normally harbours only the phase 1 flagellin gene (fliC), which encodes the H:d antigen. However, some S. Typhi strains found in Indonesia express an additional flagellin antigen termed H:z66. Molecular analysis of H:z66+ S. Typhi revealed that the H:z66 flagellin structural gene (fljBz66) is encoded on a linear plasmid that we have named pBSSB1. The DNA sequence of pBSSB1 was determined to be just over 27 kbp, and was predicted to encode 33 coding sequences. To our knowledge, pBSSB1 is the first non-bacteriophage–related linear plasmid to be described in the Enterobacteriaceae

    Evolutionary trade-offs associated with loss of PmrB function in host-adapted <i>Pseudomonas aeruginosa</i>

    Get PDF
    Pseudomonas aeruginosa colonises the upper airway of cystic fibrosis (CF) patients, providing a reservoir of host-adapted genotypes that subsequently establish chronic lung infection. We previously experimentally-evolved P. aeruginosa in a murine model of respiratory tract infection and observed early-acquired mutations in pmrB, encoding the sensor kinase of a two-component system that promoted establishment and persistence of infection. Here, using proteomics, we show downregulation of proteins involved in LPS biosynthesis, antimicrobial resistance and phenazine production in pmrB mutants, and upregulation of proteins involved in adherence, lysozyme resistance and inhibition of the chloride ion channel CFTR, relative to wild-type strain LESB65. Accordingly, pmrB mutants are susceptible to antibiotic treatment but show enhanced adherence to airway epithelial cells, resistance to lysozyme treatment, and downregulate host CFTR expression. We propose that P. aeruginosa pmrB mutations in CF patients are subject to an evolutionary trade-off, leading to enhanced colonisation potential, CFTR inhibition, and resistance to host defences, but also to increased susceptibility to antibiotics.</p

    Population genomics of domestic and wild yeasts

    Get PDF
    The natural genetics of an organism is determined by the distribution of sequences of its genome. Here we present one- to four-fold, with some deeper, coverage of the genome sequences of over seventy isolates of the domesticated baker&#x27;s yeast, _Saccharomyces cerevisiae_, and its closest relative, the wild _S. paradoxus_, which has never been associated with human activity. These were collected from numerous geographic locations and sources (including wild, clinical, baking, wine, laboratory and food spoilage). These sequences provide an unprecedented view of the population structure, natural (and artificial) selection and genome evolution in these species. Variation in gene content, SNPs, indels, copy numbers and transposable elements provide insights into the evolution of different lineages. Phenotypic variation broadly correlates with global genome-wide phylogenetic relationships however there is no correlation with source. _S. paradoxus_ populations are well delineated along geographic boundaries while the variation among worldwide _S. cerevisiae_ isolates show less differentiation and is comparable to a single _S. paradoxus_ population. Rather than one or two domestication events leading to the extant baker&#x27;s yeasts, the population structure of _S. cerevisiae_ shows a few well defined geographically isolated lineages and many different mosaics of these lineages, supporting the notion that human influence provided the opportunity for outbreeding and production of new combinations of pre-existing variation

    Comparison of dental topography of marmosets and tamarins (Callitrichidae) to other platyrrhine primates using a novel freeware pipeline

    Get PDF
    Dental topographic metrics (DTMs), which quantify different aspects of the shape of teeth, are powerful tools for studying dietary adaptation and evolution in mammals. Current DTM protocols usually rely on proprietary software, which may be unavailable to researchers for reasons of cost. We address this issue in the context of a DTM analysis of the primate clade Platyrrhini (“New World monkeys”) by: 1) presenting a large comparative sample of scanned second lower molars (m2s) of callitrichids (marmosets and tamarins), previously underrepresented in publicly available datasets; and 2) giving full details of an entirely freeware pipeline for DTM analysis and its validation. We also present an updated dietary classification scheme for extant platyrrhines, based on cluster analysis of dietary data extracted from 98 primary studies. Our freeware pipeline performs equally well in dietary classification accuracy of an existing sample of platyrrhine m2s (excluding callitrichids) as a published protocol that uses proprietary software when multiple DTMs are combined. Individual DTMs, however, sometimes showed very different results in classification accuracies between protocols, most likely due to differences in smoothing functions. The addition of callitrichids resulted in high classification accuracy in predicting diet with combined DTMs, although accuracy was considerably higher when molar size was included (90%) than excluded (73%). We conclude that our new freeware DTM pipeline is capable of accurately predicting diet in platyrrhines based on tooth shape and size, and so is suitable for inferring probable diet of taxa for which direct dietary information is unavailable, such as fossil species

    Nitric oxide (NO) elicits aminoglycoside tolerance in Escherichia coli but antibiotic resistance gene carriage and NO sensitivity have not co-evolved

    Get PDF
    The spread of multidrug-resistance in Gram-negative bacterial pathogens presents a major clinical challenge, and new approaches are required to combat these organisms. Nitric oxide (NO) is a well-known antimicrobial that is produced by the immune system in response to infection, and numerous studies have demonstrated that NO is a respiratory inhibitor with both bacteriostatic and bactericidal properties. However, given that loss of aerobic respiratory complexes is known to diminish antibiotic efficacy, it was hypothesised that the potent respiratory inhibitor NO would elicit similar effects. Indeed, the current work demonstrates that pre-exposure to NO-releasers elicits a > tenfold increase in IC50 for gentamicin against pathogenic E. coli (i.e. a huge decrease in lethality). It was therefore hypothesised that hyper-sensitivity to NO may have arisen in bacterial pathogens and that this trait could promote the acquisition of antibiotic-resistance mechanisms through enabling cells to persist in the presence of toxic levels of antibiotic. To test this hypothesis, genomics and microbiological approaches were used to screen a collection of E. coli clinical isolates for antibiotic susceptibility and NO tolerance, although the data did not support a correlation between increased carriage of antibiotic resistance genes and NO tolerance. However, the current work has important implications for how antibiotic susceptibility might be measured in future (i.e. ± NO) and underlines the evolutionary advantage for bacterial pathogens to maintain tolerance to toxic levels of NO

    Telomeric expression sites are highly conserved in trypanosoma brucei

    Get PDF
    Subtelomeric regions are often under-represented in genome sequences of eukaryotes. One of the best known examples of the use of telomere proximity for adaptive purposes are the bloodstream expression sites (BESs) of the African trypanosome Trypanosoma brucei. To enhance our understanding of BES structure and function in host adaptation and immune evasion, the BES repertoire from the Lister 427 strain of T. brucei were independently tagged and sequenced. BESs are polymorphic in size and structure but reveal a surprisingly conserved architecture in the context of extensive recombination. Very small BESs do exist and many functioning BESs do not contain the full complement of expression site associated genes (ESAGs). The consequences of duplicated or missing ESAGs, including ESAG9, a newly named ESAG12, and additional variant surface glycoprotein genes (VSGs) were evaluated by functional assays after BESs were tagged with a drug-resistance gene. Phylogenetic analysis of constituent ESAG families suggests that BESs are sequence mosaics and that extensive recombination has shaped the evolution of the BES repertoire. This work opens important perspectives in understanding the molecular mechanisms of antigenic variation, a widely used strategy for immune evasion in pathogens, and telomere biology

    Identification of a Mutation Associated with Fatal Foal Immunodeficiency Syndrome in the Fell and Dales Pony

    Get PDF
    The Fell and Dales are rare native UK pony breeds at risk due to falling numbers, in-breeding, and inherited disease. Specifically, the lethal Mendelian recessive disease Foal Immunodeficiency Syndrome (FIS), which manifests as B-lymphocyte immunodeficiency and progressive anemia, is a substantial threat. A significant percentage (∼10%) of the Fell ponies born each year dies from FIS, compromising the long-term survival of this breed. Moreover, the likely spread of FIS into other breeds is of major concern. Indeed, FIS was identified in the Dales pony, a related breed, during the course of this work. Using a stepwise approach comprising linkage and homozygosity mapping followed by haplotype analysis, we mapped the mutation using 14 FIS–affected, 17 obligate carriers, and 10 adults of unknown carrier status to a ∼1 Mb region (29.8 – 30.8 Mb) on chromosome (ECA) 26. A subsequent genome-wide association study identified two SNPs on ECA26 that showed genome-wide significance after Bonferroni correction for multiple testing: BIEC2-692674 at 29.804 Mb and BIEC2-693138 at 32.19 Mb. The associated region spanned 2.6 Mb from ∼29.6 Mb to 32.2 Mb on ECA26. Re-sequencing of this region identified a mutation in the sodium/myo-inositol cotransporter gene (SLC5A3); this causes a P446L substitution in the protein. This gene plays a crucial role in the regulatory response to osmotic stress that is essential in many tissues including lymphoid tissues and during early embryonic development. We propose that the amino acid substitution we identify here alters the function of SLC5A3, leading to erythropoiesis failure and compromise of the immune system. FIS is of significant biological interest as it is unique and is caused by a gene not previously associated with a mammalian disease. Having identified the associated gene, we are now able to eradicate FIS from equine populations by informed selective breeding
    corecore