1,414 research outputs found

    Forty years of The Selfish Gene are not enough

    Get PDF

    Genome re-annotation: a wiki solution?

    Get PDF
    The annotation of most genomes becomes outdated over time, owing in part to our ever-improving knowledge of genomes and in part to improvements in bioinformatics software. Unfortunately, annotation is rarely if ever updated and resources to support routine reannotation are scarce. Wiki software, which would allow many scientists to edit each genome's annotation, offers one possible solution

    Enabling comparative modeling of closely related genomes: Example genus Brucella

    Get PDF
    For many scientific applications, it is highly desirable to be able to compare metabolic models of closely related genomes. In this short report, we attempt to raise awareness to the fact that taking annotated genomes from public repositories and using them for metabolic model reconstructions is far from being trivial due to annotation inconsistencies. We are proposing a protocol for comparative analysis of metabolic models on closely related genomes, using fifteen strains of genus Brucella, which contains pathogens of both humans and livestock. This study lead to the identification and subsequent correction of inconsistent annotations in the SEED database, as well as the identification of 31 biochemical reactions that are common to Brucella, which are not originally identified by automated metabolic reconstructions. We are currently implementing this protocol for improving automated annotations within the SEED database and these improvements have been propagated into PATRIC, Model-SEED, KBase and RAST. This method is an enabling step for the future creation of consistent annotation systems and high-quality model reconstructions that will support in predicting accurate phenotypes such as pathogenicity, media requirements or type of respiration.We thank Jean Jacques Letesson, Maite Iriarte, Stephan Kohler and David O'Callaghan for their input on improving specific annotations. This project has been funded by the United States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract No. HHSN272200900040C, awarded to BW Sobral, and from the United States National Science Foundation under Grant MCB-1153357, awarded to CS Henry. J.P.F. acknowledges funding from [FRH/BD/70824/2010] of the FCT (Portuguese Foundation for Science and Technology) Ph.D. scholarship

    Identification of moaA3 gene in patient isolates of Mycobacterium tuberculosis in Kerala, which is absent in M. tuberculosis H37Rv and H37Ra

    Get PDF
    BACKGROUND: Tuberculosis is endemic to developing countries like India. Though the whole genome sequences of the type strain M. tuberculosis H37Rv and the clinical strain M. tuberculosis CDC1551 are available, the clinical isolates from India have not been studied extensively at the genome level. This study was carried out in order to have a better understanding of isolates from Kerala, a state in southern India. RESULTS: A PCR based strategy was followed making use of the deletion region primers to understand the genome level differences between the type strain H37Rv and the clinical isolates of M. tuberculosis from Kerala. PCR analysis of patient isolates using RD1 region primers revealed the amplification of a 386 bp region, in addition to the expected 652 bp amplicon. Southern hybridization of genomic DNA with the 386 bp amplicon confirmed the presence of this new region in a majority of the patient isolates from Kerala. Sequence comparison of this amplicon showed close homology with the moaA3 gene of M. bovis. In M. bovis this gene is present in the RvD5 region, an IS6110 mediated deletion that is absent in M. tuberculosis H37Rv. CONCLUSION: This study demonstrates the presence of moaA3 gene, that is absent in M. tuberculosis H37Rv and H37Ra, in a large number of local isolates. Whether the moaA3 gene provides any specific advantage to the field isolates of the pathogen is unclear. Field strains from Kerala have fewer IS6110 sequences and therefore are likely to have fewer IS6110 dependent rearrangements. But as deletions and insertions account for much of the genomic diversity of M. tuberculosis, the mechanisms of formation of sequence polymorphisms in the local isolates should be further examined. These results suggest that studies should focus on strains from endemic areas to understand the complexities of this pathogen

    Text-mining of PubMed abstracts by natural language processing to create a public knowledge base on molecular mechanisms of bacterial enteropathogens

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Enteropathogen Resource Integration Center (ERIC; <url>http://www.ericbrc.org</url>) has a goal of providing bioinformatics support for the scientific community researching enteropathogenic bacteria such as <it>Escherichia coli </it>and <it>Salmonella </it>spp. Rapid and accurate identification of experimental conclusions from the scientific literature is critical to support research in this field. Natural Language Processing (NLP), and in particular Information Extraction (IE) technology, can be a significant aid to this process.</p> <p>Description</p> <p>We have trained a powerful, state-of-the-art IE technology on a corpus of abstracts from the microbial literature in PubMed to automatically identify and categorize biologically relevant entities and predicative relations. These relations include: Genes/Gene Products and their Roles; Gene Mutations and the resulting Phenotypes; and Organisms and their associated Pathogenicity. Evaluations on blind datasets show an F-measure average of greater than 90% for entities (genes, operons, etc.) and over 70% for relations (gene/gene product to role, etc). This IE capability, combined with text indexing and relational database technologies, constitute the core of our recently deployed text mining application.</p> <p>Conclusion</p> <p>Our Text Mining application is available online on the ERIC website <url>http://www.ericbrc.org/portal/eric/articles</url>. The information retrieval interface displays a list of recently published enteropathogen literature abstracts, and also provides a search interface to execute custom queries by keyword, date range, etc. Upon selection, processed abstracts and the entities and relations extracted from them are retrieved from a relational database and marked up to highlight the entities and relations. The abstract also provides links from extracted genes and gene products to the ERIC Annotations database, thus providing access to comprehensive genomic annotations and adding value to both the text-mining and annotations systems.</p

    Impacts of climate change on plant diseases – opinions and trends

    Get PDF
    There has been a remarkable scientific output on the topic of how climate change is likely to affect plant diseases in the coming decades. This review addresses the need for review of this burgeoning literature by summarizing opinions of previous reviews and trends in recent studies on the impacts of climate change on plant health. Sudden Oak Death is used as an introductory case study: Californian forests could become even more susceptible to this emerging plant disease, if spring precipitations will be accompanied by warmer temperatures, although climate shifts may also affect the current synchronicity between host cambium activity and pathogen colonization rate. A summary of observed and predicted climate changes, as well as of direct effects of climate change on pathosystems, is provided. Prediction and management of climate change effects on plant health are complicated by indirect effects and the interactions with global change drivers. Uncertainty in models of plant disease development under climate change calls for a diversity of management strategies, from more participatory approaches to interdisciplinary science. Involvement of stakeholders and scientists from outside plant pathology shows the importance of trade-offs, for example in the land-sharing vs. sparing debate. Further research is needed on climate change and plant health in mountain, boreal, Mediterranean and tropical regions, with multiple climate change factors and scenarios (including our responses to it, e.g. the assisted migration of plants), in relation to endophytes, viruses and mycorrhiza, using long-term and large-scale datasets and considering various plant disease control methods

    Characterization of Clinically-Attenuated Burkholderia mallei by Whole Genome Sequencing: Candidate Strain for Exclusion from Select Agent Lists

    Get PDF
    is an understudied biothreat agent responsible for glanders which can be lethal in humans and animals. Research with this pathogen has been hampered in part by constraints of Select Agent regulations for safety reasons. Whole genomic sequencing (WGS) is an apt approach to characterize newly discovered or poorly understood microbial pathogens. genome. Therefore, the strain by itself is unlikely to revert naturally to its virulent phenotype. There were other genes present in one strain and not the other and vice-versa. was both avirulent in the natural host ponies, and did not possess T3SS associated genes may be fortuitous to advance biodefense research. The deleted virulence-essential T3SS is not likely to be re-acquired naturally. These findings may provide a basis for exclusion of SAVP1 from the Select Agent regulation or at least discussion of what else would be required for exclusion. This exclusion could accelerate research by investigators not possessing BSL-3 facilities and facilitate the production of reagents such as antibodies without the restraints of Select Agent regulation

    Measurement of the cross-section of high transverse momentum vector bosons reconstructed as single jets and studies of jet substructure in pp collisions at √s = 7 TeV with the ATLAS detector

    Get PDF
    This paper presents a measurement of the cross-section for high transverse momentum W and Z bosons produced in pp collisions and decaying to all-hadronic final states. The data used in the analysis were recorded by the ATLAS detector at the CERN Large Hadron Collider at a centre-of-mass energy of √s = 7 TeV;{\rm Te}{\rm V}andcorrespondtoanintegratedluminosityof and correspond to an integrated luminosity of 4.6\;{\rm f}{{{\rm b}}^{-1}}.ThemeasurementisperformedbyreconstructingtheboostedWorZbosonsinsinglejets.ThereconstructedjetmassisusedtoidentifytheWandZbosons,andajetsubstructuremethodbasedonenergyclusterinformationinthejetcentreofmassframeisusedtosuppressthelargemultijetbackground.ThecrosssectionforeventswithahadronicallydecayingWorZboson,withtransversemomentum. The measurement is performed by reconstructing the boosted W or Z bosons in single jets. The reconstructed jet mass is used to identify the W and Z bosons, and a jet substructure method based on energy cluster information in the jet centre-of-mass frame is used to suppress the large multi-jet background. The cross-section for events with a hadronically decaying W or Z boson, with transverse momentum {{p}_{{\rm T}}}\gt 320\;{\rm Ge}{\rm V}andpseudorapidity and pseudorapidity |\eta |\lt 1.9,ismeasuredtobe, is measured to be {{\sigma }_{W+Z}}=8.5\pm 1.7$ pb and is compared to next-to-leading-order calculations. The selected events are further used to study jet grooming techniques

    High Resolution Discrimination of Clinical Mycobacterium tuberculosis Complex Strains Based on Single Nucleotide Polymorphisms

    Get PDF
    Recently, the diversity of the Mycobacterium tuberculosis complex (MTBC) population structure has been described in detail. Based on geographical separation and specific host pathogen co-evolution shaping MTBC virulence traits, at least 20 major lineages/genotypes have evolved finally leading to a clear influence of strain genetic background on transmissibility, clinical presentation/outcome, and resistance development. Therefore, high resolution genotyping for characterization of strains in larger studies is mandatory for understanding mechanisms of host-pathogen-interaction and to improve tuberculosis (TB) control. Single nucleotide polymorphisms (SNPs) represent the most reliable markers for lineage classification of clinical isolates due to the low levels of homoplasy, however their use is hampered either by low discriminatory power or by the need to analyze a large number of genes to achieve higher resolution. Therefore, we carried out de novo sequencing of 26 genes (approx. 20000 bp per strain) in a reference collection of MTBC strains including all major genotypes to define a highly discriminatory gene set. Overall, 161 polymorphisms were detected of which 59 are genotype-specific, while 13 define deeper branches such as the Euro-American lineage. Unbiased investigation of the most variable set of 11 genes in a population based strain collection (one year, city of Hamburg, Germany) confirmed the validity of SNP analysis as all strains were classified with high accuracy. Taken together, we defined a diagnostic algorithm which allows the identification of 17 MTBC phylogenetic lineages with high confidence for the first time by sequencing analysis of just five genes. In conclusion, the diagnostic algorithm developed in our study is likely to open the door for a low cost high resolution sequence/SNP based differentiation of the MTBC with a very high specificity. High throughput assays can be established which will be needed for large association studies that are mandatory for detailed investigation of host-pathogen-interaction during TB infection
    corecore