48 research outputs found

    From learning taxonomies to phylogenetic learning: Integration of 16S rRNA gene data into FAME-based bacterial classification

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Machine learning techniques have shown to improve bacterial species classification based on fatty acid methyl ester (FAME) data. Nonetheless, FAME analysis has a limited resolution for discrimination of bacteria at the species level. In this paper, we approach the species classification problem from a taxonomic point of view. Such a taxonomy or tree is typically obtained by applying clustering algorithms on FAME data or on 16S rRNA gene data. The knowledge gained from the tree can then be used to evaluate FAME-based classifiers, resulting in a novel framework for bacterial species classification.</p> <p>Results</p> <p>In view of learning in a taxonomic framework, we consider two types of trees. First, a FAME tree is constructed with a supervised divisive clustering algorithm. Subsequently, based on 16S rRNA gene sequence analysis, phylogenetic trees are inferred by the NJ and UPGMA methods. In this second approach, the species classification problem is based on the combination of two different types of data. Herein, 16S rRNA gene sequence data is used for phylogenetic tree inference and the corresponding binary tree splits are learned based on FAME data. We call this learning approach 'phylogenetic learning'. Supervised Random Forest models are developed to train the classification tasks in a stratified cross-validation setting. In this way, better classification results are obtained for species that are typically hard to distinguish by a single or flat multi-class classification model.</p> <p>Conclusions</p> <p>FAME-based bacterial species classification is successfully evaluated in a taxonomic framework. Although the proposed approach does not improve the overall accuracy compared to flat multi-class classification, it has some distinct advantages. First, it has better capabilities for distinguishing species on which flat multi-class classification fails. Secondly, the hierarchical classification structure allows to easily evaluate and visualize the resolution of FAME data for the discrimination of bacterial species. Summarized, by phylogenetic learning we are able to situate and evaluate FAME-based bacterial species classification in a more informative context.</p

    Regional research priorities in brain and nervous system disorders

    Get PDF
    The characteristics of neurological, psychiatric, developmental and substance-use disorders in low-and middle-income countries are unique and the burden that they have will be different from country to country. Many of the differences are explained by the wide variation in population demographics and size, poverty, conflict, culture, land area and quality, and genetics. Neurological, psychiatric, developmental and substance-use disorders that result from, or are worsened by, a lack of adequate nutrition and infectious disease still afflict much of sub-Saharan Africa, although disorders related to increasing longevity, such as stroke, are on the rise. In the Middle East and North Africa, major depressive disorders and post-traumatic stress disorder are a primary concern because of the conflict-ridden environment. Consanguinity is a serious concern that leads to the high prevalence of recessive disorders in the Middle East and North Africa and possibly other regions. The burden of these disorders in Latin American and Asian countries largely surrounds stroke and vascular disease, dementia and lifestyle factors that are influenced by genetics. Although much knowledge has been gained over the past 10 years, the epidemiology of the conditions in low-and middle-income countries still needs more research. Prevention and treatments could be better informed with more longitudinal studies of risk factors. Challenges and opportunities for ameliorating nervous-system disorders can benefit from both local and regional research collaborations. The lack of resources and infrastructure for health-care and related research, both in terms of personnel and equipment, along with the stigma associated with the physical or behavioural manifestations of some disorders have hampered progress in understanding the disease burden and improving brain health. Individual countries, and regions within countries, have specific needs in terms of research priorities.Fil: Ravindranath, Vijayalakshmi. Indian Institute of Science; IndiaFil: Dang, Hoang Minh. Vietnam National University; VietnamFil: Goya, Rodolfo Gustavo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Bioquímicas de La Plata ; ArgentinaFil: Mansour, Hader. University of Pittsburgh; Estados Unidos. Mansoura University; EgiptoFil: Nimgaonkar, Vishwajit L.. University of Pittsburgh; Estados UnidosFil: Russell, Vivienne Ann. University of Cape Town; SudáfricaFil: Xin, Yu. Peking University; Chin

    RecG directs DNA synthesis during double-strand break repair

    Get PDF
    Homologous recombination provides a mechanism of DNA double-strand break repair (DSBR) that requires an intact, homologous template for DNA synthesis. When DNA synthesis associated with DSBR is convergent, the broken DNA strands are replaced and repair is accurate. However, if divergent DNA synthesis is established, over-replication of flanking DNA may occur with deleterious consequences. The RecG protein of Escherichia coli is a helicase and translocase that can re-model 3-way and 4-way DNA structures such as replication forks and Holliday junctions. However, the primary role of RecG in live cells has remained elusive. Here we show that, in the absence of RecG, attempted DSBR is accompanied by divergent DNA replication at the site of an induced chromosomal DNA double-strand break. Furthermore, DNA double-stand ends are generated in a recG mutant at sites known to block replication forks. These double-strand ends, also trigger DSBR and the divergent DNA replication characteristic of this mutant, which can explain over-replication of the terminus region of the chromosome. The loss of DNA associated with unwinding joint molecules previously observed in the absence of RuvAB and RecG, is suppressed by a helicase deficient PriA mutation (priA300), arguing that the action of RecG ensures that PriA is bound correctly on D-loops to direct DNA replication rather than to unwind joint molecules. This has led us to put forward a revised model of homologous recombination in which the re-modelling of branched intermediates by RecG plays a fundamental role in directing DNA synthesis and thus maintaining genomic stability
    corecore