1,571 research outputs found

    A comparison of machine learning and Bayesian modelling for molecular serotyping.

    Get PDF
    BACKGROUND: Streptococcus pneumoniae is a human pathogen that is a major cause of infant mortality. Identifying the pneumococcal serotype is an important step in monitoring the impact of vaccines used to protect against disease. Genomic microarrays provide an effective method for molecular serotyping. Previously we developed an empirical Bayesian model for the classification of serotypes from a molecular serotyping array. With only few samples available, a model driven approach was the only option. In the meanwhile, several thousand samples have been made available to us, providing an opportunity to investigate serotype classification by machine learning methods, which could complement the Bayesian model. RESULTS: We compare the performance of the original Bayesian model with two machine learning algorithms: Gradient Boosting Machines and Random Forests. We present our results as an example of a generic strategy whereby a preliminary probabilistic model is complemented or replaced by a machine learning classifier once enough data are available. Despite the availability of thousands of serotyping arrays, a problem encountered when applying machine learning methods is the lack of training data containing mixtures of serotypes; due to the large number of possible combinations. Most of the available training data comprises samples with only a single serotype. To overcome the lack of training data we implemented an iterative analysis, creating artificial training data of serotype mixtures by combining raw data from single serotype arrays. CONCLUSIONS: With the enhanced training set the machine learning algorithms out perform the original Bayesian model. However, for serotypes currently lacking sufficient training data the best performing implementation was a combination of the results of the Bayesian Model and the Gradient Boosting Machine. As well as being an effective method for classifying biological data, machine learning can also be used as an efficient method for revealing subtle biological insights, which we illustrate with an example

    Investigating inter-chromosomal regulatory relationships through a comprehensive meta-analysis of matched copy number and transcriptomics data sets.

    Get PDF
    BACKGROUND: Gene regulatory relationships can be inferred using matched array comparative genomics and transcriptomics data sets from cancer samples. The way in which copy numbers of genes in cancer samples are often greatly disrupted works like a natural gene amplification/deletion experiment. There are now a large number of such data sets publicly available making a meta-analysis of the data possible. RESULTS: We infer inter-chromosomal acting gene regulatory relationships from a meta-analysis of 31 publicly available matched array comparative genomics and transcriptomics data sets in humans. We obtained statistically significant predictions of target genes for 1430 potential regulatory genes. The regulatory relationships being inferred are either direct relationships, of a transcription factor on its target, or indirect ones, through pathways containing intermediate steps. We analyse the predictions in terms of cocitations, both publications which cite a regulator with any of its inferred targets and cocitations of any genes in a target list. CONCLUSIONS: The most striking observation from the results is the greater number of inter-chromosomal regulatory relationships involving repression compared to those involving activation. The complete results of the meta-analysis are presented in the database METAMATCHED. We anticipate that the predictions contained in the database will be useful in informing experiments and in helping to construct networks of regulatory relationships

    Bye And Bye

    Get PDF
    Contains advertisements and/or short musical examples of pieces being sold by publisher.https://digitalcommons.library.umaine.edu/mmb-vp/6946/thumbnail.jp

    Where\u27s That Rainbow?

    Get PDF
    With Ukulele arrangement. Contains advertisements and/or short musical examples of pieces being sold by publisher.https://digitalcommons.library.umaine.edu/mmb-vp/7138/thumbnail.jp

    Mountain Greenery

    Get PDF
    With Ukulele arrangement. Contains advertisements and/or short musical examples of pieces being sold by publisher.https://digitalcommons.library.umaine.edu/mmb-vp/7171/thumbnail.jp

    Autonomous non-equilibrium mechanisms for molecular evolution

    Get PDF

    Maybe It\u27s Me

    Get PDF
    With Ukulele arrangement. Contains advertisements and/or short musical examples of pieces being sold by publisher.https://digitalcommons.library.umaine.edu/mmb-vp/7007/thumbnail.jp

    The Circus On Parade

    Get PDF
    With Ukulele arrangement. Contains advertisements and/or short musical examples of pieces being sold by publisher.https://digitalcommons.library.umaine.edu/mmb-vp/6947/thumbnail.jp

    Extrinsic symplectic symmetric spaces

    Full text link
    We define the notion of extrinsic symplectic symmetric spaces and exhibit some of their properties. We construct large families of examples and show how they fit in the perspective of a complete classification of these manifolds. We also build a natural star-quantization on a class of examples
    • …
    corecore