8 research outputs found

    reComBat: batch-effect removal in large-scale multi-source gene-expression data integration

    Get PDF
    With the steadily increasing abundance of omics data produced all over the world under vastly different experimental conditions residing in public databases, a crucial step in many data-driven bioinformatics applications is that of data integration. The challenge of batch-effect removal for entire databases lies in the large number of batches and biological variation, which can result in design matrix singularity. This problem can currently not be solved satisfactorily by any common batch-correction algorithm.; We present; reComBat; , a regularized version of the empirical Bayes method to overcome this limitation and benchmark it against popular approaches for the harmonization of public gene-expression data (both microarray and bulkRNAsq) of the human opportunistic pathogen; Pseudomonas aeruginosa; . Batch-effects are successfully mitigated while biologically meaningful gene-expression variation is retained.; reComBat; fills the gap in batch-correction approaches applicable to large-scale, public omics databases and opens up new avenues for data-driven analysis of complex biological processes beyond the scope of a single study.; The code is available at https://github.com/BorgwardtLab/reComBat, all data and evaluation code can be found at https://github.com/BorgwardtLab/batchCorrectionPublicData.; Supplementary data are available at; Bioinformatics Advances; online

    reComBat: Batch effect removal in large-scale, multi-source omics data integration

    Get PDF
    With the steadily increasing abundance of omics data produced all over the world, some-times decades apart and under vastly different experimental conditions residing in public databases, a crucial step in many data-driven bioinformatics applications is that of data integration. The challenge of batch effect removal for entire databases lies in the large number and coincide of both batches and desired, biological variation resulting in design matrix singularity. This problem currently cannot be solved by any common batch correction algorithm. In this study, we present reComBat , a regularised version of the empirical Bayes method to overcome this limitation. We demonstrate our approach for the harmonisation of public gene expression data of the human opportunistic pathogen Pseudomonas aeruginosa and study a several metrics to empirically demonstrate that batch effects are successfully mitigated while biologically meaningful gene expression variation is retained. reComBat fills the gap in batch correction approaches applicable to large scale, public omics databases and opens up new avenues for data driven analysis of complex biological processes beyond the scope of a single study

    Kinetics plotting code

    No full text
    <p>This code was used to plot kinetics of time-resolved killcurves. </p><p> </p&gt

    ggplot scripts to plot phage infection data

    No full text
    <p>This code was used to plot kinetics of time-resolved killcurves. </p><p> </p&gt

    reComBat: Batch effect removal in large-scale, multi-source omics data integration

    No full text
    With the steadily increasing abundance of omics data produced all over the world, some-times decades apart and under vastly different experimental conditions residing in public databases, a crucial step in many data-driven bioinformatics applications is that of data integration. The challenge of batch effect removal for entire databases lies in the large number and coincide of both batches and desired, biological variation resulting in design matrix singularity. This problem currently cannot be solved by any common batch correction algorithm. In this study, we present reComBat, a regularised version of the empirical Bayes method to overcome this limitation. We demonstrate our approach for the harmonisation of public gene expression data of the human opportunistic pathogen Pseudomonas aeruginosa and study a several metrics to empirically demonstrate that batch effects are successfully mitigated while biologically meaningful gene expression variation is retained. reComBat fills the gap in batch correction approaches applicable to large scale, public omics databases and opens up new avenues for data driven analysis of complex biological processes beyond the scope of a single study

    Systematic exploration of Escherichia coli phage-host interactions with the BASEL phage collection

    Get PDF
    Bacteriophages, the viruses infecting bacteria, hold great potential for the treatment of multidrug-resistant bacterial infections and other applications due to their unparalleled diversity and recent breakthroughs in their genetic engineering. However, fundamental knowledge of molecular mechanisms underlying phage-host interactions is mostly confined to a few traditional model systems and did not keep pace with the recent massive expansion of the field. The true potential of molecular biology encoded by these viruses has therefore remained largely untapped, and phages for therapy or other applications are often still selected empirically. We therefore sought to promote a systematic exploration of phage-host interactions by composing a well-assorted library of 66 newly isolated phages infecting the model organism Escherichia coli that we share with the community as the BASEL collection (BActeriophage SElection for your Laboratory). This collection is largely representative of natural E. coli phage diversity and was intensively characterized phenotypically and genomically alongside ten well-studied traditional model phages. We experimentally determined essential host receptors of all phages, quantified their sensitivity to eleven defense systems across different layers of bacterial immunity, and matched these results to the phages' host range across a panel of pathogenic enterobacterial strains. Our results reveal clear patterns in the distribution of phage phenotypes and genomic features that highlight systematic differences in the potency of different immunity systems and point towards the molecular basis of receptor specificity in several phage groups. Strong trade-offs were detected between fitness traits like broad host recognition and resistance to bacterial immunity that might drive the divergent adaptation of different phage groups to specific niches. We envision that the BASEL collection will inspire future work exploring the biology of bacteriophages and their hosts by facilitating the discovery of underlying molecular mechanisms as the basis for an effective translation into biotechnology or therapeutic applications

    Systematic exploration of Escherichia coli phage-host interactions with the BASEL phage collection

    Get PDF
    Bacteriophages, the viruses infecting bacteria, hold great potential for the treatment of multidrug-resistant bacterial infections and other applications due to their unparalleled diversity and recent breakthroughs in their genetic engineering. However, fundamental knowledge of the molecular mechanisms underlying phage-host interactions is mostly confined to a few traditional model systems and did not keep pace with the recent massive expansion of the field. The true potential of molecular biology encoded by these viruses has therefore remained largely untapped, and phages for therapy or other applications are often still selected empirically. We therefore sought to promote a systematic exploration of phage-host interactions by composing a well-assorted library of 68 newly isolated phages infecting the model organism Escherichia coli that we share with the community as the BASEL (BActeriophage SElection for your Laboratory) collection. This collection is largely representative of natural E. coli phage diversity and was intensively characterized phenotypically and genomically alongside 10 well-studied traditional model phages. We experimentally determined essential host receptors of all phages, quantified their sensitivity to 11 defense systems across different layers of bacterial immunity, and matched these results to the phages' host range across a panel of pathogenic enterobacterial strains. Clear patterns in the distribution of phage phenotypes and genomic features highlighted systematic differences in the potency of different immunity systems and suggested the molecular basis of receptor specificity in several phage groups. Our results also indicate strong trade-offs between fitness traits like broad host recognition and resistance to bacterial immunity that might drive the divergent adaptation of different phage groups to specific ecological niches. We envision that the BASEL collection will inspire future work exploring the biology of bacteriophages and their hosts by facilitating the discovery of underlying molecular mechanisms as the basis for an effective translation into biotechnology or therapeutic applications

    Systematic exploration of Escherichia coli phage-host interactions with the BASEL phage collection

    Full text link
    Bacteriophages, the viruses infecting bacteria, hold great potential for the treatment of multidrug-resistant bacterial infections and other applications due to their unparalleled diversity and recent breakthroughs in their genetic engineering. However, fundamental knowledge of the molecular mechanisms underlying phage-host interactions is mostly confined to a few traditional model systems and did not keep pace with the recent massive expansion of the field. The true potential of molecular biology encoded by these viruses has therefore remained largely untapped, and phages for therapy or other applications are often still selected empirically. We therefore sought to promote a systematic exploration of phage-host interactions by composing a well-assorted library of 68 newly isolated phages infecting the model organism Escherichia coli that we share with the community as the BASEL (BActeriophage SElection for your Laboratory) collection. This collection is largely representative of natural E. coli phage diversity and was intensively characterized phenotypically and genomically alongside 10 well-studied traditional model phages. We experimentally determined essential host receptors of all phages, quantified their sensitivity to 11 defense systems across different layers of bacterial immunity, and matched these results to the phages' host range across a panel of pathogenic enterobacterial strains. Clear patterns in the distribution of phage phenotypes and genomic features highlighted systematic differences in the potency of different immunity systems and suggested the molecular basis of receptor specificity in several phage groups. Our results also indicate strong trade-offs between fitness traits like broad host recognition and resistance to bacterial immunity that might drive the divergent adaptation of different phage groups to specific ecological niches. We envision that the BASEL collection will inspire future work exploring the biology of bacteriophages and their hosts by facilitating the discovery of underlying molecular mechanisms as the basis for an effective translation into biotechnology or therapeutic applications
    corecore