7 research outputs found

    Benchmarking network propagation methods for disease gene identification

    Get PDF
    In-silico identification of potential target genes for disease is an essential aspect of drug target discovery. Recent studies suggest that successful targets can be found through by leveraging genetic, genomic and protein interaction information. Here, we systematically tested the ability of 12 varied algorithms, based on network propagation, to identify genes that have been targeted by any drug, on gene-disease data from 22 common non-cancerous diseases in OpenTargets. We considered two biological networks, six performance metrics and compared two types of input gene-disease association scores. The impact of the design factors in performance was quantified through additive explanatory models. Standard cross-validation led to over-optimistic performance estimates due to the presence of protein complexes. In order to obtain realistic estimates, we introduced two novel protein complex-aware cross-validation schemes. When seeding biological networks with known drug targets, machine learning and diffusion-based methods found around 2-4 true targets within the top 20 suggestions. Seeding the networks with genes associated to disease by genetics decreased performance below 1 true hit on average. The use of a larger network, although noisier, improved overall performance. We conclude that diffusion-based prioritisers and machine learning applied to diffusion-based features are suited for drug discovery in practice and improve over simpler neighbour-voting methods. We also demonstrate the large impact of choosing an adequate validation strategy and the definition of seed disease genesPeer ReviewedPostprint (published version

    Plasma protein biomarkers for depression and schizophrenia by multi analyte profiling of case-control collections.

    Get PDF
    Despite significant research efforts aimed at understanding the neurobiological underpinnings of psychiatric disorders, the diagnosis and the evaluation of treatment of these disorders are still based solely on relatively subjective assessment of symptoms. Therefore, biological markers which could improve the current classification of psychiatry disorders, and in perspective stratify patients on a biological basis into more homogeneous clinically distinct subgroups, are highly needed. In order to identify novel candidate biological markers for major depression and schizophrenia, we have applied a focused proteomic approach using plasma samples from a large case-control collection. Patients were diagnosed according to DSM criteria using structured interviews and a number of additional clinical variables and demographic information were assessed. Plasma samples from 245 depressed patients, 229 schizophrenic patients and 254 controls were submitted to multi analyte profiling allowing the evaluation of up to 79 proteins, including a series of cytokines, chemokines and neurotrophins previously suggested to be involved in the pathophysiology of depression and schizophrenia. Univariate data analysis showed more significant p-values than would be expected by chance and highlighted several proteins belonging to pathways or mechanisms previously suspected to be involved in the pathophysiology of major depression or schizophrenia, such as insulin and MMP-9 for depression, and BDNF, EGF and a number of chemokines for schizophrenia. Multivariate analysis was carried out to improve the differentiation of cases from controls and identify the most informative panel of markers. The results illustrate the potential of plasma biomarker profiling for psychiatric disorders, when conducted in large collections. The study highlighted a set of analytes as candidate biomarker signatures for depression and schizophrenia, warranting further investigation in independent collections

    Benchmarking network propagation methods for disease gene identification

    No full text
    In-silico identification of potential target genes for disease is an essential aspect of drug target discovery. Recent studies suggest that successful targets can be found through by leveraging genetic, genomic and protein interaction information. Here, we systematically tested the ability of 12 varied algorithms, based on network propagation, to identify genes that have been targeted by any drug, on gene-disease data from 22 common non-cancerous diseases in OpenTargets. We considered two biological networks, six performance metrics and compared two types of input gene-disease association scores. The impact of the design factors in performance was quantified through additive explanatory models. Standard cross-validation led to over-optimistic performance estimates due to the presence of protein complexes. In order to obtain realistic estimates, we introduced two novel protein complex-aware cross-validation schemes. When seeding biological networks with known drug targets, machine learning and diffusion-based methods found around 2-4 true targets within the top 20 suggestions. Seeding the networks with genes associated to disease by genetics decreased performance below 1 true hit on average. The use of a larger network, although noisier, improved overall performance. We conclude that diffusion-based prioritisers and machine learning applied to diffusion-based features are suited for drug discovery in practice and improve over simpler neighbour-voting methods. We also demonstrate the large impact of choosing an adequate validation strategy and the definition of seed disease genesPeer Reviewe

    Benchmarking network propagation methods for disease gene identification

    No full text
    In-silico identification of potential target genes for disease is an essential aspect of drug target discovery. Recent studies suggest that successful targets can be found through by leveraging genetic, genomic and protein interaction information. Here, we systematically tested the ability of 12 varied algorithms, based on network propagation, to identify genes that have been targeted by any drug, on gene-disease data from 22 common non-cancerous diseases in OpenTargets. We considered two biological networks, six performance metrics and compared two types of input gene-disease association scores. The impact of the design factors in performance was quantified through additive explanatory models. Standard cross-validation led to over-optimistic performance estimates due to the presence of protein complexes. In order to obtain realistic estimates, we introduced two novel protein complex-aware cross-validation schemes. When seeding biological networks with known drug targets, machine learning and diffusion-based methods found around 2-4 true targets within the top 20 suggestions. Seeding the networks with genes associated to disease by genetics decreased performance below 1 true hit on average. The use of a larger network, although noisier, improved overall performance. We conclude that diffusion-based prioritisers and machine learning applied to diffusion-based features are suited for drug discovery in practice and improve over simpler neighbour-voting methods. We also demonstrate the large impact of choosing an adequate validation strategy and the definition of seed disease genesPeer Reviewe

    The repurposing of Tebipenem pivoxil as alternative therapy for severe gastrointestinal infections caused by extensively drug-resistant Shigella spp.

    No full text
    BACKGROUND: Diarrhoea remains one of the leading causes of childhood mortality globally. Recent epidemiological studies conducted in low-middle income countries (LMICs) identified Shigella spp. as the first and second most predominant agent of dysentery and moderate diarrhoea, respectively. Antimicrobial therapy is often necessary for Shigella infections; however, we are reaching a crisis point with efficacious antimicrobials. The rapid emergence of resistance against existing antimicrobials in Shigella spp. poses a serious global health problem. METHODS: Aiming to identify alternative antimicrobial chemicals with activity against antimicrobial resistant Shigella, we initiated a collaborative academia-industry drug discovery project, applying high-throughput phenotypic screening across broad chemical diversity and followed a lead compound through in vitro and in vivo characterisation. RESULTS: We identified several known antimicrobial compound classes with antibacterial activity against Shigella. These compounds included the oral carbapenem Tebipenem, which was found to be highly potent against broadly susceptible Shigella and contemporary MDR variants for which we perform detailed pre-clinical testing. Additional in vitro screening demonstrated that Tebipenem had activity against a wide range of other non-Shigella enteric bacteria. Cognisant of the risk for the development of resistance against monotherapy, we identified synergistic behaviour of two different drug combinations incorporating Tebipenem. We found the orally bioavailable prodrug (Tebipenem pivoxil) had ideal pharmacokinetic properties for treating enteric pathogens and was effective in clearing the gut of infecting organisms when administered to Shigella-infected mice and gnotobiotic piglets. CONCLUSIONS: Our data highlight the emerging antimicrobial resistance crisis and shows that Tebipenem pivoxil (licenced for paediatric respiratory tract infections in Japan) should be accelerated into human trials and could be repurposed as an effective treatment for severe diarrhoea caused by MDR Shigella and other enteric pathogens in LMICs. FUNDING: Tres Cantos Open Lab Foundation (projects TC239 and TC246), the Bill and Melinda Gates Foundation (grant OPP1172483) and Wellcome (215515/Z/19/Z)
    corecore