11 research outputs found

    Network Medicine Framework for Identifying Drug Repurposing Opportunities for COVID-19

    Full text link
    The current pandemic has highlighted the need for methodologies that can quickly and reliably prioritize clinically approved compounds for their potential effectiveness for SARS-CoV-2 infections. In the past decade, network medicine has developed and validated multiple predictive algorithms for drug repurposing, exploiting the sub-cellular network-based relationship between a drug's targets and disease genes. Here, we deployed algorithms relying on artificial intelligence, network diffusion, and network proximity, tasking each of them to rank 6,340 drugs for their expected efficacy against SARS-CoV-2. To test the predictions, we used as ground truth 918 drugs that had been experimentally screened in VeroE6 cells, and the list of drugs under clinical trial, that capture the medical community's assessment of drugs with potential COVID-19 efficacy. We find that while most algorithms offer predictive power for these ground truth data, no single method offers consistently reliable outcomes across all datasets and metrics. This prompted us to develop a multimodal approach that fuses the predictions of all algorithms, showing that a consensus among the different predictive methods consistently exceeds the performance of the best individual pipelines. We find that 76 of the 77 drugs that successfully reduced viral infection do not bind the proteins targeted by SARS-CoV-2, indicating that these drugs rely on network-based actions that cannot be identified using docking-based strategies. These advances offer a methodological pathway to identify repurposable drugs for future pathogens and neglected diseases underserved by the costs and extended timeline of de novo drug development

    WH 1866/67

    No full text
    <p>(A) Illustration of how the algorithm can be modified to give the initial seed proteins a higher weight α = 2 by (virtually) doubling the seed proteins while keeping their interactions. Tuning α results in different sets of detected proteins. (B,C) Comparing the performance for varying values of α in synthetic shells (B) and connectivity significance (C) modules, respectively. The best results are obtained for α = 3. (C) The performance may also saturate for α larger than a certain value. For a given disease α can be tuned to optimize the results. Performance of DIAMOnD with respect to different values of α is shown for ulcerative colitis (D) and nutritional and metabolic diseases (E). These plots suggest that at α = 2 the number of true positives is maximal. (F) Overall, α ∼10 results in the best performance of DIAMOnD across 70 diseases. The individual values may vary considerably, however, suggesting an individual optimization for best results.</p

    Biological evaluation of DIAMOnD.

    No full text
    <p>(A) Validation of the DIAMOnD genes based on GeneOntology terms (see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1004120#sec012" target="_blank">Materials & Methods</a>). (B) The significance of the similarity between DIAMOnD genes and seed genes suggests a cutoff of ∼200 DIAMOnD genes. (C) Network representation of the <i>lysosomal storage diseases</i> module. (D,E) Summary of the validation for all 70 disease modules based on GeneOntology (D) and biological pathways (E). (F) Fraction of seed proteins that are contained in the LCC of the DIAMOnD module for varying iteration steps. The distributions show the values obtained from 70 diseases. By introducing DIAMOnD proteins, previously disconnected seed proteins become part of the LCC.</p

    Comparison between DIAMOnD and Random Walk (RW).

    No full text
    <p>(A,B) Average recovery rates of DIAMOnD and the reference RW algorithm when removing 50% (100 nodes) of 100 generated <i>shells</i> (A) and <i>connectivity</i> (B) modules. (C) Comparison of the biological evidence for proteins identified by DIAMOnD and RW for <i>lysosomal storage diseases</i>. (D) Overlap between identified proteins and immediate neighbors of seed proteins. In contrast to RW, DIAMOnD includes a considerable number of proteins without first-order interactions to seed genes. (E) Comparison of the performance of DIAMOnD and RW across 70 diseases with respect to non-specific disease data. (F) Degree distributions of the identified proteins. DIAMonD proteins are characterized by the absence of hubs.</p

    Topological properties of disease proteins within the Interactome.

    No full text
    <p>(A) Proteins associated with the same phenotype tend to localize in specific neighborhoods of the Interactome, indicating the approximate location of the corresponding <i>disease modules</i>. Topological network communities are highly interconnected groups of nodes. (B) Distribution of the fraction of disease proteins within the largest connected component (LCC) for 70 diseases. Only 10%-30% of the disease proteins are part of the LCC. (C) LCC size of proteins associated with <i>lysosomal storage disease</i> compared to random expectation. Out of 45 disease proteins, 24 (53%) are part of the LCC (<i>z</i>-score = 23.42, empirical <i>p</i>-value < 10<sup>–6</sup>). (D) Significance of the LCC sizes as measured by the <i>z</i>-score for all 70 considered diseases. The whiskers indicate the minimum, 25<sup>th</sup>, 50<sup>th</sup>, 75<sup>th</sup> percentile and maximum across all diseases. Overall, 70% of the diseases show significant clustering (<i>z</i>-score>1.6). (E) LCC z-score distribution in noisy networks in which a fraction <i>f</i> of all links is randomized by either link removal or rewiring. (F) We applied three representative community detection algorithms to explore the extent to which <i>topological</i> modules correspond to <i>disease</i> modules. Only 1%-5% of the communities detected by the different methods are significantly enriched with disease proteins, none of which includes a significant fraction of all disease proteins. (G) Comparison of the distribution of the local modularity <i>R</i> for disease proteins and proteins randomly selected from the Interactome. (H) Distribution of the connectivity significance of disease proteins and randomly selected proteins. (I) Connectivity significance of disease proteins as a function of the fraction <i>f</i> of links removed from the network. The red bars denote the mean and the standard deviation as measured across 70 diseases, yellow bars show random expectation obtained from the same number of randomly distributed genes. (J) Local modularity of disease proteins and randomly selected proteins when a fraction <i>f</i> of the links is removed from the network. (K) Illustration of the local modularity <i>R</i>.</p
    corecore