4 research outputs found
To Embed or Not: Network Embedding as a Paradigm in Computational Biology
Current technology is producing high throughput biomedical data at an ever-growing rate. A common approach to interpreting such data is through network-based analyses. Since biological networks are notoriously complex and hard to decipher, a growing body of work applies graph embedding techniques to simplify, visualize, and facilitate the analysis of the resulting networks. In this review, we survey traditional and new approaches for graph embedding and compare their application to fundamental problems in network biology with using the networks directly. We consider a broad variety of applications including protein network alignment, community detection, and protein function prediction. We find that in all of these domains both types of approaches are of value and their performance depends on the evaluation measures being used and the goal of the project. In particular, network embedding methods outshine direct methods according to some of those measures and are, thus, an essential tool in bioinformatics research
Image informatics approaches to advance cancer drug discovery
High content image-based screening assays utilise cell based models to extract and quantify morphological
phenotypes induced by small molecules. The rich datasets produced can be used to
identify lead compounds in drug discovery efforts, infer compound mechanism of action, or aid
biological understanding with the use of tool compounds. Here I present my work developing and
applying high-content image based screens of small molecules across a panel of eight genetically
and morphologically distinct breast cancer cell lines.
I implemented machine learning models to predict compound mechanism of action from morphological
data and assessed how well these models transfer to unseen cell lines, comparing the
use of numeric morphological features extracted using computer vision techniques against more
modern convolutional neural networks acting on raw image data.
The application of cell line panels have been widely used in pharmacogenomics in order to compare
the sensitivity between genetically distinct cell lines to drug treatments and identify molecular
biomarkers that predict response. I applied dimensional reduction techniques and distance metrics
to develop a measure of differential morphological response between cell lines to small molecule
treatment, which controls for the inherent morphological differences between untreated cell lines.
These methods were then applied to a screen of 13,000 lead-like small molecules across the eight
cell lines to identify compounds which produced distinct phenotypic responses between cell lines.
Putative hits from a subset of approved compounds were then validated in a three-dimensional
tumour spheroid assay to determine the functional effect of these compounds in more complex
models, as well as proteomics to determine the responsible pathways.
Using data generated from the compound screen, I carried out work towards integrating knowledge
of chemical structures with morphological data to infer mechanistic information of the unannotated
compounds, and assess structure activity relationships from cell-based imaging data