31 research outputs found
NEFI: Network Extraction From Images
Networks and network-like structures are amongst the central building blocks
of many technological and biological systems. Given a mathematical graph
representation of a network, methods from graph theory enable a precise
investigation of its properties. Software for the analysis of graphs is widely
available and has been applied to graphs describing large scale networks such
as social networks, protein-interaction networks, etc. In these applications,
graph acquisition, i.e., the extraction of a mathematical graph from a network,
is relatively simple. However, for many network-like structures, e.g. leaf
venations, slime molds and mud cracks, data collection relies on images where
graph extraction requires domain-specific solutions or even manual. Here we
introduce Network Extraction From Images, NEFI, a software tool that
automatically extracts accurate graphs from images of a wide range of networks
originating in various domains. While there is previous work on graph
extraction from images, theoretical results are fully accessible only to an
expert audience and ready-to-use implementations for non-experts are rarely
available or insufficiently documented. NEFI provides a novel platform allowing
practitioners from many disciplines to easily extract graph representations
from images by supplying flexible tools from image processing, computer vision
and graph theory bundled in a convenient package. Thus, NEFI constitutes a
scalable alternative to tedious and error-prone manual graph extraction and
special purpose tools. We anticipate NEFI to enable the collection of larger
datasets by reducing the time spent on graph extraction. The analysis of these
new datasets may open up the possibility to gain new insights into the
structure and function of various types of networks. NEFI is open source and
available http://nefi.mpi-inf.mpg.de
Following the trail of cellular signatures : computational methods for the analysis of molecular high-throughput profiles
Over the last three decades, high-throughput techniques, such as next-generation sequencing, microarrays, or mass spectrometry, have revolutionized biomedical research by enabling scientists to generate detailed molecular profiles of biological samples on a large scale. These profiles are usually complex, high-dimensional, and often prone to technical noise, which makes a manual inspection practically impossible. Hence, powerful computational methods are required that enable the analysis and exploration of these data sets and thereby help researchers to gain novel insights into the underlying biology. In this thesis, we present a comprehensive collection of algorithms, tools, and databases for the integrative analysis of molecular high-throughput profiles. We developed these tools with two primary goals in mind. The detection of deregulated biological processes in complex diseases, like cancer, and the identification of driving factors within those processes. Our first contribution in this context are several major extensions of the GeneTrail web service that make it one of the most comprehensive toolboxes for the analysis of deregulated biological processes and signaling pathways. GeneTrail offers a collection of powerful enrichment and network analysis algorithms that can be used to examine genomic, epigenomic, transcriptomic, miRNomic, and proteomic data sets. In addition to approaches for the analysis of individual -omics types, our framework also provides functionality for the integrative analysis of multi-omics data sets, the investigation of time-resolved expression profiles, and the exploration of single-cell experiments. Besides the analysis of deregulated biological processes, we also focus on the identification of driving factors within those processes, in particular, miRNAs and transcriptional regulators. For miRNAs, we created the miRNA pathway dictionary database miRPathDB, which compiles links between miRNAs, target genes, and target pathways. Furthermore, it provides a variety of tools that help to study associations between them. For the analysis of transcriptional regulators, we developed REGGAE, a novel algorithm for the identification of key regulators that have a significant impact on deregulated genes, e.g., genes that show large expression differences in a comparison between disease and control samples. To analyze the influence of transcriptional regulators on deregulated biological processes,, we also created the RegulatorTrail web service. In addition to REGGAE, this tool suite compiles a range of powerful algorithms that can be used to identify key regulators in transcriptomic, proteomic, and epigenomic data sets. Moreover, we evaluate the capabilities of our tool suite through several case studies that highlight the versatility and potential of our framework. In particular, we used our tools to conducted a detailed analysis of a Wilms' tumor data set. Here, we could identify a circuitry of regulatory mechanisms, including new potential biomarkers, that might contribute to the blastemal subtype's increased malignancy, which could potentially lead to new therapeutic strategies for Wilms' tumors. In summary, we present and evaluate a comprehensive framework of powerful algorithms, tools, and databases to analyze molecular high-throughput profiles. The provided methods are of broad interest to the scientific community and can help to elucidate complex pathogenic mechanisms.Heutzutage werden molekulare Hochdurchsatzmessverfahren, wie Hochdurchsatzsequenzierung, Microarrays, oder Massenspektrometrie, regelmäßig angewendet, um Zellen im großen Stil und auf verschiedenen molekularen Ebenen zu charakterisieren. Die dabei generierten Datensätze sind in der Regel hochdimensional und oft verrauscht. Daher werden leistungsfähige computergestützte Anwendungen benötigt, um deren Analyse zu ermöglichen. In dieser Arbeit präsentieren wir eine Reihe von effektiven Algorithmen, Programmen, und Datenbaken für die Analyse von molekularen Hochdurchsetzdatensätzen. Diese Ansätze wurden entwickelt, um deregulierte biologische Prozesse zu untersuchen und in diesen wichtige Schlüsselmoleküle zu identifizieren. Zusätzlich wurden eine Reihe von Analysen durchgeführt um die verschiedenen Methoden zu evaluieren. Zu diesem Zweck haben wir insbesondere eine Wilmstumor Studie durchgeführt, in der wir verschiedene regulatorische Mechanismen und dazugehörige Biomarker identifizieren konnten, die für die erhöhte Malignität von Wilmstumoren mit blastemreichen Subtyp verantwortlich sein könnten. Diese Erkenntnisse könnten in der Zukunft zu einer verbesserten Behandlung dieser Tumore führen. Diese Ergebnisse zeigen eindrucksvoll, dass unsere Ansätze in der Lage sind, verschiedene molekulare Hochdurchsatzmessungen auszuwerten und dabei helfen können pathogene Mechanismen im Zusammenhang mit Krebs oder anderen komplexen Krankheiten aufzuklären
Simultaneous regression and classification for drug sensitivity prediction using an advanced random forest method
Machine learning methods trained on cancer cell line panels are intensively studied for the prediction
of optimal anti-cancer therapies. While classifcation approaches distinguish efective from inefective
drugs, regression approaches aim to quantify the degree of drug efectiveness. However, the high
specifcity of most anti-cancer drugs induces a skewed distribution of drug response values in favor of
the more drug-resistant cell lines, negatively afecting the classifcation performance (class imbalance)
and regression performance (regression imbalance) for the sensitive cell lines. Here, we present a
novel approach called SimultAneoUs Regression and classifcatiON Random Forests (SAURON-RF)
based on the idea of performing a joint regression and classifcation analysis. We demonstrate that
SAURON-RF improves the classifcation and regression performance for the sensitive cell lines at the
expense of a moderate loss for the resistant ones. Furthermore, our results show that simultaneous
classifcation and regression can be superior to regression or classifcation alone
miRCarta: a central repository for collecting miRNA candidates
The continuous increase of available biological data as consequence of modern high-throughput technologies poses new challenges for analysis techniques and database applications. Especially for miRNAs, one class of small non-coding RNAs, many algorithms have been developed to predict new candidates from next-generation sequencing data. While the amount of publications describing novel miRNA candidates keeps steadily increasing, the current gold standard database for miRNAs - miRBase - has not been updated since June 2014. As a result, publications describing new miRNA candidates in the last three to five years might have a substantial overlap of candidates without noticing. With miRCarta we implemented a database to collect novel miRNA candidates and augment the information provided by miRBase. In the first stage, miRCarta is thought to be a highly sensitive collection of potential miRNA candidates with a high degree of analysis functionality, annotations and details on each miRNA. We added—besides the full content of the miRBase—12,857 human miRNA precursors to miRCarta. Users can match their own predictions to the entries of miRCarta to reduce potential redundancies in their studies. miRCarta provides the most comprehensive collection of human miRNAs and miRNA candidates to form a basis for further refinement and validation studies. The database is freely accessible at https://mircarta.cs.uni-saarland.de/
miRPathDB 2.0: a novel release of the miRNA Pathway Dictionary Database
Since the initial release of miRPathDB, tremendous progress has been made in the field of microRNA (miRNA) research. New miRNA reference databases have emerged, a vast amount of new miRNA candidates has been discovered and the number of experimentally validated target genes has increased considerably. Hence, the demand for a major upgrade of miRPathDB, including extended analysis functionality and intuitive visualizations of query results has emerged. Here, we present the novel release 2.0 of the miRNA Pathway Dictionary Database (miRPathDB) that is freely accessible at https://mpd.bioinf.uni-sb.de/. miRPathDB 2.0 comes with a ten-fold increase of pre-processed data. In total, the updated database provides putative associations between 27 452 (candidate) miRNAs, 28 352 targets and 16 833 pathways for Homo sapiens, as well as interactions of 1978 miRNAs, 24 898 targets and 6511 functional categories for Mus musculus. Additionally, we analyzed publications citing miRPathDB to identify common use-cases and further extensions. Based on this evaluation, we added new functionality for interactive visualizations and down-stream analyses of bulk queries. In summary, the updated version of miRPathDB, with its new custom-tailored features, is one of the most comprehensive and advanced resources for miRNAs and their target pathways
miR-34a as hub of T cell regulation networks
Background: Micro(mi)RNAs are increasingly recognized as central regulators of immune cell function. While it has
been predicted that miRNAs have multiple targets, the majority of these predictions still await experimental confirmation.
Here, miR-34a, a well-known tumor suppressor, is analyzed for targeting genes involved in immune system processes of
leucocytes.
Methods: Using an in-silico approach, we combined miRNA target prediction with GeneTrail2, a web tool for Multi-omics
enrichment analysis, to identify miR-34a target genes, which are involved in the immune system process subcategory of
Gene Ontology.
Results: Out of the 193 predicted target genes in this subcategory we experimentally tested 22 target genes and
confirmed binding of miR-34a to 14 target genes including VAMP2, IKBKE, MYH9, MARCH8, KLRK1, CD11A, TRAFD1, CCR1,
PYDC1, PRF1, PIK3R2, PIK3CD, AP1B1, and ADAM10 by dual luciferase assays. By transfecting Jurkat, primary CD4+ and CD8+
T cells with miR-34a, we demonstrated that ectopic expression of miR-34a leads to reduced levels of endogenous VAMP2
and CD11A, which are central to the analyzed subcategories. Functional downstream analysis of miR-34a over-expression
in activated CD8+ T cells exhibits a distinct decrease of PRF1 secretion.
Conclusions: By simultaneous targeting of 14 mRNAs miR-34a acts as major hub of T cell regulatory networks suggesting
to utilize miR-34a as target of intervention towards a modulation of the immune responsiveness of T-cells in a broad
tumor context
Wrinkle in the plan: miR-34a-5p impacts chemokine signaling by modulating CXCL10/CXCL11/CXCR3-axis in CD4+, CD8+ T cells, and M1 macrophages
Background In 2016 the first-in-human phase I study of a miRNA-based cancer therapy with a liposomal mimic of microRNA-34a-5p (miR-34a-5p) was closed due to five immune related serious adverse events (SAEs) resulting in four patient deaths. For future applications of miRNA mimics in cancer therapy it is mandatory to unravel the miRNA effects both on the tumor tissue and on immune cells. Here, we set out to analyze the impact of miR-34a-5p over-expression on the CXCL10/CXCL11/CXCR3 axis, which is central for the development of an effective cancer control.
Methods We performed a whole genome expression analysis of miR-34a-5p transfected M1 macrophages followed by an over-representation and a protein–protein network analysis. In-silico miRNA target prediction and dual luciferase assays were used for target identification and verification. Target genes involved in chemokine signaling were functionally analyzed in M1 macrophages, CD4+ and CD8+ T cells.
Results A whole genome expression analysis of M1 macrophages with induced miR-34a-5p over-expression revealed an interaction network of downregulated target mRNAs including CXCL10 and CXCL11. In-silico target prediction in combination with dual luciferase assays identified direct binding of miR-34a-5p to the 3′UTRs of CXCL10 and CXCL11. Decreased CXCL10 and CXCL11 secretion was shown on the endogenous protein level and in the supernatant of miR-34a-5p transfected and activated M1 macrophages. To complete the analysis of the CXCL10/CXCL11/CXCR3 axis, we activated miR-34a-5p transfected CD4+ and CD8+ T cells by PMA/Ionomycin and found reduced levels of endogenous CXCR3 and CXCR3 on the cell surface.
Conclusions MiR-34a-5p mimic administered by intravenous administration will likely not only be up-taken by the tumor cells but also by the immune cells. Our results indicate that miR-34a-5p over-expression leads in M1 macrophages to a reduced secretion of CXCL10 and CXCL11 chemokines and in CD4+ and CD8+ T cells to a reduced expression of CXCR3. As a result, less immune cells will be attracted to the tumor site. Furthermore, high levels of miR-34a-5p in naive CD4+ T cells can in turn hinder Th1 cell polarization through the downregulation of CXCR3 leading to a less pronounced activation of cytotoxic T lymphocytes, natural killer, and natural killer T cells and possibly contributing to lymphocytopenia
Quantitative and time-resolved miRNA pattern of early human T cell activation
T cells are central to the immune response against
various pathogens and cancer cells. Complex networks of transcriptional and post-transcriptional
regulators, including microRNAs (miRNAs), coordinate the T cell activation process. Available miRNA
datasets, however, do not sufficiently dissolve the
dynamic changes of miRNA controlled networks
upon T cell activation. Here, we established a quantitative and time-resolved expression pattern for the
entire miRNome over a period of 24 h upon human Tcell activation. Based on our time-resolved datasets,
we identified central miRNAs and specified common miRNA expression profiles. We found the most
prominent quantitative expression changes for miR155-5p with a range from initially 40 molecules/cell to
1600 molecules/cell upon T-cell activation. We established a comprehensive dynamic regulatory network
of both the up- and downstream regulation of miR155. Upstream, we highlight IRF4 and its complexes
with SPI1 and BATF as central for the transcriptional
regulation of miR-155. Downstream of miR-155-5p,
we verified 17 of its target genes by the time-resolved
data recorded after T cell activation. Our data provide comprehensive insights into the range of stimulus induced miRNA abundance changes and lay the
ground to identify efficient points of intervention for
modifying the T cell response
GeneTrail 3: advanced high-throughput enrichment analysis
We present GeneTrail 3, a major extension of our web
service GeneTrail that offers rich functionality for the
identification, analysis, and visualization of deregulated biological processes. Our web service provides
a comprehensive collection of biological processes
and signaling pathways for 12 model organisms that
can be analyzed with a powerful framework for enrichment and network analysis of transcriptomic,
miRNomic, proteomic, and genomic data sets. Moreover, GeneTrail offers novel workflows for the analysis of epigenetic marks, time series experiments,
and single cell data. We demonstrate the capabilities
of our web service in two case-studies, which highlight that GeneTrail is well equipped for uncovering
complex molecular mechanisms. GeneTrail is freely
accessible at: http://genetrail.bioinf.uni-sb.de