34 research outputs found
"DEVELOPMENT OF A DECISION SUPPORT SYSTEM FOR BIOINFORMATICS. EXTRACTION OF PROTEIN COMPLEXES FROM A PROTEIN-PROTEIN INTERACTION NETWORK: A CASE STUDY"
Decision Support Systems and Workflow Management Systems have
become essential tools for some business and scientific field. This
thesis propose a new hybrid architecture for problem solving expertise
and decision-making process, that aims to support high-quality
research in the field of bioinformatics and system biology.
The first part of the dissertation introduces the project to which belong
this thesis work, i.e. the “Bioinformatics Organized Resources -
an Intelligent System” (BORIS) project of the ICAR-CNR; the main
goal of BORIS is to provide an helpful and effective support to researchers
or experimentalist, that have no familiarity with tools and
techniques to solve computational problems in bioinformatics and system
biology.
In the second part of the thesis, the proposed hybrid architecture is
described in detail; it introduces a three-dimensional space for the
BORIS system, where the viewpoints of declarative, procedural and
process approaches are considered. Using the proposed architecture,
the system is able to help the experimentalist choosing, for a given
problem, the right tool at the right moment, to generate a navigable
Workflow at different abstraction layers, extending current workflow
management systems and to free the user from implementation details,
assisting him in the correct configuration of algorithms/services.
A case study about extraction of protein complexes from proteinprotein
interaction networks is presented, in order to show how the
system faces a problem and how it interacts with the user
Recommended from our members
Fast training of self organizing maps for the visual exploration of molecular compounds
Visual exploration of scientific data in life science
area is a growing research field due to the large amount of
available data. The Kohonen’s Self Organizing Map (SOM) is
a widely used tool for visualization of multidimensional data.
In this paper we present a fast learning algorithm for SOMs
that uses a simulated annealing method to adapt the learning
parameters. The algorithm has been adopted in a data analysis
framework for the generation of similarity maps. Such maps
provide an effective tool for the visual exploration of large and
multi-dimensional input spaces. The approach has been applied
to data generated during the High Throughput Screening
of molecular compounds; the generated maps allow a visual
exploration of molecules with similar topological properties.
The experimental analysis on real world data from the
National Cancer Institute shows the speed up of the proposed
SOM training process in comparison to a traditional approach.
The resulting visual landscape groups molecules with similar
chemical properties in densely connected regions
Recommended from our members
Context-aware visual exploration of molecular databases
Facilitating the visual exploration of scientific data has
received increasing attention in the past decade or so. Especially
in life science related application areas the amount
of available data has grown at a breath taking pace. In this
paper we describe an approach that allows for visual inspection
of large collections of molecular compounds. In
contrast to classical visualizations of such spaces we incorporate
a specific focus of analysis, for example the outcome
of a biological experiment such as high throughout
screening results. The presented method uses this experimental
data to select molecular fragments of the underlying
molecules that have interesting properties and uses the
resulting space to generate a two dimensional map based
on a singular value decomposition algorithm and a self organizing
map. Experiments on real datasets show that
the resulting visual landscape groups molecules of similar
chemical properties in densely connected regions
GOWDL: gene ontology-driven wide and deep learning model for cell typing of scRNA-seq data
Single-cell RNA-sequencing (scRNA-seq) allows for obtaining genomic and transcriptomic profiles of individual cells. That data make it possible to characterize tissues at the cell level. In this context, one of the main analyses exploiting scRNA-seq data is identifying the cell types within tissue to estimate the quantitative composition of cell populations. Due to the massive amount of available scRNA-seq data, automatic classification approaches for cell typing, based on the most recent deep learning technology, are needed. Here, we present the gene ontology-driven wide and deep learning (GOWDL) model for classifying cell types in several tissues. GOWDL implements a hybrid architecture that considers the functional annotations found in Gene Ontology and the marker genes typical of specific cell types. We performed cross-validation and independent external testing, comparing our algorithm with 12 other state-of-the-art predictors. Classification scores demonstrated that GOWDL reached the best results over five different tissues, except for recall, where we got about 92% versus 97% of the best tool. Finally, we presented a case study on classifying immune cell populations in breast cancer using a hierarchical approach based on GOWDL
Recommended from our members
The BioDICE Taverna plugin for clustering and visualization of biological data: a workflow for molecular compounds exploration
Background: In many experimental pipelines, clustering of multidimensional biological datasets is used to detect
hidden structures in unlabelled input data. Taverna is a popular workflow management system that is used to design
and execute scientific workflows and aid in silico experimentation. The availability of fast unsupervised methods for clustering and visualization in the Taverna platform is important to support a data-driven scientific discovery in complex and explorative bioinformatics applications.
Results: This work presents a Taverna plugin, the Biological Data Interactive Clustering Explorer (BioDICE), that performs clustering of high-dimensional biological data and provides a nonlinear, topology preserving projection for the visualization of the input data and their similarities. The core algorithm in the BioDICE plugin is Fast Learning Self Organizing Map (FLSOM), which is an improved variant of the Self Organizing Map (SOM) algorithm. The plugin generates an interactive 2D map that allows the visual exploration of multidimensional data and the identification of groups of similar objects. The effectiveness of the plugin is demonstrated on a case study related to chemical
compounds.
Conclusions: The number and variety of available tools and its extensibility have made Taverna a popular choice for the development of scientific data workflows. This work presents a novel plugin, BioDICE, which adds a data-driven knowledge discovery component to Taverna. BioDICE provides an effective and powerful clustering tool, which can be adopted for the explorative analysis of biological datasets
Transcriptomic Analyses Reveal 2 and 4 Family Members of Cytochromes P450 (CYP) Involved in LPS Inflammatory Response in Pharynx of Ciona robusta
Cytochromes P450 (CYP) are enzymes responsible for the biotransformation of most en- dogenous and exogenous agents. The expression of each CYP is influenced by a unique combination of mechanisms and factors including genetic polymorphisms, induction by xenobiotics, and regula- tion by cytokines and hormones. In recent years, Ciona robusta, one of the closest living relatives of vertebrates, has become a model in various fields of biology, in particular for studying inflammatory response. Using an in vivo LPS exposure strategy, next-generation sequencing (NGS) and qRT-PCR combined with bioinformatics and in silico analyses, compared whole pharynx transcripts from naĂŻve and LPS-exposed C. robusta, and we provide the first view of cytochrome genes expression and miRNA regulation in the inflammatory response induced by LPS in a hematopoietic organ. In C. robusta, cytochromes belonging to 2B,2C, 2J, 2U, 4B and 4F subfamilies were deregulated and miRNA network interactions suggest that different conserved and species-specific miRNAs are involved in post-transcriptional regulation of cytochrome genes and that there could be an interplay between specific miRNAs regulating both inflammation and cytochrome molecules in the inflammatory response in C. robusta
Impact of the flame retardant 2,2’4,4’-tetrabromodiphenyl ether (PBDE-47) in THP-1 macrophage-like cell function via small extracellular vesicles
2,2’4,4’-tetrabromodiphenyl ether (PBDE-47) is one of the most widespread environmental brominated flame-retardant congeners which has also been detected in animal and human tissues. Several studies have reported the effects of PBDEs on different health issues, including neurobehavioral and developmental disorders, reproductive health, and alterations of thyroid function. Much less is known about its immunotoxicity. The aim of our study was to investigate the effects that treatment of THP-1 macrophage-like cells with PBDE-47 could have on the content of small extracellular vesicles’ (sEVs) microRNA (miRNA) cargo and their downstream effects on bystander macrophages. To achieve this, we purified sEVs from PBDE-47 treated M(LPS) THP-1 macrophage-like cells (sEVsPBDE+LPS) by means of ultra-centrifugation and characterized their miRNA cargo by microarray analysis detecting the modulation of 18 miRNAs. Furthermore, resting THP-1 derived M(0) macrophage-like cells were cultured with sEVsPBDE+LPS, showing that the treatment reshaped the miRNA profiles of 12 intracellular miRNAs. This dataset was studied in silico, identifying the biological pathways affected by these target genes. This analysis identified 12 pathways all involved in the maturation and polarization of macrophages. Therefore, to evaluate whether sEVsPBDE+LPS can have some immunomodulatory activity, naïve M(0) THP-1 macrophage-like cells cultured with purified sEVsPBDE+LPS were studied for IL-6, TNF-α and TGF-β mRNAs expression and immune stained with the HLA-DR, CD80, CCR7, CD38 and CD209 antigens and analyzed by flow cytometry. This analysis showed that the PBDE-47 treatment does not induce the expression of specific M1 and M2 cytokine markers of differentiation and may have impaired the ability to make immunological synapses and present antigens, down-regulating the expression of HLA-DR and CD209 antigens. Overall, our study supports the model that perturbation of miRNA cargo by PBDE-47 treatment contributes to the rewiring of cellular regulatory pathways capable of inducing perturbation of differentiation markers on naïve resting M(0) THP-1 macrophage-like cells
A new SOM Initialization Algorithm for Nonvectorial Data
Self Organizing Maps (SOMs) are widely used mapping and clustering algorithms family. It is also well known that the performances of the maps in terms of quality of result and learning speed are strongly dependent from the neuron weights initialization. This drawback is common to all the SOM algorithms, and critical for a new SOM algorithm, the Median SOM (M-SOM), developed in order to map datasets characterized by a dissimilarity matrix. In this paper an initialization technique of M-SOM is proposed and compared to the initialization techniques proposed in the original paper. The results show that the proposed initialization technique assures faster learning and better performance in terms of quantization error