110 research outputs found
Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB
<p>Abstract</p> <p>Background</p> <p>The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However there is absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided, heavily depend on the programming skills of the user, whereas in the case of GUI embedded solutions, they do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods.</p> <p>Results</p> <p>We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports 2 coloured cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime.</p> <p>Conclusion</p> <p>Gene ARMADA provides a highly adaptable, integrative, yet flexible tool which can be used for automated quality control, analysis, annotation and visualization of microarray data, constituting a starting point for further data interpretation and integration with numerous other tools.</p
KEGGconverter: a tool for the in-silico modelling of metabolic networks of the KEGG Pathways database
<p>Abstract</p> <p>Background</p> <p>The KEGG Pathway database is a valuable collection of metabolic pathway maps. Nevertheless, the production of simulation capable metabolic networks from KEGG Pathway data is a challenging complicated work, regardless the already developed tools for this scope. Originally used for illustration purposes, KEGG Pathways through KGML (KEGG Markup Language) files, can provide complete reaction sets and introduce species versioning, which offers advantages for the scope of cellular metabolism simulation modelling. In this project, KEGGconverter is described, implemented also as a web-based application, which uses as source KGML files, in order to construct integrated pathway SBML models fully functional for simulation purposes.</p> <p>Results</p> <p>A case study of the integration of six human metabolic pathways from KEGG depicts the ability of KEGGconverter to automatically produce merged and converted to SBML fully functional pathway models, enhanced with default kinetics. The suitability of the developed tool is demonstrated through a comparison with other state-of-the art relevant software tools for the same data fusion and conversion tasks, thus illustrating the problems and the relevant workflows. Moreover, KEGGconverter permits the inclusion of additional reactions in the resulting model which represent flux cross-talk with neighbouring pathways, providing in this way improved simulative accuracy. These additional reactions are introduced by exploiting relevant semantic information for the elements of the KEGG Pathways database. The architecture and functionalities of the web-based application are presented.</p> <p>Conclusion</p> <p>KEGGconverter is capable of producing integrated analogues of metabolic pathways appropriate for simulation tasks, by inputting only KGML files. The web application acts as a user friendly shell which transparently enables the automated biochemically correct pathway merging, conversion to SBML format, proper renaming of the species, and insertion of default kinetic properties for the pertaining reactions. The tool is available at: <url>http://www.grissom.gr/keggconverter</url></p
KEGGconverter: a tool for the in-silico modelling of metabolic networks of the KEGG Pathways database
Journal URL: http://www.biomedcentral.com/bmcbioinformatics
Int J Mol Sci
Phage-displayed peptide selections generate complex repertoires of several hundred thousand peptides as revealed by next-generation sequencing (NGS). In repeated peptide selections, however, even in identical experimental in vitro conditions, only a very small number of common peptides are found. The repertoire complexities are evidence of the difficulty of distinguishing between effective selections of specific peptide binders to exposed targets and the potential high background noise. Such investigation is even more relevant when considering the plethora of in vivo expressed targets on cells, in organs or in the entire organism to define targeting peptide agents. In the present study, we compare the published NGS data of three peptide repertoires that were obtained by phage display under identical experimental in vitro conditions. By applying the recently developed tool PepSimili we evaluate the calculated similarities of the individual peptides from each of these three repertoires and perform their mappings on the human proteome. The peptide-to-peptide mappings reveal high similarities among the three repertoires, confirming the desired reproducibility of phage-displayed peptide selections
CD90/Thy-1, a Cancer-Associated Cell Surface Signaling Molecule
CD90 is a membrane GPI-anchored protein with one Ig V-type superfamily domain that was initially described in mouse T cells. Besides the specific expression pattern and functions of CD90 that were described in normal tissues, i.e., neurons, fibroblasts and T cells, increasing evidences are currently highlighting the possible involvement of CD90 in cancer. This review first provides a brief overview on CD90 gene, mRNA and protein features and then describes the established links between CD90 and cancer. Finally, we report newly uncovered functional connections between CD90 and endoplasmic reticulum (ER) stress signaling and discuss their potential impact on cancer development
Genetic mapping of novel modifiers for ApcMin induced intestinal polyps’ development using the genetic architecture power of the collaborative cross mice
Abstract Background Familial adenomatous polyposis is an inherited genetic disease, characterized by colorectal polyps. It is caused by inactivating mutations in the Adenomatous polyposis coli (Apc) gene. Mice carrying a nonsense mutation in the Apc gene at R850, which is designated Apc Min/+ (Multiple intestinal neoplasia), develop intestinal adenomas. Several genetic modifier loci of Min (Mom) were previously mapped, but so far, most of the underlying genes have not been identified. To identify novel modifier loci associated with Apc Min/+ , we performed quantitative trait loci (QTL) analysis for polyp development using 49 F1 crosses between different Collaborative Cross (CC) lines and C57BL/6 J-Apc Min/+ mice. The CC population is a genetic reference panel of recombinant inbred lines, each line independently descended from eight genetically diverse founder strains. C57BL/6 J-Apc Min/+ males were mated with females from 49 CC lines. F1 offspring were terminated at 23 weeks and polyp counts from three sub-regions (SB1–3) of small intestinal and colon were recorded. Results The number of polyps in all these sub-regions and colon varied significantly between the different CC lines. At 95% genome-wide significance, we mapped nine novel QTL for variation in polyp number, with distinct QTL associated with each intestinal sub-region. QTL confidence intervals varied in width between 2.63–17.79 Mb. We extracted all genes in the mapped QTL at 90 and 95% CI levels using the BioInfoMiner online platform to extract, significantly enriched pathways and key linker genes, that act as regulatory and orchestrators of the phenotypic landscape associated with the Apc Min/+ mutation. Conclusions Genomic structure of the CC lines has allowed us to identify novel modifiers and confirmed some of the previously mapped modifiers. Key genes involved mainly in metabolic and immunological processes were identified. Future steps in this analysis will be to identify regulatory elements – and possible epistatic effects – located in the mapped QTL
ANASTASIA: An Automated Metagenomic Analysis Pipeline for Novel Enzyme Discovery Exploiting Next Generation Sequencing Data
Metagenomic analysis of environmental samples provides deep insight into the enzymatic mixture of the corresponding niches, capable of revealing peptide sequences with novel functional properties exploiting the high performance of next-generation sequencing (NGS) technologies. At the same time due to their ever increasing complexity, there is a compelling need for ever larger computational configurations to ensure proper bioinformatic analysis, and fine annotation. With the aiming to address the challenges of such an endeavor, we have developed a novel web-based application named ANASTASIA (automated nucleotide aminoacid sequences translational plAtform for systemic interpretation and analysis). ANASTASIA provides a rich environment of bioinformatic tools, either publicly available or novel, proprietary algorithms, integrated within numerous automated algorithmic workflows, and which enables versatile data processing tasks for (meta)genomic sequence datasets. ANASTASIA was initially developed in the framework of the European FP7 project HotZyme, whose aim was to perform exhaustive analysis of metagenomes derived from thermal springs around the globe and to discover new enzymes of industrial interest. ANASTASIA has evolved to become a stable and extensible environment for diversified, metagenomic, functional analyses for a range of applications overarching industrial biotechnology to biomedicine, within the frames of the ELIXIR-GR project. As a showcase, we report the successful in silico mining of a novel thermostable esterase termed “EstDZ4” from a metagenomic sample collected from a hot spring located in Krisuvik, Iceland
Discovery and characterization of a thermostable and highly halotolerant GH5 cellulase from an Icelandic hot spring isolate
Journal ArticleCopyright: © 2016 Zarafeta et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.With the ultimate goal of identifying robust cellulases for industrial biocatalytic conversions, we have isolated and characterized a new thermostable and very halotolerant GH5 cellulase. This new enzyme, termed CelDZ1, was identified by bioinformatic analysis from the genome of a polysaccharide-enrichment culture isolate, initiated from material collected from an Icelandic hot spring. Biochemical characterization of CelDZ1 revealed that it is a glycoside hydrolase with optimal activity at 70°C and pH 5.0 that exhibits good thermostability, high halotolerance at near-saturating salt concentrations, and resistance towards metal ions and other denaturing agents. X-ray crystallography of the new enzyme showed that CelDZ1 is the first reported cellulase structure that lacks the defined sugar-binding 2 subsite and revealed structural features which provide potential explanations of its biochemical characteristics.This work has been carried out in the framework of the HotZyme Project (http://hotzyme.com, grant agreement no. 265933) financed by the European Union 7th Framework Programme FP7/2007-2013, an EU FP7 Collaborative programme
Genetic drivers of head and neck squamous cell carcinoma : aberrant splicing events, mutational burden, hpv infection and future targets
Head and neck cancers include cancers that originate from a variety of locations. These
include the mouth, nasal cavity, throat, sinuses, and salivary glands. These cancers are the sixth most
diagnosed cancers worldwide. Due to the tissues they arise from, they are collectively named head
and neck squamous cell carcinomas (HNSCC). The most important risk factors for head and neck
cancers are infection with human papillomavirus (HPV), tobacco use and alcohol consumption. The
genetic basis behind the development and progression of HNSCC includes aberrant non-coding RNA
levels. However, one of the most important differences between healthy tissue and HNSCC tissue is
changes in the alternative splicing of genes that play a vital role in processes that can be described as
the hallmarks of cancer. These changes in the expression profile of alternately spliced mRNA give rise
to various protein isoforms. These protein isoforms, alternate methylation of proteins, and changes
in the transcription of non-coding RNAs (ncRNA) can be used as diagnostic or prognostic markers
and as targets for the development of new therapeutic agents. This review aims to describe changes
in alternative splicing and ncRNA patterns that contribute to the development and progression of
HNSCC. It will also review the use of the changes in gene expression as biomarkers or as the basis
for the development of new therapies.The South African Medical Research Councilhttps://www.mdpi.com/journal/genesam2022Community DentistryMaxillo-Facial and Oral SurgerySurger
- …