78,705 research outputs found
Pathway-based Analysis with Support Vector Machine (SVM-LASSO) for Gene Selection and Classification
Genomic knowledge has become a popular research field in bioinformatics biological process that providing further biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to determine the appropriate diseases accurately. This is because of existing non-informative genes that could be included in the analysis of context specific data like cancer gene expression data, which affect the classification performance. This study proposed a pathway-based analysis for gene classification. Pathway-based analysis enable handling microarray data in order to improved biological interpretation of the analysis outcome. Secondly, Support Vector Machine with Least Absolute Shrinkage and Selection Operator algorithm (SVM-LASSO) is proposed, which to find informative genes for each pathway to ensure efficient gene selection and classification in every pathway. Experiments are done using lung cancer dataset and breast cancer dataset that widely used in cancer classification area. A stratified 10-fold cross validation is implement to evaluate the performance of the proposed method in terms of accuracy, specificity and sensitivity. Moreover, biological validation have been done on the selected genes based on biological literatures and biological databases. Next, the results from the proposed methods are compared with the previous study throughout all the data sets in terms of performance. As conclusion, this research finding can contribute in biology area especially in cancer classification area
Pathway-based analysis with Support Vector Machine (SVM-LASSO) for gene selection and classification
Genomic knowledge has become a popular research field in bioinformatics biological process that providing further
biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to determine the appropriate diseases accurately. This is because of existing noninformative genes that could be included in the analysis of context-specific data like cancer gene expression data, which affect the classification performance. This study proposed a pathway-based analysis for gene classification. Pathway-based analysis enables handling microarray data in order to improve biological interpretation of the analysis outcome. Secondly, Support Vector Machine with Least Absolute Shrinkage and Selection Operator algorithm (SVM-LASSO) is proposed, which to find informative genes for each pathway to ensure efficient gene selection and classification in every pathway. Experiments are done using lung cancer dataset and breast cancer dataset that widely used in cancer classification area. A stratified 10-fold cross validation is implemented to evaluate the performance of the proposed method in terms of accuracy, specificity, and sensitivity. Moreover, biological validation has been done on the selected genes based on biological literature and biological databases. Next, the results from the proposed methods are compared with the previous study throughout all the data sets in terms of performance. As a conclusion, this research finding can contribute in biology area especially in cancer classification area
DNA expression microarrays may be the wrong tool to identify biological pathways
DNA microarray expression signatures are expected to provide new insights into patho- physiological pathways. Numerous variant statistical methods have been described for each step of the signal analysis. We employed five similar statistical tests on the same data set at the level of gene selection. Inter-test agreement for the identification of biological pathways in BioCarta, KEGG and Reactome was calculated using Cohen’s k- score. The identification of specific biological pathways showed only moderate agreement (0.30 < k < 0.79) between the analysis methods used. Pathways identified by microarrays must be treated cautiously as they vary according to the statistical method used
Recommended from our members
Enhanced anticancer activity of a combination of docetaxel and Aneustat (OMN54) in a patient-derived, advanced prostate cancer tissue xenograft model.
The current first-line treatment for advanced metastatic prostate cancer, i.e. docetaxel-based therapy, is only marginally effective. The aim of the present study was to determine whether such therapy can be improved by combining docetaxel with Aneustat (OMN54), a multivalent botanical drug candidate shown to have anti-prostate cancer activity in preliminary in vitro experiments, which is currently undergoing a Phase-I Clinical Trial. Human metastatic, androgen-independent C4-2 prostate cancer cells and NOD-SCID mice bearing PTEN-deficient, metastatic and PSA-secreting, patient-derived subrenal capsule LTL-313H prostate cancer tissue xenografts were treated with docetaxel and Aneustat, alone and in combination. In vitro, Aneustat markedly inhibited C4-2 cell replication in a dose-dependent manner. When Aneustat was combined with docetaxel, the growth inhibitions of the drugs were essentially additive. In vivo, however, the combination of docetaxel and Aneustat enhanced anti-tumor activity synergistically and very markedly, without inducing major host toxicity. Complete growth inhibition and shrinkage of the xenografts could be obtained with the combined drugs as distinct from the drugs on their own. Analysis of the gene expression of the xenografts using microarray indicated that docetaxel + Aneustat led to expanded anticancer activity, in particular to targeting of cancer hallmarks that were not affected by the single drugs. Our findings, obtained with a highly clinically relevant prostate cancer model, suggest, for the first time, that docetaxel-based therapy of advanced human prostate cancer may be improved by combining docetaxel with Aneustat
Dys-regulated Gene Expression Networks by Meta-Analysis of Microarray Data on Oral Squamous Cell Carcinoma
Background: Oral squamous cell carcinoma (OSCC) is the sixth most common type of carcinoma worldwide. Development of OSCC is a multi-step process involving genes related to cell cycle, growth control, apoptosis, DNA damage response and other cellular regulators. The pathogenic pathways involved in this tumor are mostly unknown and therefore a better characterization of OSCC gene expression profile would represent a considerable advance. The availability of publicly available gene expression datasets has opened up new challenges especially for the integration of data generated by different research groups and different array platforms with the purpose of obtaining new insights on the biological process investigated.

Results: In this work we performed a meta-analysis on four microarray and four datasets of gene expression data on OSCC in order to evaluate the degree of agreement of the biological results obtained by these different studies and to identify common regulatory pathways that could be responsible of tumor growth. Sixteen dys-regulated pathways implicated in OSCC were mined out from the four published datasets, and most importantly three pathways were first reported. Those regulatory pathways and biological processes which are significantly enriched have been investigated by means of literatures and meanwhile, four genes of the maximally altered pathways, ECM-receptor interaction, were validated and identified by qRT-PCR as a possible candidate of aggressiveness of OSCC.

Conclusion: we have developed a robust method for analyzing pathways altered in OSCC using three expression array data sets. This study sets a stage for the further discovery of the basic mechanisms that may underlie a diseased state and would help in identifying critical nodes in the pathway that can be targeted for diagnosis and therapeutic intervention. In addition, those who are interested in our approach can obtain the software package (MATLAB platform) by email freely
Recommended from our members
Prediction of regulatory targets of alternative isoforms of the epidermal growth factor receptor in a glioblastoma cell line.
BackgroundThe epidermal growth factor receptor (EGFR) is a major regulator of proliferation in tumor cells. Elevated expression levels of EGFR are associated with prognosis and clinical outcomes of patients in a variety of tumor types. There are at least four splice variants of the mRNA encoding four protein isoforms of EGFR in humans, named I through IV. EGFR isoform I is the full-length protein, whereas isoforms II-IV are shorter protein isoforms. Nevertheless, all EGFR isoforms bind the epidermal growth factor (EGF). Although EGFR is an essential target of long-established and successful tumor therapeutics, the exact function and biomarker potential of alternative EGFR isoforms II-IV are unclear, motivating more in-depth analyses. Hence, we analyzed transcriptome data from glioblastoma cell line SF767 to predict target genes regulated by EGFR isoforms II-IV, but not by EGFR isoform I nor other receptors such as HER2, HER3, or HER4.ResultsWe analyzed the differential expression of potential target genes in a glioblastoma cell line in two nested RNAi experimental conditions and one negative control, contrasting expression with EGF stimulation against expression without EGF stimulation. In one RNAi experiment, we selectively knocked down EGFR splice variant I, while in the other we knocked down all four EGFR splice variants, so the associated effects of EGFR II-IV knock-down can only be inferred indirectly. For this type of nested experimental design, we developed a two-step bioinformatics approach based on the Bayesian Information Criterion for predicting putative target genes of EGFR isoforms II-IV. Finally, we experimentally validated a set of six putative target genes, and we found that qPCR validations confirmed the predictions in all cases.ConclusionsBy performing RNAi experiments for three poorly investigated EGFR isoforms, we were able to successfully predict 1140 putative target genes specifically regulated by EGFR isoforms II-IV using the developed Bayesian Gene Selection Criterion (BGSC) approach. This approach is easily utilizable for the analysis of data of other nested experimental designs, and we provide an implementation in R that is easily adaptable to similar data or experimental designs together with all raw datasets used in this study in the BGSC repository, https://github.com/GrosseLab/BGSC
Serum microRNA array analysis identifies miR-140-3p, miR-33b-3p and miR-671-3p as potential osteoarthritis biomarkers involved in metabolic processes.
Background: MicroRNAs (miRNAs) in circulation have emerged as promising biomarkers. In this study, we aimed to identify a circulating miRNA signature for osteoarthritis (OA) patients and in combination with bioinformatics analysis to evaluate the utility of selected differentially expressed miRNAs in the serum as potential OA biomarkers. Methods: Serum samples were collected from 12 primary OA patients, and 12 healthy individuals were screened using the Agilent Human miRNA Microarray platform interrogating 2549 miRNAs. Receiver Operating Characteristic (ROC) curves were constructed to evaluate the diagnostic performance of the deregulated miRNAs. Expression levels of selected miRNAs were validated by quantitative real-time PCR (qRT-PCR) in all serum and in articular cartilage samples from OA patients (n = 12) and healthy individuals (n = 7). Bioinformatics analysis was used to investigate the involved pathways and target genes for the above miRNAs. Results: We identified 279 differentially expressed miRNAs in the serum of OA patients compared to controls. Two hundred and five miRNAs (73.5%) were upregulated and 74 (26.5%) downregulated. ROC analysis revealed that 77 miRNAs had area under the curve (AUC) > 0.8 and p < 0.05. Bioinformatics analysis in the 77 miRNAs revealed that their target genes were involved in multiple signaling pathways associated with OA, among which FoxO, mTOR, Wnt, pI3K/akt, TGF-β signaling pathways, ECM-receptor interaction, and fatty acid biosynthesis. qRT-PCR validation in seven selected out of the 77 miRNAs revealed 3 significantly downregulated miRNAs (hsa-miR-33b-3p, hsa-miR-671-3p, and hsa-miR-140-3p) in the serum of OA patients, which were in silico predicted to be enriched in pathways involved in metabolic processes. Target-gene analysis of hsa-miR-140-3p, hsa-miR-33b-3p, and hsa-miR-671-3p revealed that InsR and IGFR1 were common targets of all three miRNAs, highlighting their involvement in regulation of metabolic processes that contribute to OA pathology. Hsa-miR-140-3p and hsa-miR-671-3p expression levels were consistently downregulated in articular cartilage of OA patients compared to healthy individuals. Conclusions: A serum miRNA signature was established for the first time using high density resolution miR-arrays in OA patients. We identified a three-miRNA signature, hsa-miR-140-3p, hsa-miR-671-3p, and hsa-miR-33b-3p, in the serum of OA patients, predicted to regulate metabolic processes, which could serve as a potential biomarker for the evaluation of OA risk and progression.Peer reviewedFinal Published versio
Stable Feature Selection for Biomarker Discovery
Feature selection techniques have been used as the workhorse in biomarker
discovery applications for a long time. Surprisingly, the stability of feature
selection with respect to sampling variations has long been under-considered.
It is only until recently that this issue has received more and more attention.
In this article, we review existing stable feature selection methods for
biomarker discovery using a generic hierarchal framework. We have two
objectives: (1) providing an overview on this new yet fast growing topic for a
convenient reference; (2) categorizing existing methods under an expandable
framework for future research and development
A constitutive active MAPK/ERK pathway due to BRAFV600E positively regulates AHR pathway in PTC
The aryl hydrocarbon receptor (AHR) is a ligand-activated transcription factor mediating the toxicity and tumor-promoting properties of dioxin. AHR has been reported to be overexpressed and constitutively active in a variety of solid tumors, but few data are currently available concerning its role in thyroid cancer. In this study we quantitatively explored a series of 51 paired-normal and papillary thyroid carcinoma (PTC) tissues for AHR-related genes. We identified an increased AHR expression/activity in PTC, independently from its nuclear dimerization partner and repressor but strictly related to a constitutive active MAPK/ERK pathway. The AHR up-regulation followed by an increased expression of AHR target genes was confirmed by a meta-analysis of published microarray data, suggesting a ligand-independent active AHR pathway in PTC. In-vitro studies using a PTC-derived cell line (BCPAP) and HEK293 cells showed that BRAF(V600E) may directly modulate AHR localization, induce AHR expression and activity in an exogenous ligand-independent manner. The AHR pathway might represent a potential novel therapeutic target for PTC in the clinical practice
- …