9 research outputs found

    Classification of Time Series Gene Expression in Clinical Studies via Integration of Biological Network

    No full text
    The increasing availability of time series expression datasets, although promising, raises a number of new computational challenges. Accordingly, the development of suitable classification methods that make reliable and sound predictions is becoming a pressing issue. Here we propose a new method to classify time series gene expression via integration of biological networks. We evaluated our approach on two different datasets and showed that the use of a hidden Markov model/Gaussian mixture model hybrid explores the time-dependence of the expression data, thereby leading to better prediction results. We demonstrated that the biclustering procedure identifies function-related genes as a whole, giving rise to high accordance in prognosis prediction across independent time series datasets. In addition, we showed that integration of biological networks into our method significantly improves prediction performance. Moreover, we compared our approach with several state-of-the-art algorithms and found that our method outperformed previous approaches with regard to various criteria. Finally, our approach achieved better prediction results on early-stage data, implying the potential of our method for practical prediction.

    Classification accuracies of different discretization methods for Baranzini dataset and Goertsches dataset: average (AVG) and standard deviation (SD).

    No full text

    Classification accuracies of distinct classification methods for Baranzini dataset and Goertsches dataset: average (AVG) and standard deviation (SD).

    No full text

    Precision, Recall and F-measure of different classification approaches.

    No full text
    The bars and error ticks represent mean values and standard deviations, respectively. (A) shows the result for the Baranzini dataset. (B) shows the result for the Goertsches dataset.

    Classification accuracies of PPI-SVM-KNN with the change of parameter C.

    No full text
    The bars and error ticks represent mean values and standard deviations, respectively. (A) shows the result for the Baranzini dataset. (B) shows the result for the Goertsches dataset.

    Schematic overview of classification of time series gene expression.

    No full text
    The prediction process primarily consists of 4 or 5 steps. First, gene states are inferred by an HMM/GMM hybrid model. Second, the QL-biclustering algorithm extracts the biclusters of every patient from the gene state matrix. Third, every bicluster is scored according to the connections among its genes in the protein-protein interaction (PPI) network. Finally, the label of every test patient is predicted by PPI-SVM-KNN based on patient similarity, which takes into account both bicluster similarity and the bicluster's PPIScore.

    Randomly selected biclustering examples from Baranzini dataset and Goertsches dataset.

    No full text
    The expression values of genes in each bicluster are shown in (A) and (B). The state transitions of genes in each bicluster are shown in (C) and (D). The bicluster from the Baranzini dataset consists of genes ITGAL and ITGB1 and their state transitions from time point 1 to time point 7. The bicluster from the Goertsches dataset consists of genes CASP5 and CASP1 and their state transitions from time point 1 to time point 3.

    Prediction accuracies of different classification approaches with the change of measurements.

    No full text
    The points in the figure represent mean values. (A) shows the accuracies from time point 3 to time point 7 for the Baranzini dataset. (B) shows the accuracies from time point 3 to time point 5 for the Goertsches dataset.

    Classification accuracies of PPI-SVM-KNN with the change of parameter K from 3 to 9.

    No full text
    The bars and error ticks represent mean values and standard deviations, respectively. (A) shows the result for the Baranzini dataset. (B) shows the result for the Goertsches dataset.