18,201 research outputs found

    Jeeva: Enterprise Grid-enabled Web Portal for Protein Secondary Structure Prediction

    Get PDF
    This paper presents a Grid portal for protein secondary structure prediction developed by using services of Aneka, a .NET-based enterprise Grid technology. The portal is used by research scientists to discover new prediction structures in a parallel manner. An SVM (Support Vector Machine)-based prediction algorithm is used with 64 sample protein sequences as a case study to demonstrate the potential of enterprise Grids.Comment: 7 page

    High Dimensional Classification with combined Adaptive Sparse PLS and Logistic Regression

    Get PDF
    Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection, which combined constitute a powerful framework for classification, as well as data visualization and interpretation. However, current proposed combinations lead to instable and non convergent methods due to inappropriate computational frameworks. We hereby propose a stable and convergent approach for classification in high dimensional based on sparse Partial Least Squares (sparse PLS). Results: We start by proposing a new solution for the sparse PLS problem that is based on proximal operators for the case of univariate responses. Then we develop an adaptive version of the sparse PLS for classification, which combines iterative optimization of logistic regression and sparse PLS to ensure convergence and stability. Our results are confirmed on synthetic and experimental data. In particular we show how crucial convergence and stability can be when cross-validation is involved for calibration purposes. Using gene expression data we explore the prediction of breast cancer relapse. We also propose a multicategorial version of our method on the prediction of cell-types based on single-cell expression data. Availability: Our approach is implemented in the plsgenomics R-package.Comment: 9 pages, 3 figures, 4 tables + Supplementary Materials 8 pages, 3 figures, 10 table

    Simulating non-small cell lung cancer with a multiscale agent-based model

    Get PDF
    Background The epidermal growth factor receptor (EGFR) is frequently overexpressed in many cancers, including non-small cell lung cancer (NSCLC). In silcio modeling is considered to be an increasingly promising tool to add useful insights into the dynamics of the EGFR signal transduction pathway. However, most of the previous modeling work focused on the molecular or the cellular level only, neglecting the crucial feedback between these scales as well as the interaction with the heterogeneous biochemical microenvironment. Results We developed a multiscale model for investigating expansion dynamics of NSCLC within a two-dimensional in silico microenvironment. At the molecular level, a specific EGFR-ERK intracellular signal transduction pathway was implemented. Dynamical alterations of these molecules were used to trigger phenotypic changes at the cellular level. Examining the relationship between extrinsic ligand concentrations, intrinsic molecular profiles and microscopic patterns, the results confirmed that increasing the amount of available growth factor leads to a spatially more aggressive cancer system. Moreover, for the cell closest to nutrient abundance, a phase-transition emerges where a minimal increase in extrinsic ligand abolishes the proliferative phenotype altogether. Conclusions Our in silico results indicate that, in NSCLC, in the presence of a strong extrinsic chemotactic stimulus, and depending on the cell's location, downstream EGFR-ERK signaling may be processed more efficiently, thereby yielding a migration-dominant cell phenotype and overall, an accelerated spatio-temporal expansion rate.Comment: 37 pages, 7 figure

    Mean-Field Theory of Meta-Learning

    Full text link
    We discuss here the mean-field theory for a cellular automata model of meta-learning. The meta-learning is the process of combining outcomes of individual learning procedures in order to determine the final decision with higher accuracy than any single learning method. Our method is constructed from an ensemble of interacting, learning agents, that acquire and process incoming information using various types, or different versions of machine learning algorithms. The abstract learning space, where all agents are located, is constructed here using a fully connected model that couples all agents with random strength values. The cellular automata network simulates the higher level integration of information acquired from the independent learning trials. The final classification of incoming input data is therefore defined as the stationary state of the meta-learning system using simple majority rule, yet the minority clusters that share opposite classification outcome can be observed in the system. Therefore, the probability of selecting proper class for a given input data, can be estimated even without the prior knowledge of its affiliation. The fuzzy logic can be easily introduced into the system, even if learning agents are build from simple binary classification machine learning algorithms by calculating the percentage of agreeing agents.Comment: 23 page
    • …
    corecore