711 research outputs found

    Using emergent clustering methods to analyse short time series gene expression data from childhood leukemia treated with glucocorticoids

    Get PDF
    Acute lymphoblastic leukemia (ALL) causes the highest number of deaths from cancer in children aged between one and fourteen. The most common treatment for children with ALL is chemotherapy, a cancer treatment that uses drugs to kill cancer cells or stop cell division. The drug and dosage combinations may vary for each child. Unfortunately, chemotherapy treatments may cause serious side effects. Glucocorticoids (GCs) have been used as therapeutic agents for children with ALL for more than 50 years. Common and widely drugs in this class include prednisolone and dexamethasone. Childhood leukemia now has a survival rate of 80% (Pui, Robison, & Look, 2008). The key clinical question is identifying those children who will not respond well to established therapy strategies.GCs regulate diverse biological processes, for example, metabolism, development, differentiation, cell survival and immunity. GCs induce apoptosis and G1 cell cycle arrest in lymphoid cells. In fact, not much is known about the molecular mechanism of GCs sensitivity and resistance, and GCs-induced apoptotic signal transduction pathways and there are many controversial hypotheses about both genes regulated by GCs and potential molecular mechanism of GCs-induced apoptosis. Therefore, understanding the mechanism of this drug should lead to better prognostic factors (treatment response), more targeted therapies and prevention of side effects. GCs induced apoptosis have been studied by using microarray technology in vivo and in vitro on samples consisting of GCs treated ALL cell lines, mouse thymocytes and/or ALL patients. However, time series GCs treated childhood ALL datasets are currently extremely limited. DNA microarrays are essential tools for analysis of expression of many genes simultaneously. Gene expression data show the level of activity of several genes under experimental conditions. Genes with similar expression patterns could belong to the same pathway or have similar function. DNA microarray data analysis has been carried out using statistical analysis as well as machine learning and data mining approaches. There are many microarray analysis tools; this study aims to combine emergent clustering methods to get meaningful biological insights into mechanisms underlying GCs induced apoptosis. In this study, microarray data originated from prednisolone (glucocorticoids) treated childhood ALL samples (Schmidt et al., 2006) (B-linage and T-linage) and collected at 6 and 24 hours after treatment are analysed using four methods: Selforganizing maps (SOMs), Emergent self-organizing maps (ESOM) (Ultsch & Morchen, 2005), the Short Time series Expression Miner (STEM) (Ernst & Bar-Joseph, 2006) and Fuzzy clustering by Local Approximation of MEmbership (FLAME) (Fu & Medico, 2007). The results revealed intrinsic biological patterns underlying the GCs time series data: there are at least five different gene activities happening during the three time points; GCs-induced apoptotic genes were identified; and genes active at both time points or only at 6 hours or 24 hours were determined. Also, interesting gene clusters with membership in already known pathways were found thereby providing promising candidate gens for further inferring GCs induced apoptotic gene regulatory networks

    geneCBR: a translational tool for multiple-microarray analysis and integrative information retrieval for aiding diagnosis in cancer research

    Get PDF
    8 pages, 5 figures, 3 additional files.-- Software.[Background] Bioinformatics and medical informatics are two research fields that serve the needs of different but related communities. Both domains share the common goal of providing new algorithms, methods and technological solutions to biomedical research, and contributing to the treatment and cure of diseases. Although different microarray techniques have been successfully used to investigate useful information for cancer diagnosis at the gene expression level, the true integration of existing methods into day-to-day clinical practice is still a long way off. Within this context, case-based reasoning emerges as a suitable paradigm specially intended for the development of biomedical informatics applications and decision support systems, given the support and collaboration involved in such a translational development. With the goals of removing barriers against multi-disciplinary collaboration and facilitating the dissemination and transfer of knowledge to real practice, case-based reasoning systems have the potential to be applied to translational research mainly because their computational reasoning paradigm is similar to the way clinicians gather, analyze and process information in their own practice of clinical medicine.[Results] In addressing the issue of bridging the existing gap between biomedical researchers and clinicians who work in the domain of cancer diagnosis, prognosis and treatment, we have developed and made accessible a common interactive framework. Our geneCBR system implements a freely available software tool that allows the use of combined techniques that can be applied to gene selection, clustering, knowledge extraction and prediction for aiding diagnosis in cancer research. For biomedical researches, geneCBR expert mode offers a core workbench for designing and testing new techniques and experiments. For pathologists or oncologists, geneCBR diagnostic mode implements an effective and reliable system that can diagnose cancer subtypes based on the analysis of microarray data using a CBR architecture. For programmers, geneCBR programming mode includes an advanced edition module for run-time modification of previous coded techniques.[Conclusion] geneCBR is a new translational tool that can effectively support the integrative work of programmers, biomedical researches and clinicians working together in a common framework. The code is freely available under the GPL license and can be obtained at http://www.genecbr.org (webcite).This work is supported in part by the projects Research on Translational Bioinformatics (ref. 08VIB6) from University of Vigo and Development of computational tools for the classification and clustering of gene expression data in order to discover meaningful biological information in cancer diagnosis (ref. VA100A08) from JCyL (Spain). The work of D. Glez-Peña is supported by a "María Barbeito" contract from Xunta de Galicia.Peer reviewe

    Applying GCS Networks to Fuzzy Discretized Microarray Data for Tumour Diagnosis

    Get PDF
    Gene expression profiles belonging to DNA microarrays are composed of thousands of genes at the same time, representing the complex relationships between them. In this context, the ability of designing methods capable of overcoming current limitations is crucial to reduce the generalization error of state-of-the-art algorithms. This paper presents the application of a self-organised growing cell structures network in an attempt to cluster biological homogeneous patients. This technique makes use of a previous successful supervised fuzzy pattern algorithm capable of performing DNA microarray data reduction. The proposed model has been tested with microarray data belonging to bone marrow samples from 43 adult patients with cancer plus a group of six cases corresponding to healthy persons. The results of this work demonstrate that classical artificial intelligence techniques can be effectively used for tumour diagnosis working with high-dimensional microarray data

    gene-CBR: a case-based reasong tool for cancer diagnosis using microarray data sets

    Get PDF
    Gene expression profiles are composed of thousands of genes at the same time, representing the complex relationships between them. One of the well-known constraints specifically related to microarray data is the large number of genes in comparison with the small number of available experiments or cases. In this context, the ability of design methods capable of overcoming current limitations of state-of-the-art algorithms is crucial to the development of successful applications. This paper presentsgene-CBR, a hybrid model that can perform cancer classification based on microarray data. The system employs a case-based reasoning model that incorporates a set of fuzzy prototypes, a growing cell structure network and a set of rules to provide an accurate diagnosis. The hybrid model has been implemented and tested with microarray data belonging to bone marrow cases from forty-three adult patients with cancer plus a group of six cases corresponding to healthy persons

    Using Fuzzy Patterns for Gene Selection and Data Reduction on Microarray Data

    Get PDF
    The advent of DNA microarray technology has supplied a large volume of data to many fields like machine learning and data mining. Intelligent support is essential for managing and interpreting this great amount of information. One of the well-known constraints specifically related to microarray data is the large number of genes in comparison with the small number of available experiments. In this context, the ability of design methods capable of overcoming current limitations of state-of-the-art algorithms is crucial to the development of successful applications. In this paper we demonstrate how a supervised fuzzy pattern algorithm can be used to perform DNA microarray data reduction over real data. The benefits of our method can be employed to find biologically significant insights relating to meaningful genes in order to improve previous successful techniques. Experimental results on acute myeloid leukemia diagnosis show the effectiveness of the proposed approach

    Predicting Escherichia coli loads in cascading dams with machine learning: An integration of hydrometeorology, animal density and grazing pattern

    Get PDF
    Accurate prediction of Escherichia coli contamination in surface waters is challenging due to considerable uncertainty in the physical, chemical and biological variables that control E. coli occurrence and sources in surface waters. This study proposes a novel approach by integrating hydro-climatic variables as well as animal density and grazing pattern in the feature selection modeling phase to increase E. coli prediction accuracy for two cascading dams at the USMeat Animal Research Center (USMARC), Nebraska. Predictive models were developed using regression techniques and an artificial neural network (ANN). Two adaptive neuro-fuzzy inference system (ANFIS) structures including subtractive clustering and fuzzy c-means (FCM)clusteringwere also used to developmodels for predicting E. coli. The performances of the predictive models were evaluated and compared using root mean squared log error (RMSLE). Cross-validation and model performance results indicated that although themajority of models predicted E. coli accurately, ANFIS models resulted in fewer errors compared to the othermodels. The ANFISmodels have the potential to be used to predict E. coli concentration for intervention plans and monitoring programs for cascading dams, and to implement effective best management practices for grazing and irrigation during the growing season

    Development and Evolution of the Mammalian Cerebellum at Single Cell Resolution

    Get PDF
    Originally thought to only take part in motor control, the cerebellum emerged over the last decades as an important organ in various higher cognitive functions, such as learning and speech. Besides this, the cerebellum is associated to various diseases, such as spinocerebellar ataxia, autism spectrum disorder, and medulloblastoma. The basic structure and connective properties of it are well understood, but single-cell-technologies made it possible to study the cerebellum at higher resolution. Many questions about molecular details of its development and evolution are still not answered. Cerebella are present in all jawed vertebrates, though structural diversity is macroscopic and microscopic detectable, such as the number of deep nuclei, the presence of the vermis, or the mode of production of one of the most important cell types in the cerebellum - granule cells. Using single-nucleus RNA-sequencing (snRNA-seq) and bioinformatic approaches, I studied cerebellum data of human, mouse (Mus musculus) and opossum (Monodelphis domestica). The dataset contained samples spanning the organs development at high temporal resolution. It was possible to track the differentiation of the major cerebellar neuronal and glial cell types, as well as identify states and subtypes. This generated a comprehensive map of cellular complexity through eutherian (human and mouse) and marsupial (opossum) development. Leveraging the evolutionary distance of approximately 160 million years between the eutherian and marsupial lineage, conserved and diverged cell type marker genes could be identified which might be promising candidates for understanding the basic blueprint of cerebellar cell type identity. Stage correspondence mapping aligned the vastly different developmental time frames of the three studied species and allowed the identification of a two-fold increase in Purkinje cell progenitors in the human lineage, which might be connected to a recently identified human-specific secondary ventricular zone progenitor pool. It was possible to model the differentiation path of granule and Purkinje cells from early progenitors to mature neurons. Conserved and diverged gene expression trajectories were discovered. Using in vitro and in vivo intollerance scores, I could show that genes which are dynamically expressed during differentiation show higher functional constraint as non-dynamic genes, fitting to previous bulk-RNA-seq studies, showing similar results across the development of the full organ. Some orthologs with diverging patterns were disease-associated genes, which could have implications on clinical research on conditions like autism spectrum disorders and medulloblastoma. Furthermore, fundamental changes of gene expressions, established as gain or loss of expression within a cell type and species, were detected. Affected genes showed decreased functional constraint, verifying evolutionary principles on single cell scale. Taken together, this study shows the strength of state of the art methodology combined with high resolution developmental sampling in an evolution biological context to discover fundamental principles of organ development at single-cell scale

    Case-based reasoning as a decision support system for cancer diagnosis: A case study

    Get PDF
    Microarray technology can measure the expression levels of thousands of genes in an experiment. This fact makes the use of computational methods in cancer research absolutely essential. One of the possible applications is in the use of Artificial Intelligence techniques. Several of these techniques have been used to analyze expression arrays, but there is a growing need for new and effective solutions. This paper presents a Case-based reasoning (CBR) system for automatic classification of leukemia patients from microarray data. The system incorporates novel algorithms for data mining that allow filtering, classification, and knowledge extraction. The system has been tested and the results obtained are presented in this paper

    Computational Intelligence Techniques for Classification in Microarray Analysis

    Get PDF
    During the last few years there has been a growing need for using computational intelligence techniques to analyze microarray data. The aim of the system presented in this study is to provide innovative decision support techniques for classifying data from microarrays and for extracting knowledge about the classification process. The computational intelligence techniques used in this chapter follow the case-based reasoning paradigm to emulate the steps followed in expression analysis. This work presents a novel filtering technique based on statistical methods, a new clustering technique that uses ESOINN (Enhanced Self-Organizing Incremental Neuronal Network), and a knowledge extraction technique based on the RIPPER algorithm. The system presented within this chapter has been applied to classify CLL patients and extract knowledge about the classification process. The results obtained permit us to conclude that the system provides a notable reduction of the dimensionality of the data obtained from microarrays. Moreover, the classification process takes the detection of relevant and irrelevant probes into account, which is fundamental for subsequent classification and an extraction of knowledge tool with a graphical interface to explain the classification process, and has been much appreciated by the human experts. Finally, the philosophy of the CBR systems facilitates the resolution of new problems using past experiences, which is very appropriate regarding the classification of leukemia
    corecore