2,515 research outputs found

    Augmented Sparse Reconstruction of Protein Signaling Networks

    Full text link
    The problem of reconstructing and identifying intracellular protein signaling and biochemical networks is of critical importance in biology today. We sought to develop a mathematical approach to this problem using, as a test case, one of the most well-studied and clinically important signaling networks in biology today, the epidermal growth factor receptor (EGFR) driven signaling cascade. More specifically, we suggest a method, augmented sparse reconstruction, for the identification of links among nodes of ordinary differential equation (ODE) networks from a small set of trajectories with different initial conditions. Our method builds a system of representation by using a collection of integrals of all given trajectories and by attenuating block of terms in the representation itself. The system of representation is then augmented with random vectors, and minimization of the 1-norm is used to find sparse representations for the dynamical interactions of each node. Augmentation by random vectors is crucial, since sparsity alone is not able to handle the large error-in-variables in the representation. Augmented sparse reconstruction allows to consider potentially very large spaces of models and it is able to detect with high accuracy the few relevant links among nodes, even when moderate noise is added to the measured trajectories. After showing the performance of our method on a model of the EGFR protein network, we sketch briefly the potential future therapeutic applications of this approach.Comment: 24 pages, 6 figure

    Interval-valued analysis for discriminative gene selection and tissue sample classification using microarray data

    Get PDF
    AbstractAn important application of gene expression data is to classify samples in a variety of diagnostic fields. However, high dimensionality and a small number of noisy samples pose significant challenges to existing classification methods. Focused on the problems of overfitting and sensitivity to noise of the dataset in the classification of microarray data, we propose an interval-valued analysis method based on a rough set technique to select discriminative genes and to use these genes to classify tissue samples of microarray data. We first select a small subset of genes based on interval-valued rough set by considering the preference-ordered domains of the gene expression data, and then classify test samples into certain classes with a term of similar degree. Experiments show that the proposed method is able to reach high prediction accuracies with a small number of selected genes and its performance is robust to noise

    Towards Novel Nonparametric Statistical Methods and Bioinformatics Tools for Clinical and Translational Sciences

    Get PDF
    As the field of functional genetics and genomics is beginning to mature, we become confronted with new challenges. The constant drop in price for sequencing and gene expression profiling as well as the increasing number of genetic and genomic variables that can be measured makes it feasible to address more complex questions. The success with rare diseases caused by single loci or genes has provided us with a proof-of-concept that new therapies can be developed based on functional genomics and genetics. Common diseases, however, typically involve genetic epistasis, genomic pathways, and proteomic pattern. Moreover, to better understand the underlying biologi-cal systems, we often need to integrate information from several of these sources. Thus, as the field of clinical research moves toward complex diseases, the demand for modern data base systems and advanced statistical methods increases. The traditional statistical methods implemented in most of the bioinformatics tools currently used in the novel field of genetics and functional genomics are based on the linear model and, thus, have shortcomings when applied to nonlinear biological systems. The previous work on partially ordered data (Wittkowski 1988; 1992), when combined with theoretical results (Hoeffding 1948) and computational strategies (Deuchler 1914) has opened a new field of nonparametric statistics. With grid technology, new tools are now feasible when screening for interactions between genetics (Wittkowski, Liu 2002) and functional genomics (Wittkowski, Lee 2004). Having more complex study designs and more specific methods available increases the demand for decision support when selecting appropriate bioinformatics tools. With the advent of rapid prototyping systems for Web based database application, we have recently begun to complement previous work on knowledge based systems with graphical Web-based tools for acquisition of DESIGN and MODEL knowledge.Biostatistics Bioinformatics NIH NCRR ROADMAP

    Natural variation in abiotic stress responsive gene expression and local adaptation to climate in Arabidopsis thaliana.

    Get PDF
    Gene expression varies widely in natural populations, yet the proximate and ultimate causes of this variation are poorly known. Understanding how variation in gene expression affects abiotic stress tolerance, fitness, and adaptation is central to the field of evolutionary genetics. We tested the hypothesis that genes with natural genetic variation in their expression responses to abiotic stress are likely to be involved in local adaptation to climate in Arabidopsis thaliana. Specifically, we compared genes with consistent expression responses to environmental stress (expression stress responsive, "eSR") to genes with genetically variable responses to abiotic stress (expression genotype-by-environment interaction, "eGEI"). We found that on average genes that exhibited eGEI in response to drought or cold had greater polymorphism in promoter regions and stronger associations with climate than those of eSR genes or genomic controls. We also found that transcription factor binding sites known to respond to environmental stressors, especially abscisic acid responsive elements, showed significantly higher polymorphism in drought eGEI genes in comparison to eSR genes. By contrast, eSR genes tended to exhibit relatively greater pairwise haplotype sharing, lower promoter diversity, and fewer nonsynonymous polymorphisms, suggesting purifying selection or selective sweeps. Our results indicate that cis-regulatory evolution and genetic variation in stress responsive gene expression may be important mechanisms of local adaptation to climatic selective gradients

    Memetic micro-genetic algorithms for cancer data classification

    Get PDF
    Fast and precise medical diagnosis of human cancer is crucial for treatment decisions. Gene selection consists of identifying a set of informative genes from microarray data to allow high predictive accuracy in human cancer classification. This task is a combinatorial search problem, and optimisation methods can be applied for its resolution. In this paper, two memetic micro-genetic algorithms (MμV1 and MμV2) with different hybridisation approaches are proposed for feature selection of cancer microarray data. Seven gene expression datasets are used for experimentation. The comparison with stochastic state-of-the-art optimisation techniques concludes that problem-dependent local search methods combined with micro-genetic algorithms improve feature selection of cancer microarray data.Fil: Rojas, Matias Gabriel. Universidad Nacional de Lujan. Centro de Investigacion Docencia y Extension En Tecnologias de la Informacion y Las Comunicaciones.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mendoza; ArgentinaFil: Olivera, Ana Carolina. Universidad Nacional de Cuyo. Facultad de Ingeniería; Argentina. Universidad Nacional de Lujan. Centro de Investigacion Docencia y Extension En Tecnologias de la Informacion y Las Comunicaciones.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mendoza; ArgentinaFil: Carballido, Jessica Andrea. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Bahía Blanca. Instituto de Ciencias e Ingeniería de la Computación; ArgentinaFil: Vidal, Pablo Javier. Universidad Nacional de Cuyo. Facultad de Ingeniería; Argentina. Universidad Nacional del Sur. Departamento de Ciencias e Ingeniería de la Computación; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mendoza; Argentin

    Genetic Effects at Pleiotropic Loci Are Context-Dependent with Consequences for the Maintenance of Genetic Variation in Populations

    Get PDF
    Context-dependent genetic effects, including genotype-by-environment and genotype-by-sex interactions, are a potential mechanism by which genetic variation of complex traits is maintained in populations. Pleiotropic genetic effects are also thought to play an important role in evolution, reflecting functional and developmental relationships among traits. We examine context-dependent genetic effects at pleiotropic loci associated with normal variation in multiple metabolic syndrome (MetS) components (obesity, dyslipidemia, and diabetes-related traits). MetS prevalence is increasing in Western societies and, while environmental in origin, presents substantial variation in individual response. We identify 23 pleiotropic MetS quantitative trait loci (QTL) in an F16 advanced intercross between the LG/J and SM/J inbred mouse strains (Wustl:LG,SM-G16; n = 1002). Half of each family was fed a high-fat diet and half fed a low-fat diet; and additive, dominance, and parent-of-origin imprinting genotypic effects were examined in animals partitioned into sex, diet, and sex-by-diet cohorts. We examine the context-dependency of the underlying additive, dominance, and imprinting genetic effects of the traits associated with these pleiotropic QTL. Further, we examine sequence polymorphisms (SNPs) between LG/J and SM/J as well as differential expression of positional candidate genes in these regions. We show that genetic associations are different in different sex, diet, and sex-by-diet settings. We also show that over- or underdominance and ecological cross-over interactions for single phenotypes may not be common, however multidimensional synthetic phenotypes at loci with pleiotropic effects can produce situations that favor the maintenance of genetic variation in populations. Our findings have important implications for evolution and the notion of personalized medicine

    Spatiotemporal dynamics of the postnatal developing primate brain transcriptome.

    Get PDF
    Developmental changes in the temporal and spatial regulation of gene expression drive the emergence of normal mature brain function, while disruptions in these processes underlie many neurodevelopmental abnormalities. To solidify our foundational knowledge of such changes in a primate brain with an extended period of postnatal maturation like in human, we investigated the whole-genome transcriptional profiles of rhesus monkey brains from birth to adulthood. We found that gene expression dynamics are largest from birth through infancy, after which gene expression profiles transition to a relatively stable state by young adulthood. Biological pathway enrichment analysis revealed that genes more highly expressed at birth are associated with cell adhesion and neuron differentiation, while genes more highly expressed in juveniles and adults are associated with cell death. Neocortex showed significantly greater differential expression over time than subcortical structures, and this trend likely reflects the protracted postnatal development of the cortex. Using network analysis, we identified 27 co-expression modules containing genes with highly correlated expression patterns that are associated with specific brain regions, ages or both. In particular, one module with high expression in neonatal cortex and striatum that decreases during infancy and juvenile development was significantly enriched for autism spectrum disorder (ASD)-related genes. This network was enriched for genes associated with axon guidance and interneuron differentiation, consistent with a disruption in the formation of functional cortical circuitry in ASD

    Biclustering on expression data: A review

    Get PDF
    Biclustering has become a popular technique for the study of gene expression data, especially for discovering functionally related gene sets under different subsets of experimental conditions. Most of biclustering approaches use a measure or cost function that determines the quality of biclusters. In such cases, the development of both a suitable heuristics and a good measure for guiding the search are essential for discovering interesting biclusters in an expression matrix. Nevertheless, not all existing biclustering approaches base their search on evaluation measures for biclusters. There exists a diverse set of biclustering tools that follow different strategies and algorithmic concepts which guide the search towards meaningful results. In this paper we present a extensive survey of biclustering approaches, classifying them into two categories according to whether or not use evaluation metrics within the search method: biclustering algorithms based on evaluation measures and non metric-based biclustering algorithms. In both cases, they have been classified according to the type of meta-heuristics which they are based on.Ministerio de Economía y Competitividad TIN2011-2895
    corecore