2,515 research outputs found
Recommended from our members
Genomic regions, cellular components and gene regulatory basis underlying pod length variations in cowpea (V. unguiculata L. Walp).
Cowpea (V. unguiculata L. Walp) is a climate resilient legume crop important for food security. Cultivated cowpea (V. unguiculata L) generally comprises the bushy, short-podded grain cowpea dominant in Africa and the climbing, long-podded vegetable cowpea popular in Asia. How selection has contributed to the diversification of the two types of cowpea remains largely unknown. In the current study, a novel genotyping assay for over 50 000 SNPs was employed to delineate genomic regions governing pod length. Major, minor and epistatic QTLs were identified through QTL mapping. Seventy-two SNPs associated with pod length were detected by genome-wide association studies (GWAS). Population stratification analysis revealed subdivision among a cowpea germplasm collection consisting of 299 accessions, which is consistent with pod length groups. Genomic scan for selective signals suggested that domestication of vegetable cowpea was accompanied by selection of multiple traits including pod length, while the further improvement process was featured by selection of pod length primarily. Pod growth kinetics assay demonstrated that more durable cell proliferation rather than cell elongation or enlargement was the main reason for longer pods. Transcriptomic analysis suggested the involvement of sugar, gibberellin and nutritional signalling in regulation of pod length. This study establishes the basis for map-based cloning of pod length genes in cowpea and for marker-assisted selection of this trait in breeding programmes
Augmented Sparse Reconstruction of Protein Signaling Networks
The problem of reconstructing and identifying intracellular protein signaling
and biochemical networks is of critical importance in biology today. We sought
to develop a mathematical approach to this problem using, as a test case, one
of the most well-studied and clinically important signaling networks in biology
today, the epidermal growth factor receptor (EGFR) driven signaling cascade.
More specifically, we suggest a method, augmented sparse reconstruction, for
the identification of links among nodes of ordinary differential equation (ODE)
networks from a small set of trajectories with different initial conditions.
Our method builds a system of representation by using a collection of integrals
of all given trajectories and by attenuating block of terms in the
representation itself. The system of representation is then augmented with
random vectors, and minimization of the 1-norm is used to find sparse
representations for the dynamical interactions of each node. Augmentation by
random vectors is crucial, since sparsity alone is not able to handle the large
error-in-variables in the representation. Augmented sparse reconstruction
allows to consider potentially very large spaces of models and it is able to
detect with high accuracy the few relevant links among nodes, even when
moderate noise is added to the measured trajectories. After showing the
performance of our method on a model of the EGFR protein network, we sketch
briefly the potential future therapeutic applications of this approach.Comment: 24 pages, 6 figure
Interval-valued analysis for discriminative gene selection and tissue sample classification using microarray data
AbstractAn important application of gene expression data is to classify samples in a variety of diagnostic fields. However, high dimensionality and a small number of noisy samples pose significant challenges to existing classification methods. Focused on the problems of overfitting and sensitivity to noise of the dataset in the classification of microarray data, we propose an interval-valued analysis method based on a rough set technique to select discriminative genes and to use these genes to classify tissue samples of microarray data. We first select a small subset of genes based on interval-valued rough set by considering the preference-ordered domains of the gene expression data, and then classify test samples into certain classes with a term of similar degree. Experiments show that the proposed method is able to reach high prediction accuracies with a small number of selected genes and its performance is robust to noise
Towards Novel Nonparametric Statistical Methods and Bioinformatics Tools for Clinical and Translational Sciences
As the field of functional genetics and genomics is beginning to mature, we become confronted with new challenges. The constant drop in price for sequencing and gene expression profiling as well as the increasing number of genetic and genomic variables that can be measured makes it feasible to address more complex questions. The success with rare diseases caused by single loci or genes has provided us with a proof-of-concept that new therapies can be developed based on functional genomics and genetics. Common diseases, however, typically involve genetic epistasis, genomic pathways, and proteomic pattern. Moreover, to better understand the underlying biologi-cal systems, we often need to integrate information from several of these sources. Thus, as the field of clinical research moves toward complex diseases, the demand for modern data base systems and advanced statistical methods increases. The traditional statistical methods implemented in most of the bioinformatics tools currently used in the novel field of genetics and functional genomics are based on the linear model and, thus, have shortcomings when applied to nonlinear biological systems. The previous work on partially ordered data (Wittkowski 1988; 1992), when combined with theoretical results (Hoeffding 1948) and computational strategies (Deuchler 1914) has opened a new field of nonparametric statistics. With grid technology, new tools are now feasible when screening for interactions between genetics (Wittkowski, Liu 2002) and functional genomics (Wittkowski, Lee 2004). Having more complex study designs and more specific methods available increases the demand for decision support when selecting appropriate bioinformatics tools. With the advent of rapid prototyping systems for Web based database application, we have recently begun to complement previous work on knowledge based systems with graphical Web-based tools for acquisition of DESIGN and MODEL knowledge.Biostatistics Bioinformatics NIH NCRR ROADMAP
Natural variation in abiotic stress responsive gene expression and local adaptation to climate in Arabidopsis thaliana.
Gene expression varies widely in natural populations, yet the proximate and ultimate causes of this variation are poorly known. Understanding how variation in gene expression affects abiotic stress tolerance, fitness, and adaptation is central to the field of evolutionary genetics. We tested the hypothesis that genes with natural genetic variation in their expression responses to abiotic stress are likely to be involved in local adaptation to climate in Arabidopsis thaliana. Specifically, we compared genes with consistent expression responses to environmental stress (expression stress responsive, "eSR") to genes with genetically variable responses to abiotic stress (expression genotype-by-environment interaction, "eGEI"). We found that on average genes that exhibited eGEI in response to drought or cold had greater polymorphism in promoter regions and stronger associations with climate than those of eSR genes or genomic controls. We also found that transcription factor binding sites known to respond to environmental stressors, especially abscisic acid responsive elements, showed significantly higher polymorphism in drought eGEI genes in comparison to eSR genes. By contrast, eSR genes tended to exhibit relatively greater pairwise haplotype sharing, lower promoter diversity, and fewer nonsynonymous polymorphisms, suggesting purifying selection or selective sweeps. Our results indicate that cis-regulatory evolution and genetic variation in stress responsive gene expression may be important mechanisms of local adaptation to climatic selective gradients
Memetic micro-genetic algorithms for cancer data classification
Fast and precise medical diagnosis of human cancer is crucial for treatment decisions. Gene selection consists of identifying a set of informative genes from microarray data to allow high predictive accuracy in human cancer classification. This task is a combinatorial search problem, and optimisation methods can be applied for its resolution. In this paper, two memetic micro-genetic algorithms (MμV1 and MμV2) with different hybridisation approaches are proposed for feature selection of cancer microarray data. Seven gene expression datasets are used for experimentation. The comparison with stochastic state-of-the-art optimisation techniques concludes that problem-dependent local search methods combined with micro-genetic algorithms improve feature selection of cancer microarray data.Fil: Rojas, Matias Gabriel. Universidad Nacional de Lujan. Centro de Investigacion Docencia y Extension En Tecnologias de la Informacion y Las Comunicaciones.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mendoza; ArgentinaFil: Olivera, Ana Carolina. Universidad Nacional de Cuyo. Facultad de Ingeniería; Argentina. Universidad Nacional de Lujan. Centro de Investigacion Docencia y Extension En Tecnologias de la Informacion y Las Comunicaciones.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mendoza; ArgentinaFil: Carballido, Jessica Andrea. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Bahía Blanca. Instituto de Ciencias e Ingeniería de la Computación; ArgentinaFil: Vidal, Pablo Javier. Universidad Nacional de Cuyo. Facultad de Ingeniería; Argentina. Universidad Nacional del Sur. Departamento de Ciencias e Ingeniería de la Computación; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mendoza; Argentin
Genetic Effects at Pleiotropic Loci Are Context-Dependent with Consequences for the Maintenance of Genetic Variation in Populations
Context-dependent genetic effects, including genotype-by-environment and genotype-by-sex interactions, are a potential mechanism by which genetic variation of complex traits is maintained in populations. Pleiotropic genetic effects are also thought to play an important role in evolution, reflecting functional and developmental relationships among traits. We examine context-dependent genetic effects at pleiotropic loci associated with normal variation in multiple metabolic syndrome (MetS) components (obesity, dyslipidemia, and diabetes-related traits). MetS prevalence is increasing in Western societies and, while environmental in origin, presents substantial variation in individual response. We identify 23 pleiotropic MetS quantitative trait loci (QTL) in an F16 advanced intercross between the LG/J and SM/J inbred mouse strains (Wustl:LG,SM-G16; n = 1002). Half of each family was fed a high-fat diet and half fed a low-fat diet; and additive, dominance, and parent-of-origin imprinting genotypic effects were examined in animals partitioned into sex, diet, and sex-by-diet cohorts. We examine the context-dependency of the underlying additive, dominance, and imprinting genetic effects of the traits associated with these pleiotropic QTL. Further, we examine sequence polymorphisms (SNPs) between LG/J and SM/J as well as differential expression of positional candidate genes in these regions. We show that genetic associations are different in different sex, diet, and sex-by-diet settings. We also show that over- or underdominance and ecological cross-over interactions for single phenotypes may not be common, however multidimensional synthetic phenotypes at loci with pleiotropic effects can produce situations that favor the maintenance of genetic variation in populations. Our findings have important implications for evolution and the notion of personalized medicine
Spatiotemporal dynamics of the postnatal developing primate brain transcriptome.
Developmental changes in the temporal and spatial regulation of gene expression drive the emergence of normal mature brain function, while disruptions in these processes underlie many neurodevelopmental abnormalities. To solidify our foundational knowledge of such changes in a primate brain with an extended period of postnatal maturation like in human, we investigated the whole-genome transcriptional profiles of rhesus monkey brains from birth to adulthood. We found that gene expression dynamics are largest from birth through infancy, after which gene expression profiles transition to a relatively stable state by young adulthood. Biological pathway enrichment analysis revealed that genes more highly expressed at birth are associated with cell adhesion and neuron differentiation, while genes more highly expressed in juveniles and adults are associated with cell death. Neocortex showed significantly greater differential expression over time than subcortical structures, and this trend likely reflects the protracted postnatal development of the cortex. Using network analysis, we identified 27 co-expression modules containing genes with highly correlated expression patterns that are associated with specific brain regions, ages or both. In particular, one module with high expression in neonatal cortex and striatum that decreases during infancy and juvenile development was significantly enriched for autism spectrum disorder (ASD)-related genes. This network was enriched for genes associated with axon guidance and interneuron differentiation, consistent with a disruption in the formation of functional cortical circuitry in ASD
Biclustering on expression data: A review
Biclustering has become a popular technique for the study of gene expression data, especially for discovering functionally related gene sets under different subsets of experimental conditions. Most of biclustering approaches use a measure or cost function that determines the quality of biclusters. In such cases, the development of both a suitable heuristics and a good measure for guiding the search are essential for discovering interesting biclusters in an expression matrix. Nevertheless, not all existing biclustering approaches base their search on evaluation measures for biclusters. There exists a diverse set of biclustering tools that follow different strategies and algorithmic concepts which guide the search towards meaningful results. In this paper we present a extensive survey of biclustering approaches, classifying them into two categories according to whether or not use evaluation metrics within the search method: biclustering algorithms based on evaluation measures and non metric-based biclustering algorithms. In both cases, they have been classified according to the type of meta-heuristics which they are based on.Ministerio de Economía y Competitividad TIN2011-2895
- …