4,727 research outputs found
maigesPack: A Computational Environment for Microarray Data Analysis
Microarray technology is still an important way to assess gene expression in
molecular biology, mainly because it measures expression profiles for thousands
of genes simultaneously, what makes this technology a good option for some
studies focused on systems biology. One of its main problem is complexity of
experimental procedure, presenting several sources of variability, hindering
statistical modeling. So far, there is no standard protocol for generation and
evaluation of microarray data. To mitigate the analysis process this paper
presents an R package, named maigesPack, that helps with data organization.
Besides that, it makes data analysis process more robust, reliable and
reproducible. Also, maigesPack aggregates several data analysis procedures
reported in literature, for instance: cluster analysis, differential
expression, supervised classifiers, relevance networks and functional
classification of gene groups or gene networks
The ROS wheel: refining ROS transcriptional footprints
In the last decade, microarray studies have delivered extensive inventories of transcriptome-wide changes in messenger RNA levels provoked by various types of oxidative stress in Arabidopsis (Arabidopsis thaliana). Previous cross-study comparisons indicated how different types of reactive oxygen species (ROS) and their subcellular accumulation sites are able to reshape the transcriptome in specific manners. However, these analyses often employed simplistic statistical frameworks that are not compatible with large-scale analyses. Here, we reanalyzed a total of 79 Affymetrix ATH1 microarray studies of redox homeostasis perturbation experiments. To create hierarchy in such a high number of transcriptomic data sets, all transcriptional profiles were clustered on the overlap extent of their differentially expressed transcripts. Subsequently, meta-analysis determined a single magnitude of differential expression across studies and identified common transcriptional footprints per cluster. The resulting transcriptional footprints revealed the regulation of various metabolic pathways and gene families. The RESPIRATORY BURST OXIDASE HOMOLOG F-mediated respiratory burst had a major impact and was a converging point among several studies. Conversely, the timing of the oxidative stress response was a determining factor in shaping different transcriptome footprints. Our study emphasizes the need to interpret transcriptomic data sets in a systematic context, where initial, specific stress triggers can converge to common, aspecific transcriptional changes. We believe that these refined transcriptional footprints provide a valuable resource for assessing the involvement of ROS in biological processes in plants
Integration and mining of malaria molecular, functional and pharmacological data: how far are we from a chemogenomic knowledge space?
The organization and mining of malaria genomic and post-genomic data is
highly motivated by the necessity to predict and characterize new biological
targets and new drugs. Biological targets are sought in a biological space
designed from the genomic data from Plasmodium falciparum, but using also the
millions of genomic data from other species. Drug candidates are sought in a
chemical space containing the millions of small molecules stored in public and
private chemolibraries. Data management should therefore be as reliable and
versatile as possible. In this context, we examined five aspects of the
organization and mining of malaria genomic and post-genomic data: 1) the
comparison of protein sequences including compositionally atypical malaria
sequences, 2) the high throughput reconstruction of molecular phylogenies, 3)
the representation of biological processes particularly metabolic pathways, 4)
the versatile methods to integrate genomic data, biological representations and
functional profiling obtained from X-omic experiments after drug treatments and
5) the determination and prediction of protein structures and their molecular
docking with drug candidate structures. Progresses toward a grid-enabled
chemogenomic knowledge space are discussed.Comment: 43 pages, 4 figures, to appear in Malaria Journa
Recommended from our members
Genetic analysis of a major international collection of cultivated apple varieties reveals previously unknown historic heteroploid and inbred relationships
Domesticated apple (Malus x domestica Borkh.) is a major global crop and the genetic diversity held within the pool of cultivated varieties is important for the development of future cultivars. The aim of this study was to investigate the diversity held within the domesticated form, through the analysis of a major international germplasm collection of cultivated varieties, the UK National Fruit Collection, consisting of over 2,000 selections of named cultivars and seedling varieties. We utilised Diversity Array Technology (DArT) markers to assess the genetic diversity within the collection. Clustering attempts, using the software STRUCTURE revealed that the accessions formed a complex and historically admixed group for which clear clustering was challenging. Comparison of accessions using the Jaccard similarity coefficient allowed us to identify clonal and duplicate material as well as revealing pairs and groups that appeared more closely related than a standard parent-offspring or full-sibling relations. From further investigation, we were able to propose a number of new pedigrees, which revealed that some historically important cultivars were more closely related than previously documented and that some of them were partially inbred. We were also able to elucidate a number of parent-offspring relationships that had resulted in a number of important polyploid cultivars. This included reuniting polyploid cultivars that in some cases dated as far back as the 18th century, with diploid parents that potentially date back as far as the 13th century
Modeling And Identification Of Differentially Regulated Genes Using Transcriptomics And Proteomics Data
Photosynthetic organisms are complex dynamical systems, showing a remarkable ability to adapt to different environmental conditions for their survival. Mechanisms underlying the coordination between different cellular processes in these organisms are still poorly understood. In this dissertation we utilize various computational and modeling techniques to analyze transcriptomics and proteomics data sets from several photosynthetic organisms. We try to use changes in expression levels of genes to study responses of these organisms to various environmental conditions such as availability of nutrients, concentrations of chemicals in growth media, and temperature. Three specific problems studied here are transcriptomics modifications in photosynthetic organisms under reduction-oxidation: redox) stress conditions, circadian and diurnal rhythms of cyanobacteria and the effect of incident light patterns on these rhythms, and the coordination between biological processes in cyanobacteria under various growth conditions. Under redox stresses caused by high light treatments, a strong transcriptomic level response, spread across many biological processes, is discovered in the cyanobacterium Synechocystis sp. PCC 6803. Based on statistical tests, expression levels of about 20% of genes in Synechocystis 6803 are identified as significantly affected due to influence of high light. Gene clustering methods reveal that these responses can mainly be classified as transient and consistent responses, depending on the duration of modified behaviors. Many genes related to energy production as well as energy utilization are shown to be strongly affected. Analysis of microarray data under two stress conditions, high light and DCMU treatment, combined with data mining and motif finding algorithms led to a discovery of novel transcription factor, RRTF1 that responds to redox stresses in Arabidopsis thaliana. Time course transcriptomics data from Cyanothece sp. ATCC 51142 have shown strong diurnal rhythms. By combining multiple experimental conditions and using gene classification algorithms based on Fourier scores and angular distances, it is shown that majority of the diurnal genes are in fact light responding. Only about 10% of genes in the genome are categorized as being circadian controlled. A transcription control model based on dynamical systems is employed to identify the interactions between diurnal genes. A phase oscillator network is proposed to model the behavior of different biological processes. Both these models are shown to carry biologically meaningful features. To study the coordination between different biological processes to various environment and genetic modifications, an interaction model is derived using Bayesian network approach, combining all publicly available microarray data sets for Synechocystis sp. PCC 6803. Several novel relationships between biological processes are discovered from the model. Model is used to simulate several experimental conditions, and the response of the model is shown to agree with the experimentally observed behaviors
Comparison of drought stress response and gene expression between a GM maize variety and a near-isogenic non-GM variety
Maize MON810, grown and commercialised worldwide, is the only cultivated GM event in the EU. Maize MON810, variety DKC6575, and the corresponding near-isogenic line Tietar were studied in different growth conditions, to compare their behaviour in response to drought. Main photosynthetic parameters were significantly affected by water stress in both GM and non –GM varieties to a similar extents. Though DKC6575 (GM) had a greater sensitivity in the early phase of stress response as compared with Tietar (non GM), after six days of stress they behaved similarly, and both varieties recovered from stress damage.
Profiling gene expression in water deficit regimes and in a generalised water stress condition showed an up-regulation of many stress- responsive genes, but a greater number of differentially expressed genes was observed in Tietar, with genes belonging to transcription factor families and genes encoding HSPs, LEAs and detoxification enzymes. Since induction of these genes have been indicated from the literature as typical of stress responses, their activation in Tietar rather than in DKC6575 may be reminiscent of a more efficient response to drought. DKC6575 was also analysed for the expression of the transgene CryIAb (encoding the delta-endotoxin insecticidal protein) in water deficit conditions. In all the experiments, the CryIAb transcript was not influenced by water stress, but was expressed at a constant level.. This suggests that though possessing a different pattern of sensitivity to stress, the GM variety maintains the same expression level for the transgene
- …