1,221 research outputs found
Experimental and computational applications of microarray technology for malaria eradication in Africa
Various mutation assisted drug resistance evolved in Plasmodium falciparum strains and insecticide
resistance to female Anopheles mosquito account for major biomedical catastrophes standing against
all efforts to eradicate malaria in Sub-Saharan Africa. Malaria is endemic in more than 100 countries and
by far the most costly disease in terms of human health causing major losses among many African
nations including Nigeria. The fight against malaria is failing and DNA microarray analysis need to keep
up the pace in order to unravel the evolving parasite’s gene expression profile which is a pointer to
monitoring the genes involved in malaria’s infective metabolic pathway. Huge data is generated and
biologists have the challenge of extracting useful information from volumes of microarray data.
Expression levels for tens of thousands of genes can be simultaneously measured in a single
hybridization experiment and are collectively called a “gene expression profile”. Gene expression
profiles can also be used in studying various state of malaria development in which expression profiles
of different disease states at different time points are collected and compared to each other to establish
a classifying scheme for purposes such as diagnosis and treatments with adequate drugs. This paper
examines microarray technology and its application as supported by appropriate software tools from
experimental set-up to the level of data analysis. An assessment of the level of microarray technology
in Africa, its availability and techniques required for malaria eradication and effective healthcare in
Nigeria and Africa in general were also underscored
B2G-FAR, a species-centered GO annotation repository
Motivation: Functional genomics research has expanded enormously in the last decade thanks to the cost reduction in high-throughput technologies and the development of computational tools that generate, standardize and share information on gene and protein function such as the Gene Ontology (GO). Nevertheless, many biologists, especially working with non-model organisms, still suffer from non-existing or low-coverage functional annotation, or simply struggle retrieving, summarizing and querying these data
Unsupervised assessment of microarray data quality using a Gaussian mixture model
<p>Abstract</p> <p>Background</p> <p>Quality assessment of microarray data is an important and often challenging aspect of gene expression analysis. This task frequently involves the examination of a variety of summary statistics and diagnostic plots. The interpretation of these diagnostics is often subjective, and generally requires careful expert scrutiny.</p> <p>Results</p> <p>We show how an unsupervised classification technique based on the Expectation-Maximization (EM) algorithm and the naïve Bayes model can be used to automate microarray quality assessment. The method is flexible and can be easily adapted to accommodate alternate quality statistics and platforms. We evaluate our approach using Affymetrix 3' gene expression and exon arrays and compare the performance of this method to a similar supervised approach.</p> <p>Conclusion</p> <p>This research illustrates the efficacy of an unsupervised classification approach for the purpose of automated microarray data quality assessment. Since our approach requires only unannotated training data, it is easy to customize and to keep up-to-date as technology evolves. In contrast to other "black box" classification systems, this method also allows for intuitive explanations.</p
MMpred: functional miRNA – mRNA interaction analyses by miRNA expression prediction
Background: MicroRNA (miRNA) directed gene repression is an important mechanism of posttranscriptional
regulation. Comprehensive analyses of how microRNA influence biological processes requires paired
miRNA-mRNA expression datasets. However, a review of both GEO and ArrayExpress repositories revealed few
such datasets, which was in stark contrast to the large number of messenger RNA (mRNA) only datasets. It is of
interest that numerous primary miRNAs (precursors of microRNA) are known to be co-expressed with coding
genes (host genes).
Results: We developed a miRNA-mRNA interaction analyses pipeline. The proposed solution is based on two
miRNA expression prediction methods – a scaling function and a linear model. Additionally, miRNA-mRNA anticorrelation
analyses are used to determine the most probable miRNA gene targets (i.e. the differentially
expressed genes under the influence of up- or down-regulated microRNA). Both the consistency and accuracy
of the prediction method is ensured by the application of stringent statistical methods. Finally, the predicted
targets are subjected to functional enrichment analyses including GO, KEGG and DO, to better understand the
predicted interactions.
Conclusions: The MMpred pipeline requires only mRNA expression data as input and is independent of third
party miRNA target prediction methods. The method passed extensive numerical validation based on the
binding energy between the mature miRNA and 3’ UTR region of the target gene. We report that MMpred is
capable of generating results similar to that obtained using paired datasets. For the reported test cases we
generated consistent output and predicted biological relationships that will help formulate further testable
hypotheses
An application in bioinformatics : a comparison of affymetrix and compugen human genome microarrays
The human genome microarrays from Compugen® and Affymetrix® were compared in the context of the emerging field of computational biology. The two premier database servers for genomic sequence data, the National Center for Biotechnology Information and the European Bioinformatics Institute, were described in detail. The various databases and data mining tools available through these data servers were also discussed. Microarrays were examined from a historical perspective and their main current applications-expression analysis, mutation analysis, and comparative genomic hybridization-were discussed. The two main types of microarrays, cDNA spotted microarrays and high-density spotted microarrays were analyzed by exploring the human genome microarray from Compugen® and the HGU133 Set from Affymetrix® respectively. Array design issues, sequence collection and analysis, and probe selection processes for the two representative types of arrays were described. The respective chip design of the two types of microarrays was also analyzed. It was found that the human genome microarray from Compugen 0 contains probes that interrogate 1,119,840 bases corresponding to 18,664 genes, while the HG-U133 Set from Affymetrix® contains probes that interrogate only 825,000 bases corresponding to 33,000 genes. Based on this, the efficiency of the 25-mer probes of the HG-U133 Set from Affymetrix® compared to the 60-mer probes of the microarray from Compugen® was questioned
An open-access long oligonucleotide microarray resource for analysis of the human and mouse transcriptomes
Two collections of oligonucleotides have been designed for preparing pangenomic human and mouse microarrays. A total of 148 993 and 121 703 oligonucleotides were designed against human and mouse transcripts. Quality scores were created in order to select 25 342 human and 24 109 mouse oligonucleotides. They correspond to: (i) a BLAST-specificity score; (ii) the number of expressed sequence tags matching each probe; (iii) the distance to the 3′ end of the target mRNA. Scores were also used to compare in silico the two microarrays with commercial microarrays. The sets described here, called RNG/MRC collections, appear at least as specific and sensitive as those from the commercial platforms. The RNG/MRC collections have now been used by an Anglo-French consortium to distribute more than 3500 microarrays to the academic community. Ad hoc identification of tissue-specific transcripts and a ∼80% correlation with hybridizations performed on Affymetrix GeneChip™ suggest that the RNG/MRC microarrays perform well. This work provides a comprehensive open resource for investigators working on human and mouse transcriptomes, as well as a generic method to generate new microarray collections in other organisms. All information related to these probes, as well as additional information about commercial microarrays have been stored in a freely-accessible database called MEDIANTE
Genome Alteration Print (GAP): a tool to visualize and mine complex cancer genomic profiles obtained by SNP arrays
GAP, a method for analyzing complex cancer genome profiles from SNP arrays, performs well even with poor quality data and rearranged genome
EzArray: A web-based highly automated Affymetrix expression array data management and analysis system
<p>Abstract</p> <p>Background</p> <p>Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand.</p> <p>Results</p> <p>EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data.</p> <p>Conclusion</p> <p>EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from <url>http://www.ezarray.com/</url>.</p
AIGO: towards a unified framework for the analysis and the inter-comparison of GO functional annotations
BACKGROUND: In response to the rapid growth of available genome sequences, efforts have been made to develop automatic inference methods to functionally characterize them. Pipelines that infer functional annotation are now routinely used to produce new annotations at a genome scale and for a broad variety of species. These pipelines differ widely in their inference algorithms, confidence thresholds and data sources for reasoning. This heterogeneity makes a comparison of the relative merits of each approach extremely complex. The evaluation of the quality of the resultant annotations is also challenging given there is often no existing gold-standard against which to evaluate precision and recall. RESULTS: In this paper, we present a pragmatic approach to the study of functional annotations. An ensemble of 12 metrics, describing various aspects of functional annotations, is defined and implemented in a unified framework, which facilitates their systematic analysis and inter-comparison. The use of this framework is demonstrated on three illustrative examples: analysing the outputs of state-of-the-art inference pipelines, comparing electronic versus manual annotation methods, and monitoring the evolution of publicly available functional annotations. The framework is part of the AIGO library (http://code.google.com/p/aigo) for the Analysis and the Inter-comparison of the products of Gene Ontology (GO) annotation pipelines. The AIGO library also provides functionalities to easily load, analyse, manipulate and compare functional annotations and also to plot and export the results of the analysis in various formats. CONCLUSIONS: This work is a step toward developing a unified framework for the systematic study of GO functional annotations. This framework has been designed so that new metrics on GO functional annotations can be added in a very straightforward way
- …