47 research outputs found
Salvaging Affymetrix probes after probe-level re-annotation
Background: Affymetrix GeneChips can be re-annotated at the probe-level by breaking up the original probe-sets and recomposing new probe-sets based on up-to-date genomic knowledge, such as available in Entrez Gene. This results in custom Chip Description Files (CDF). Using these custom CDFs improves the quality of the data and thus the results of related gene expression studies. However, 44-71% of the probes on a GeneChip are lost in this re-annotation process. Although generally aimed at less known genes, losing these probes obviously means a substantial loss of expensive experiment data. Biologists are therefore very reluctant to adopt this approach. Findings: We aimed to re-introduce the non-affected Affymetrix probe-sets after these re-annotation procedures. For this, we developed an algorithm (CDF-Merger) and applied it to standard Affymetrix CDFs and custom Brainarray CDFs to obtain Hybrid CDFs. Thus, salvaging lost Affymetrix probes with our CDF-Merger restored probe content up to 94%. Because the salvaged probes (up to 54% of the probe content on the arrays) represent less-reliable probe-sets, we made the origin of all probe-set definitions traceable, so biologists can choose at any time in their analyses, which subset of probe-sets they want to use. Conclusion: The availability of up-to-date Hybrid CDFs plus R environment allows for easy implementation of our approach
SigWinR; the SigWin-detector updated and ported to R
<p>Abstract</p> <p>Background</p> <p>Our SigWin-detector discovers significantly enriched windows of (genomic) elements in any sequence of values (genes or other genomic elements in a DNA sequence) in a fast and reproducible way. However, since it is grid based, only (life) scientists with access to the grid can use this tool. Therefore and on request, we have developed the SigWinR package which makes the SigWin-detector available to a much wider audience. At the same time, we have introduced several improvements to its algorithm as well as its functionality, based on the feedback of SigWin-detector end users.</p> <p>Findings</p> <p>To allow usage of the SigWin-detector on a desktop computer, we have rewritten it as a package for R: SigWinR. R is a free and widely used multi platform software environment for statistical computing and graphics. The package can be installed and used on all platforms for which R is available. The improvements involve: a visualization of the input-sequence values supporting the interpretation of Ridgeograms; a visualization allowing for an easy interpretation of enriched or depleted regions in the sequence using windows of pre-defined size; an option that allows the analysis of circular sequences, which results in rectangular Ridgeograms; an application to identify regions of co-altered gene expression (ROCAGEs) with a real-life biological use-case; adaptation of the algorithm to allow analysis of non-regularly sampled data using a constant window size in physical space without resampling the data. To achieve this, support for analysis of windows with an even number of elements was added.</p> <p>Conclusion</p> <p>By porting the SigWin-detector as an R package, SigWinR, improving its algorithm and functionality combined with adequate performance, we have made SigWin-detector more useful as well as more easily accessible to scientists without a grid infrastructure.</p
OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
Background - High throughput gene expression studies using oligonucleotide microarrays depend on the specificity of each oligonucleotide (oligo or probe) for its target gene. However, target specific probes can only be designed when a reference genome of the species at hand were completely sequenced, when this genome were completely annotated and when the genetic variation of the sampled individuals were completely known. Unfortunately there is not a single species for which such a complete data set is available. Therefore, it is important that probe annotation can be updated frequently for optimal interpretation of microarray experiments. Results - In this paper we present OligoRAP, a pipeline to automatically update the annotation of oligo libraries and estimate oligo target specificity. OligoRAP uses a reference genome assembly with Ensembl and Entrez Gene annotation supplemented with a set of unmapped transcripts derived from RefSeq and UniGene to handle assembly gaps. OligoRAP produces alignments of each oligo with the reference assembly as well as with unmapped transcripts. These alignments are re-mapped to the annotation sources, which results in a concise, as complete as possible and up-to-date annotation of the oligo library. The building blocks of this pipeline are BioMoby web services creating a highly modular and distributed system with a robust, remote programmatic interface. OligoRAP was used to update the annotation for a subset of 791 oligos from the ARK-Genomics 20 K chicken array, which were selected as starting material for the oligo annotation session of the EADGENE/SABRE Post-analysis workshop. Based on the updated annotation about one third of these oligos is problematic with regard to target specificity. In addition, the accession numbers or ids the oligos were originally designed for no longer exist in the updated annotation for almost half of the oligos. Conclusion - As microarrays are designed on incomplete data, it is important to update probe annotation and check target specificity regularly. OligoRAP provides both and due to its design based on BioMoby web services it can easily be embedded as an oligo annotation engine in customised applications for microarray data analysis. The dramatic difference in updated annotation and target specificity for the ARK-Genomics 20 K chicken array as compared to the original data emphasises the need for regular updates
RNA isolation method for single embryo transcriptome analysis in zebrafish
Background: Transcriptome analysis during embryogenesis usually requires pooling of embryos to obtain sufficient RNA. Hence, the measured levels of gene-expression represent the average mRNA levels of pooled samples and the biological variation among individuals is confounded. This can irreversibly reduce the robustness, resolution, or expressiveness of the experiment. Therefore, we developed a robust method to isolate abundant high-quality RNA from individual embryos to perform single embryo transcriptome analyses using zebrafish as a model organism. Available methods for embryonic zebrafish RNA isolation minimally utilize ten embryos. Further downscaling of these methods to one embryo is practically not feasible. Findings: We developed a single embryo RNA extraction method based on sample homogenization in liquid nitrogen, RNA extraction with phenol and column purification. Evaluation of this method showed that: the quality of the RNA was very good with an average RIN value of 8.3-8.9; the yield was always ≥ 200 ng RNA per embryo; the method was applicable to all stages of zebrafish embryogenesis; the success rate was almost 100%; and the extracted RNA performed excellent in microarray experiments in that the technical variation was much lower than the biological variation. Conclusions: Presented is a high-quality, robust RNA isolation method. Obtaining sufficient RNA from single embryos eliminates the necessity of sample pooling and its associated drawbacks. Although our RNA isolation method has been setup for transcriptome analysis in zebrafish, it can also be used for other model systems and other applications like (q)PCR and transcriptome sequencing
Using R in Taverna: RShell v1.2
Background: R is the statistical language commonly used by many life scientists in (omics) data by the open source workflow management system Taverna. However, Taverna had limited support for R, because it supported just a few data types and only a single output. Also, there was no support for graphical output and persistent sessions. Altogether this made using R in Taverna impractical.\ud
\ud
Findings: We have developed an R plugin for Taverna: RShell, which provides R functionality within workflows designed in Taverna. In order to fully support the R language, our RShell plugin directly uses the R interpreter. The RShell plugin consists of a Taverna processor for R scripts and an RShell Session Manager that communicates with the R server. We made the RShell processor highly configurable allowing the user to define multiple inputs and outputs. Also, various data types are supported, such as strings, numeric data and images. To limit data transport between multiple RShell processors, the RShell plugin also supports persistent sessions. Here, we will describe the architecture of RShell and the new features that are introduced in version 1.2, i.e.: i) Support for R up to and including R version 2.9; ii) Support for persistent sessions to limit data transfer; iii) Support for vector graphics output through PDF; iv) Syntax highlighting of the R code; v) Improved usability through fewer port types. Our new RShell processor is backwards compatible with workflows that use older versions of the RShell processor. We demonstrate the value of the RShell processor by a use-case workflow that maps oligonucleotide probes designed with DNA sequence information from Vega onto the Ensembl genome assembly.\ud
\ud
Conclusion: Our RShell plugin enables Taverna users to employ R scripts within their workflows in a highly configurable way
Smart Environments for Collaborative Design, Implementation, and Interpretation of Scientific Experiments
Ambient intelligence promises to enable humans to smoothly interact with their environment, mediated by computer technology. In the literature on ambient intelligence, empirical scientists are not often mentioned. Yet they form an interesting target group for this technology. In this position paper, we describe a project aimed at realising an ambient intelligence environment for face-to-face meetings of researchers with different academic backgrounds involved in molecular biology “omics” experiments. In particular, microarray experiments are a focus of attention because these experiments require multidisciplinary collaboration for their design, analysis, and interpretation. Such an environment is characterised by a high degree of complexity that has to be mitigated by ambient intelligence technology. By experimenting in a real-life setting, we will learn more about life scientists as a user group
Concepción de un plan de negocios para la creación de una agencia de representación de nuevos músicos: modelo de aplicación basado en el género pop- rock
Al realizar un plan de negocio a partir de la idea de creación de una agencia de representación de artistas musicales, se analizaron los distintos factores que intervienen en el mismo, para de esta manera evaluar si la idea de negocio es factible o no y así mismo aplicar los conocimiento adquiridos en la maestría al emplearlos en un caso aplicativo que es el de IOSSA, un músico italiano con gran potencial.
En el capítulo II, se realizó un análisis general del mercado en el cuál se determinó la oferta y la demanda de la música grabada y en streaming, para así poder determinar el tipo de estrategias a realizar.
En el capítulo III, se realizó un estudio de la competencia y de acuerdo al análisis, realizó también un plan de marketing con las estrategias a realizar, seguido por una descripción de la estructura organizacional de la Agencia de Representación.
Adicionalmente, se realizó un estudio de factibilidad económica y financiera para medir la factibilidad de la idea de negocio. Por último se realizó un caso aplicativo en el cuál se muestra cómo trabajará la Agencia de representación de artistas musicales.
Líneas de investigación y desarrollo futuras:
La información que se ha considerado necesaria presentar va desde la situación general del mercado, para luego realizar un estudio de la competencia y finalmente realizar un análisis de factibilidad del plan de negocios internacional. Así el proyecto pueda ser entendido, aceptado y también podría ser útil para solicitar créditos o buscar inversores o socios si el caso lo amerite.Facultad de Ciencias Económica