RNA CoMPASS: RNA Comprehensive Multi-Processor Analysis System for Sequencing

Abstract

This dissertation develops a distributed computational pipeline for processing next-generation RNA sequencing (RNA-seq) data. RNA-seq experiments generate hundreds of millions of short reads for each DNA/RNA sample. Many bioinformatics tools already exist for the analysis and visualization of these data, but very large studies present computational and organizational challenges that are difficult to overcome manually. We designed a comprehensive pipeline for RNA-seq analysis that leverages many existing tools and parallel computing technology to facilitate the analysis of extremely large studies. RNA CoMPASS provides a web-based graphical user interface and a distributed computational pipeline that covers endogenous transcriptome quantification as well as the investigation of exogenous sequences.
