20 research outputs found
Genome-wide Characterization of RNA Expression and Processing
The production of fully mature protein-coding transcripts is an intricate process that involves numerous regulation steps. The complexity of these steps provides the means for multilayered control of gene expression. Comprehensive understanding of gene expression regulation is essential for interpreting the role of gene expression programs in tissue specificity, development and disease. In this thesis, we aim to provide a better global view of the human transcriptome, focusing on its content, synthesis, processing and regulation using next-generation sequencing as a read-out. In Paper I, we show that sequencing of total RNA provides unique insights into RNA processing. Our results revealed that co-transcriptional splicing is a widespread mechanism in human and chimpanzee brain tissues. We also found a correlation between slowly removed introns and alternative splicing. In Paper II, we explore the benefits of exome capture approaches in combination with RNA-sequencing to detect transcripts expressed at low-levels. Based on our results, we demonstrate that this approach increases the sensitivity for detecting low level transcripts and leads to the identification of novel exons and splice isoforms. In Paper III, we highlight the advantages of performing RNA-sequencing on separate cytoplasmic and nuclear RNA fractions. In comparison with conventional poly(A) RNA, cytoplasmic RNA contained a significantly higher fraction of exonic sequence, providing increased sensitivity for splice junction detection and for improved de novo assembly. Conversely, the nuclear fraction showed an enrichment of unprocessed RNA compared to when sequencing total RNA, making it suitable for analysis of RNA processing dynamics. In Paper IV, we used exome sequencing to sequence the DNA of a patient with unexplained intellectual disability and identified a de novo mutation in BAZ1A, which encodes the chromatin-remodeling factor ACF1. Functional studies indicated that the mutation influences the expression of genes involved in extracellular matrix organization, synaptic function and vitamin D3 metabolism. The differential expression of CYP24A, SYNGAP1 and COL1A2 correlated with the patient’s clinical diagnosis. The findings presented in this thesis contribute towards an improved understanding of the human transcriptome in health and disease, and highlight the advantages of developing novel methods to obtain global and comprehensive views of the transcriptome
Genome-wide Characterization of RNA Expression and Processing
The production of fully mature protein-coding transcripts is an intricate process that involves numerous regulation steps. The complexity of these steps provides the means for multilayered control of gene expression. Comprehensive understanding of gene expression regulation is essential for interpreting the role of gene expression programs in tissue specificity, development and disease. In this thesis, we aim to provide a better global view of the human transcriptome, focusing on its content, synthesis, processing and regulation using next-generation sequencing as a read-out. In Paper I, we show that sequencing of total RNA provides unique insights into RNA processing. Our results revealed that co-transcriptional splicing is a widespread mechanism in human and chimpanzee brain tissues. We also found a correlation between slowly removed introns and alternative splicing. In Paper II, we explore the benefits of exome capture approaches in combination with RNA-sequencing to detect transcripts expressed at low-levels. Based on our results, we demonstrate that this approach increases the sensitivity for detecting low level transcripts and leads to the identification of novel exons and splice isoforms. In Paper III, we highlight the advantages of performing RNA-sequencing on separate cytoplasmic and nuclear RNA fractions. In comparison with conventional poly(A) RNA, cytoplasmic RNA contained a significantly higher fraction of exonic sequence, providing increased sensitivity for splice junction detection and for improved de novo assembly. Conversely, the nuclear fraction showed an enrichment of unprocessed RNA compared to when sequencing total RNA, making it suitable for analysis of RNA processing dynamics. In Paper IV, we used exome sequencing to sequence the DNA of a patient with unexplained intellectual disability and identified a de novo mutation in BAZ1A, which encodes the chromatin-remodeling factor ACF1. Functional studies indicated that the mutation influences the expression of genes involved in extracellular matrix organization, synaptic function and vitamin D3 metabolism. The differential expression of CYP24A, SYNGAP1 and COL1A2 correlated with the patient’s clinical diagnosis. The findings presented in this thesis contribute towards an improved understanding of the human transcriptome in health and disease, and highlight the advantages of developing novel methods to obtain global and comprehensive views of the transcriptome
Genome-wide Characterization of RNA Expression and Processing
The production of fully mature protein-coding transcripts is an intricate process that involves numerous regulation steps. The complexity of these steps provides the means for multilayered control of gene expression. Comprehensive understanding of gene expression regulation is essential for interpreting the role of gene expression programs in tissue specificity, development and disease. In this thesis, we aim to provide a better global view of the human transcriptome, focusing on its content, synthesis, processing and regulation using next-generation sequencing as a read-out. In Paper I, we show that sequencing of total RNA provides unique insights into RNA processing. Our results revealed that co-transcriptional splicing is a widespread mechanism in human and chimpanzee brain tissues. We also found a correlation between slowly removed introns and alternative splicing. In Paper II, we explore the benefits of exome capture approaches in combination with RNA-sequencing to detect transcripts expressed at low-levels. Based on our results, we demonstrate that this approach increases the sensitivity for detecting low level transcripts and leads to the identification of novel exons and splice isoforms. In Paper III, we highlight the advantages of performing RNA-sequencing on separate cytoplasmic and nuclear RNA fractions. In comparison with conventional poly(A) RNA, cytoplasmic RNA contained a significantly higher fraction of exonic sequence, providing increased sensitivity for splice junction detection and for improved de novo assembly. Conversely, the nuclear fraction showed an enrichment of unprocessed RNA compared to when sequencing total RNA, making it suitable for analysis of RNA processing dynamics. In Paper IV, we used exome sequencing to sequence the DNA of a patient with unexplained intellectual disability and identified a de novo mutation in BAZ1A, which encodes the chromatin-remodeling factor ACF1. Functional studies indicated that the mutation influences the expression of genes involved in extracellular matrix organization, synaptic function and vitamin D3 metabolism. The differential expression of CYP24A, SYNGAP1 and COL1A2 correlated with the patient’s clinical diagnosis. The findings presented in this thesis contribute towards an improved understanding of the human transcriptome in health and disease, and highlight the advantages of developing novel methods to obtain global and comprehensive views of the transcriptome
In silico Prediction and Docking of Tertiary Structure of LuxI, an Inducer Synthase of Vibrio fischeri
Background: LuxI is a component of the quorum sensing signaling pathway in Vibrio fischeri responsible for the inducer synthesis that is essential for bioluminescence.
Methods: Homology modeling of LuxI was carried out using Phyre2 and refined with the GalaxyWEB server. Five models were generated and evaluated by ERRAT, ANOLEA, QMEAN6, and Procheck.
Results: Five refined models were generated by the GalaxyWEB server, with Model 4 having the greatest quality based on the QMEAN6 score of 0.732. ERRAT analysis revealed an overall quality of 98.9%, while the overall quality of the initial model was 54%. The mean force potential energy, as analyzed by ANOLEA, were better compared to the initial model. Sterochemical quality estimation by Procheck showed that the refined Model 4 had a reliable structure, and was therefore submitted to the protein model database. Drug Discovery Workbench V.2 was used to screen 2700 experimental compounds from the DrugBank database to identify inhibitors that can bind to the active site between amino acids 24 and 110. Ten compounds with high negative scores were selected as the best in binding.
Conclusion: The model produced, and the predicted acteyltransferase binding site, could be useful in modeling homologous sequences from other microorganisms and the design of new antimicrobials
Characterization of the nuclear and cytosolic transcriptomes in human brain tissue reveals new insights into the subcellular distribution of RNA transcripts
Transcriptome analysis has mainly relied on analyzing RNA sequencing data from whole cells, overlooking the impact of subcellular RNA localization and its influence on our understanding of gene function, and interpretation of gene expression signatures in cells. Here, we separated cytosolic and nuclear RNA from human fetal and adult brain samples and performed a comprehensive analysis of cytosolic and nuclear transcriptomes. There are significant differences in RNA expression for protein-coding and lncRNA genes between cytosol and nucleus. We show that transcripts encoding the nuclear-encoded mitochondrial proteins are significantly enriched in the cytosol compared to the rest of protein-coding genes. Differential expression analysis between fetal and adult frontal cortex show that results obtained from the cytosolic RNA differ from results using nuclear RNA both at the level of transcript types and the number of differentially expressed genes. Our data provide a resource for the subcellular localization of thousands of RNA transcripts in the human brain and highlight differences in using the cytosolic or the nuclear transcriptomes for expression analysis.De två första författarna delar förstaförfattarskapet</p
Efficient cellular fractionation improves RNA sequencing analysis of mature and nascent transcripts from human tissues
Background: The starting material for RNA sequencing (RNA-seq) studies is usually total RNA or polyA+ RNA. Both forms of RNA represent heterogeneous pools of RNA molecules at different levels of maturation and processing. Such heterogeneity, in addition to the biases associated with polyA+ purification steps, may influence the analysis, sensitivity and the interpretation of RNA-seq data. We hypothesize that subcellular fractions of RNA may provide a more accurate picture of gene expression. Results: We present results for sequencing of cytoplasmic and nuclear RNA after cellular fractionation of tissue samples. In comparison with conventional polyA+ RNA, the cytoplasmic RNA contains a significantly higher fraction of exonic sequence, providing increased sensitivity in expression analysis and splice junction detection, and in improved de novo assembly of RNA-seq data. Conversely, the nuclear fraction shows an enrichment of unprocessed RNA compared with total RNA-seq, making it suitable for analysis of nascent transcripts and RNA processing dynamics. Conclusion: Our results show that cellular fractionation is a more rapid and cost effective approach than conventional polyA+ enrichment when studying mature RNAs. Thus, RNA-seq of separated cytosolic and nuclear RNA can significantly improve the analysis of complex transcriptomes from mammalian tissues
Efficient cellular fractionation improves RNA sequencing analysis of mature and nascent transcripts from human tissues
Background: The starting material for RNA sequencing (RNA-seq) studies is usually total RNA or polyA+ RNA. Both forms of RNA represent heterogeneous pools of RNA molecules at different levels of maturation and processing. Such heterogeneity, in addition to the biases associated with polyA+ purification steps, may influence the analysis, sensitivity and the interpretation of RNA-seq data. We hypothesize that subcellular fractions of RNA may provide a more accurate picture of gene expression. Results: We present results for sequencing of cytoplasmic and nuclear RNA after cellular fractionation of tissue samples. In comparison with conventional polyA+ RNA, the cytoplasmic RNA contains a significantly higher fraction of exonic sequence, providing increased sensitivity in expression analysis and splice junction detection, and in improved de novo assembly of RNA-seq data. Conversely, the nuclear fraction shows an enrichment of unprocessed RNA compared with total RNA-seq, making it suitable for analysis of nascent transcripts and RNA processing dynamics. Conclusion: Our results show that cellular fractionation is a more rapid and cost effective approach than conventional polyA+ enrichment when studying mature RNAs. Thus, RNA-seq of separated cytosolic and nuclear RNA can significantly improve the analysis of complex transcriptomes from mammalian tissues
Expression profiling and in situ screening of circular RNAs in human tissues
Circular RNAs (circRNAs) were recently discovered as a class of widely expressed noncoding RNA and have been implicated in regulation of gene expression. However, the function of the majority of circRNAs remains unknown. Studies of circRNAs have been hampered by a lack of essential approaches for detection, quantification and visualization. We therefore developed a target-enrichment sequencing method suitable for screening of circRNAs and their linear counterparts in large number of samples. We also applied padlock probes and in situ sequencing to visualize and determine circRNA localization in human brain tissue at subcellular levels. We measured circRNA abundance across different human samples and tissues. Our results highlight the potential of this RNA class to act as a specific diagnostic marker in blood and serum, by detection of circRNAs from genes exclusively expressed in the brain. The powerful and scalable tools we present will enable studies of circRNA function and facilitate screening of circRNA as diagnostic biomarkers
Novel Y-Chromosome Long Non-Coding RNAs Expressed in Human Male CNS During Early Development
Global microarray gene expression analyses previously demonstrated differences in female and male embryos during neurodevelopment. In particular, before sexual maturation of the gonads, the differences seem to concentrate on the expression of genes encoded on the X- and Y-chromosomes. To investigate genome-wide differences in expression during this early developmental window, we combined high-resolution RNA sequencing with qPCR to analyze brain samples from human embryos during the first trimester of development. Our analysis was tailored for maximum sensitivity to discover Y-chromosome gene expression, but at the same time, it was underpowered to detect X-inactivation escapees. Using this approach, we found that 5 out of 13 expressed game to log pairs showed unbalanced gene dosage, and as a consequence, a male-biased expression. In addition, we found six novel non-annotated long non-coding RNAs on the Y-chromosome with conserved expression patterns in newborn chimpanzee. The tissue specific and time-restricted expression of these long non-coding RNAs strongly suggests important functions during central nervous system development in human males
A Role for the Chromatin-Remodeling Factor BAZ1A in Neurodevelopment
Chromatin-remodeling factors are required for a wide range of cellular and biological processes including development and cognition, mainly by regulating gene expression. As these functions would predict, deregulation of chromatin-remodeling factors causes various disease syndromes, including neurodevelopmental disorders. Recent reports have linked mutations in several genes coding for chromatin-remodeling factors to intellectual disability (ID). Here, we used exome sequencing and identified a nonsynonymous de novo mutation in BAZ1A (NM_182648.2:c.4043T > G, p.Phe1348Cys), encoding the ATP-utilizing chromatin assembly and remodeling factor 1 (ACF1), in a patient with unexplained ID. ACF1 has been previously reported to bind to the promoter of the vitamin D receptor (VDR)-regulated genes and suppress their expression. Our results show that the patient displays decreased binding of ACF1 to the promoter of the VDR-regulated gene CYP24A1. Using RNA sequencing, we find that the mutation affects the expression of genes involved in several pathways including vitamin D metabolism, Wnt signaling and synaptic formation. RNA sequencing of BAZ1A knockdown cells and Baz1a knockout mice revealed that BAZ1A carry out distinctive functions in different tissues. We also demonstrate that BAZ1A depletion influence the expression of genes important for nervous system development and function. Our data point to an important role for BAZ1A in neurodevelopment, and highlight a possible link for BAZ1A to ID