431 research outputs found

    Using Structural and Evolutionary Information to Detect and Correct Pyrosequencing Errors in Noncoding RNAs.

    Get PDF
    Extended version of RECOMB'13International audienceThe analysis of the sequence-structure relationship in RNA molecules is not only essential for evolutionary studies but also for concrete applications such as error-correction in next generation sequencing (NGS) technologies. The prohibitive sizes of the mutational and conformational landscapes, combined with the volume of data to process, require efficient algorithms to compute sequence-structure properties. In this article, we address the correction of NGS errors by calculating which mutations most increase the likelihood of a sequence to a given structure and RNA family. We introduce RNApyro, an efficient, linear time and space inside-outside algorithm that computes exact mutational probabilities under secondary structure and evolutionary constraints given as a multiple sequence alignment with a consensus structure. We develop a scoring scheme combining classical stacking base-pair energies to novel isostericity scores and apply our techniques to correct pointwise errors in 5s and 16s rRNA sequences. Our results suggest that RNApyro is a promising algorithm to complement existing tools in the NGS error-correction pipeline

    Combining structure probing data on RNA mutants with evolutionary information reveals RNA-binding interfaces

    Get PDF
    International audienceSystematic structure probing experiments (e.g. SHAPE) of RNA mutants such as the mutate-and-map protocol give us a direct access into the genetic robustness of ncRNA structures. Comparative studies of homologous sequences provide a distinct, yet complementary, approach to analyze structural and functional properties of non-coding RNAs. In this paper, we introduce a formal framework to combine the biochemical signal collected from mutate-and-map experiments, with the evolutionary information available in multiple sequence alignments. We apply neutral theory principles to detect complex long-range dependencies between nucleotides of a single stranded RNA, and implement these ideas into a software called aRNhAck. We illustrate the biological significance of this signal and show that the nucleotides networks calculated with aRNhAck are correlated with nucleotides located in RNA-RNA, RNA-protein, RNA-DNA and RNA-ligand interfaces. aRNhAck is freely available at http://csb.cs.mcgill.ca/arnhack

    Next-Generation Sequencing โ€” An Overview of the History, Tools, and โ€œOmicโ€ Applications

    Get PDF
    Next-generation sequencing (NGS) technologies using DNA, RNA, or methylation sequencing have impacted enormously on the life sciences. NGS is the choice for large-scale genomic and transcriptomic sequencing because of the high-throughput production and outputs of sequencing data in the gigabase range per instrument run and the lower cost compared to the traditional Sanger first-generation sequencing method. The vast amounts of data generated by NGS have broadened our understanding of structural and functional genomics through the concepts of โ€œomicsโ€ ranging from basic genomics to integrated systeomics, providing new insight into the workings and meaning of genetic conservation and diversity of living things. NGS today is more than ever about how different organisms use genetic information and molecular biology to survive and reproduce with and without mutations, disease, and diversity within their population networks and changing environments. In this chapter, the advances, applications, and challenges of NGS are reviewed starting with a history of first-generation sequencing followed by the major NGS platforms, the bioinformatics issues confronting NGS data storage and analysis, and the impacts made in the fields of genetics, biology, agriculture, and medicine in the brave, new world of โ€omics.

    Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis

    Get PDF
    The combined application of next-generation sequencing platforms has provided an economical approach to unlocking the potential of the turkey genome

    Application of statistical analysis in transcriptomic and metagenomic data

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ)--์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› :์ž์—ฐ๊ณผํ•™๋Œ€ํ•™ ํ˜‘๋™๊ณผ์ • ์ƒ๋ฌผ์ •๋ณดํ•™์ „๊ณต,2019. 8. ๊น€ํฌ๋ฐœ.With the advance in sequencing technology, genomics, transcriptomics, proteomics, epigenomics and metagenomics study genetic materials on the genome-wide scale. Transcriptome and metagenomics have common tools and methods for analysis because they use quantitative data. Analysis of transcriptome data aims at quantifying gene expression and finding differentially expressed genes (DEG) under the certain condition. To detect DEG and trait-associated genes, various statistical methods and tools have been developed and some of them are widely being used. As analysis of metagenome data also use quantified abundance of microorganisms, some developed tools for transcriptome analysis were also applied in metagenome data. In this thesis, I described how the statistical methods were employed to solve biological problems in quantitative data. Analysis of quantitative data recently employed machine learning based methods to predict traits including disease and healthy status. Especially, as gut microbiome is associated with hosts health, several studies suggested the possibility to diagnosis the diseases using abundance and kinds of microorganisms living in gut. In Chapter 2, machine learning based multi-classifier algorithms were established and evaluated to classify several diseases using gut microbiome data. LogitBoost algorithms and using abundance of microorganisms at genus level showed the highest performance. By selecting microorganisms to enhance the performance, the selected microorganisms were suggested as markers to classify various disease simultaneously. Gut microbiome is used as significant marker not only in human health but also in domestic animals. For example, microbial communities altered by feeds in broiler chickens. In Chapter 3, the effect of A. hookeri on gut microbiome in young broiler chickens was investigated. Statistical test revealed that the composition of microbiome was altered by supplement with A. hookeri leaf. The modulated gut microbiome by leaf was correlated with growth traits including body weight, bone strength, and infectious bursal disease antibody. For more accurate analysis of quantitative data, accurate quantification of genetic materials is essential. Chapter 3 suggests the cause of mis-quantification of mRNAs and solutions to reduce the mis-quantified expression. Long non-coding RNAs, which are overlapped with mRNAs in genomic position, can be mis-quantified to the overlapped mRNAs. Simulation showed the degree of errors by such mis-quantification. Tools for alignment and quantification were compared to reduce the error and achieve more accurate quantification for transcriptome.์‹œํ€€์‹ฑ ๊ธฐ์ˆ ์˜ ๋ฐœ๋‹ฌ๋กœ ์œ ์ „์ฒด, ์ „์‚ฌ์ฒด, ๋‹จ๋ฐฑ์ฒด, ํ›„์„ฑ ์œ ์ „์ฒด, ๋ฉ”ํƒ€์ง€๋…ธ๋ฏน์™€ ๊ฐ™์€ ๋ถ„์•ผ์—์„œ ์œ ์ „์ฒด ๋‹จ์œ„๋กœ ์ƒ๋ช…์ฒด์˜ ์ •๋ณด๋ฅผ ํ•ด๋…ํ•  ์ˆ˜ ์žˆ๊ฒŒ ๋˜์—ˆ๋‹ค. ์ด ์ค‘ ์ „์‚ฌ์ฒด์™€ ๋ฉ”ํƒ€์ง€๋†ˆ ๋ถ„์•ผ๋Š” ์ •๋Ÿ‰ํ™”๋œ ์œ ์ „ ์ •๋ณด๋ฅผ ๋‹ค๋ฃฌ๋‹ค๋Š” ๊ณตํ†ต์  ๋•Œ๋ฌธ์— ํ†ต๊ณ„์  ๋ถ„์„ ๋ฐฉ๋ฒ•๋“ค์„ ๊ณต์œ ํ•˜๊ณ  ์žˆ๋‹ค. ์ „์‚ฌ์ฒด ๋ถ„์„์€ ์œ ์ „์ž๋“ค์˜ ๋ฐœํ˜„์„ ์ •๋Ÿ‰ํ™” ํ•˜๊ณ , ํŠน์ • ์กฐ๊ฑด ํ•˜์— ๋‹ค๋ฅธ ์–‘์œผ๋กœ ๋ฐœํ˜„๋˜๋Š” ์œ ์ „์ž๋ฅผ ๋ฐœ๊ตดํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œํ•˜๊ณ  ์žˆ๋‹ค. ์ •๋Ÿ‰ํ™”๋œ ์–‘์„ ๊ทธ๋ฃน ๊ฐ„ ๋น„๊ตํ•˜๋Š” ๊ฑฐ๋‚˜ ํ˜•์งˆ๊ณผ ๊ด€๋ จ๋œ ์œ ์ „์ž๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•ด ์—ฌ๋Ÿฌ ํ†ต๊ณ„์  ๋ถ„์„ ๋„๊ตฌ ๋ฐ ๋ฐฉ๋ฒ•๋“ค์ด ๊ฐœ๋ฐœ๋˜์—ˆ์œผ๋ฉฐ ๋„๋ฆฌ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋‹ค. ๋ฉ”ํƒ€์ง€๋†ˆ ๋ถ„์„์€ ์ •๋Ÿ‰ํ™” ํ•˜๋Š” ๋Œ€์ƒ์ด ๋ฏธ์ƒ๋ฌผ๋“ค์˜ ์–‘์ด๋ผ๋Š” ๊ฒƒ์€ ๋‹ค๋ฅด๋‚˜ ์ •๋Ÿ‰ํ™”ํ•œ ๋ฏธ์ƒ๋ฌผ๋“ค์˜ ์–‘์„ ๋ถ„์„ํ•˜๊ธฐ ๋•Œ๋ฌธ์— ์ „์‚ฌ์ฒด์—์„œ ์‚ฌ์šฉ๋˜์—ˆ๋˜ ๋ฐฉ๋ฒ•๋“ค์ด ๋Œ€๋ถ€๋ถ„ ์ด์šฉ๋˜๊ณ  ์žˆ๋‹ค. ์–‘์  ์ž๋ฃŒ์˜ ๊ธฐ๋ณธ์ ์ธ ๋ถ„์„์—์„œ ๋” ๋‚˜์•„๊ฐ€์„œ ๋จธ์‹ ๋Ÿฌ๋‹๊ธฐ๋ฒ•์„ ์ด์šฉํ•˜์—ฌ ์ •๋Ÿ‰ํ™”๋œ ์œ ์ „๋ฌผ์งˆ์˜ ์–‘์œผ๋กœ ์งˆ๋ณ‘๊ณผ ๊ฐ™์€ ํ˜•์งˆ์„ ์˜ˆ์ธกํ•˜๊ณ ์ž ํ•˜๋Š” ์‹œ๋„๋„ ์ด๋ฃจ์–ด ์ง€๊ณ  ์žˆ๋‹ค. ํŠนํžˆ, ์ธ๊ฐ„์˜ ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ์€ ๋ฉด์—ญ์ฒด๊ณ„์™€ ์—ฐ๊ด€์„ฑ์ด ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ์˜ ์ข…๋ฅ˜์™€ ์–‘์œผ๋กœ ์งˆ๋ณ‘์„ ์ง„๋‹จํ•˜๋ ค๋Š” ์—ฌ๋Ÿฌ ์—ฐ๊ตฌ๊ฐ€ ๋ณด๊ณ  ๋˜์—ˆ๋‹ค. ์ œ 2์žฅ์—์„œ๋Š” ๋‹ค์–‘ํ•œ ์งˆ๋ณ‘์„ ๊ฐ€์ง„ ํ™˜์ž๋“ค์˜ ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ์„ ์ด์šฉํ•˜์—ฌ ๋จธ์‹ ๋Ÿฌ๋‹๊ธฐ๋ฐ˜ ๋‹ค์ค‘ ๋ถ„๋ฅ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์งˆ๋ณ‘์„ ๋ถ„๋ฅ˜ํ•  ์ˆ˜ ์žˆ๋Š” ๋ชจ๋ธ์„ ๊ตฌ์ถ•ํ•˜๊ณ  ์ด๋ฅผ ํ‰๊ฐ€ํ•˜์˜€๋‹ค. ์ด ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด LogitBoost ๊ธฐ๋ฐ˜ ์˜ˆ์ธก ๋ชจ๋ธ์ด 6 ๊ฐ€์ง€ ์งˆ๋ณ‘์„ ๊ฐ€์žฅ ์ž˜ ๊ตฌ๋ถ„ ์ง“๋‹ค๋Š” ๊ฒƒ์„ ๋ฐํ˜”๊ณ , ๋ฏธ์ƒ๋ฌผ์˜ ๋ถ„๋ฅ˜์ฒด๊ณ„ ์ค‘ ์†(genus)์—์„œ์˜ ์–‘์„ ์ด์šฉํ–ˆ์„ ๋•Œ ์„ฑ๋Šฅ์ด ๊ฐ€์žฅ ์ข‹๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์˜€๋‹ค. ๋˜ํ•œ ๋ฏธ์ƒ๋ฌผ๋“ค์„ ์„ ํƒํ•˜์—ฌ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ๋†’์ด๋Š” ๊ณผ์ •์—์„œ ๋‹ค์–‘ํ•œ ์งˆ๋ณ‘์„ ๋™์‹œ์— ๊ตฌ๋ถ„ํ•˜๋Š” ๋ฏธ์ƒ๋ฌผ๋“ค์„ ์งˆ๋ณ‘ ์ง„๋‹จ์„ ์œ„ํ•œ ๋งˆ์ปค๋กœ ์ œ์‹œํ•˜์˜€๋‹ค. ์ธ๊ฐ„์—์„œ ๋ฟ๋งŒ์•„๋‹ˆ๋ผ ๋™๋ฌผ๋“ค์—์„œ๋„ ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ์˜ ์กฐ์„ฑ์€ ๊ฑด๊ฐ• ๋ฐ ์ƒ์‚ฐ๋Ÿ‰์˜ ์ค‘์š”ํ•œ ์ง€ํ‘œ๋กœ ์ด์šฉ๋˜๊ณ  ์žˆ๋‹ค. ์˜ˆ๋ฅผ๋“ค์–ด, ์‚ฌ๋ฃŒ์— ๋”ฐ๋ผ ์œก๊ณ„์˜ ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ ์กฐ์„ฑ์˜ ๋ณ€ํ™”๋Š” ๊ณผ๊ฑฐ ์—ฐ๊ตฌ์—์„œ ๋ณด๊ณ ๋˜์–ด ์™”๋‹ค. 3์žฅ์—์„œ๋Š” ์‚ผ์ฑ„๋ฅผ ๋ณต์šฉํ•œ ์œก๊ณ„์˜ ์žฅ๋‚ด ๋ฏธ์ƒ๋ฌผ์„ ์กฐ์‚ฌํ•˜๊ณ , ์ƒ์‚ฐ์„ฑ๊ณผ ์—ฐ๊ด€์ด ์žˆ๋Š” ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ๋“ค์„ ๋ฐœ๊ตดํ•˜์˜€๋‹ค. ์‚ผ์ฑ„์˜ ์žŽ์„ ๋ณต์šฉ์€ ์œก๊ณ„์˜ ์žฅ๋‚ด๋ฏธ์ƒ๋ฌผ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š” ๊ฒƒ์„ ๋ฐํ˜€๋‚ด์—ˆ์œผ๋ฉฐ, ์—ฐ๊ด€์„ฑ ๋ถ„์„์„ ํ†ตํ•ด์„œ ์‚ผ์ฑ„์˜ ๋ณต์šฉ์— ์˜ํ–ฅ์„ ๋ฐ›๋Š” ๋ฏธ์ƒ๋ฌผ๋“ค์ด ์œก๊ณ„์˜ ์ฒด์ค‘, ๊ฒฝ๊ณจ๊ฐ•๋„ ๋ฐ ๋ฉด์—ญ๊ณผ ๊ด€๋ จ๋˜์–ด ์žˆ๋‹ค ๊ฒƒ์„ ์ œ์‹œํ•˜์˜€๋‹ค. ๋˜ํ•œ ๋ฏธ์ƒ๋ฌผ ๊ธฐ๋Šฅ๋ถ„์„์„ ํ†ตํ•ด ๋ฏธ์ƒ๋ฌผ ์กฐ์„ฑ์˜ ๋ณ€ํ™”๊ฐ€ ํƒ„์ˆ˜ํ™”๋ฌผ ๋Œ€์‚ฌ๋ฅผ ์ฆ์ง„์‹œํ‚จ๋‹ค๋Š” ๋‹จ์„œ๋ฅผ ์ œ์‹œํ•˜์˜€๋‹ค. ์ •๋Ÿ‰์  ๋ฐ์ดํ„ฐ๋“ค์˜ ์ข€ ๋” ์ •ํ™•ํ•œ ๋ถ„์„์„ ์œ„ํ•ด์„œ๋Š” ์ •๋Ÿ‰ํ™” ๋‹จ๊ณ„์—์„œ ์œ ์ „๋ฌผ์งˆ์˜ ์ •ํ™•ํ•œ ์ธก์ •์ด ๋ฌด์—‡๋ณด๋‹ค ์ค‘์š”ํ•˜๋‹ค. 3์žฅ์—์„œ๋Š” ์ „์‚ฌ์ฒด ๋ถ„์„์—์„œ mRNA์˜ ๋ฐœํ˜„๋Ÿ‰์ด ์ •ํ™•ํžˆ ์ธก์ •๋˜์ง€ ๋ชปํ•˜๊ฒŒ ํ•˜๋Š” ์—๋Ÿฌ์š”์ธ์„ ์ œ์‹œํ•˜๊ณ  ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ๋ฐฉ๋ฒ•๋“ค์„ ์ œ์‹œํ•˜์˜€๋‹ค. ์œ ์ „์ฒด ์„œ์—ด์—์„œ mRNA์™€ ์ค‘์ฒฉ๋˜์–ด ์žˆ๋Š” lncRNA๋Š” ์ •๋Ÿ‰ํ™”์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ƒ mRNA๋กœ ์˜ค์ธ๋  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์„ ๊ฐ€์ •ํ•˜์˜€์œผ๋ฉฐ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ํ†ตํ•ด lncRNA ๋ฐœํ˜„๋Ÿ‰์ž„์—๋„ mRNA๋กœ ์ •๋Ÿ‰ํ™”๋˜๋Š” ์—๋Ÿฌ์œจ์„ ์ œ์‹œํ•˜์˜€๋‹ค. ์ด๋Ÿฌํ•œ ์—๋Ÿฌ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด์„œ ์ •๋Ÿ‰ํ™” ๋‹จ๊ณ„์—์„œ ์“ฐ์ด๋Š” ์—ฌ๋Ÿฌ ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ ํˆด์„ ๋น„๊ตํ•˜์—ฌ ๋” ์ •ํ™•ํ•œ ์ •๋ นํ™”๋ฅผ ํ†ตํ•œ ์ „์‚ฌ์ฒด ๋ถ„์„์„ ๊ฐ€๋Šฅํ•˜๋„๋ก ํ•˜์˜€๋‹ค.Abstract 4 Contents 7 List of Tables 9 List of Figures 10 Chapter 1. Literature Review 13 1.1 Machine learning approaches for gut microbiome data 14 1.2 Community analysis in the metagenomic data 16 1.3 Quantification of mRNAs and lncRNAs 17 Chapter 2. Establishment and evaluation of prediction model for multiple disease classification based on gut microbial data 20 2.1 Abstract 21 2.2 Introduction 22 2.3 Materials and Methods 25 2.4 Results 28 2.5 Discussion 42 Chapter 3. Effects of Allium hookeri leaf on gut microbiome related to growth performance of young broiler chickens 52 3.1 Abstract 53 3.2 Introduction 54 3.3 Materials and methods 57 3.4 Result 60 3.5 Discussion 73 Chapter 4. The overlap between lncRNA and mRNA causes misleading transcripts quantification: A comprehensive evaluation of quantification for RNA-Seq data 82 4.1 Abstract 83 4.2 Introduction 85 4.3 Materials and Methods 88 4.4 Results 96 4.5 Discussion 117 References 137 ์š”์•ฝ(๊ตญ๋ฌธ์ดˆ๋ก) 152Maste

    Emerging Topics in Genome Sequencing and Analysis

    Get PDF
    This dissertation studies the emerging topics in genome sequencing and analysis with DNA and RNA. The optimal hybrid sequencing and assembly for accurate genome reconstruction and efficient detection approaches for novel ncRNAs in genomes are discussed. The next-generation sequencing is a significant topic that provides whole genetic information for the further biological research. Recent advances in high-throughput genome sequencing technologies have enabled the systematic study of various genomes by making whole genome sequencing affordable. To date, many hybrid genome assembly algorithms have been developed that can take reads from multiple read sources to reconstruct the original genome. An important aspect of hybrid sequencing and assembly is that the feasibility conditions for genome reconstruction can be satisfied by different combinations of the available read sources, opening up the possibility of optimally combining the sources to minimize the sequencing cost while ensuring accurate genome reconstruction. In this study, we derive the conditions for whole genome reconstruction from multiple read sources at a given confidence level and also introduce the optimal strategy for combining reads from different sources to minimize the overall sequencing cost. We show that the optimal read set, which simultaneously satisfies the feasibility conditions for genome reconstruction and minimizes the sequencing cost, can be effectively predicted through constrained discrete optimization. The availability of genome-wide sequences for a variety of species provides a large database for the further RNA analysis with computational methods. Recent studies have shown that noncoding RNAs (ncRNAs) are known to play crucial roles in various biological processes, and some ncRNAs are related to the genome stability and a variety of inherited diseases. The discovery of novel ncRNAs is hence an important topic, and there is a pressing need for accurate computational detection approaches that can be used to efficiently detect novel ncRNAs in genomes. One important issue is RNA structure alignment for comparative genome analysis, as RNA secondary structures are better conserved than the RNA sequences. Simultaneous RNA alignment and folding algorithms aim to accurately align RNAs by predicting the consensus structure and alignment at the same time, but the computational complexity of the optimal dynamic programming algorithm for simultaneous alignment and folding is extremely high. In this work, we proposed an innovative method, TOPAS, for RNA structural alignment that can efficiently align RNAs through topological networks. Although many ncRNAs are known to have a well conserved secondary structure, which provides useful clues for computational prediction, the prediction of ncRNAs is still challenging, since it has been shown that a structure-based approach alone may not be sufficient for detecting ncRNAs in a single sequence. In this study, we first develop a new approach by utilizing the n-gram model to classify the sequences and extract effective features to capture sequence homology. Based on this approach, we propose an advanced method, piRNAdetect, for reliable computational prediction of piRNAs in genome sequences. Utilizing the n-gram model can enhance the detection of ncRNAs that have sparse folding structures with many unpaired bases. By incorporating the n-gram model with the generalized ensemble defect, which assesses structure conservation and conformation to the consensus structure, we further propose RNAdetect, a novel computational method for accurate detection of ncRNAs through comparative genome analysis. Extensive performance evaluation based on the Rfam database and bacterial genomes demonstrates that our approaches can accurately and reliably detect novel ncRNAs, outperforming the current advanced methods

    Changes in DNA methylation patterns in mammals with senescence, ageing and energy restriction

    Get PDF
    PhD ThesisDNA methylation is a reversible and inheritable chemical modification which involves the addition of a methyl group to DNA catalyzed by DNA methyltransferases (DNMTs), resulting in the conversion of the cytosine to 5โ€™-methylcytosine, where the cytosine residue is followed by a guanine residue (CpG). In mammals, there are unusually dense clusters of CpG dinucleotides in the promoters of genes which are called โ€œCpG islandsโ€. For many housekeeping genes, these CpG islands are unmethylated in normal healthy tissue. However, CpG islands methylation is often associated with gene silencing and changed patterns of DNA methylation, associated with altered patterns of gene transcription, contribute to the aetiology of several diseases and to ageing. The overall aim of this project was to characterise the changes in DNA methylation which are observed during cell senescence (in the human cell line MRC-5) and during ageing and in response to dietary energy restriction (in various mice tissues). For this purpose: i) global DNA methylation was quantified using various techniques (ELISA, LINE-1, B1 and LUMA assays and HPLC); ii) site-specific genome-wide screening for methylation changes was performed using the MeDIP technique followed by hybridization to DNA microarrays; iii) validation of the results for selected candidates was performed by pyrosequencing and iv) investigation of mitochondrial DNA methylation was conducted using bisulphite-modified-DNA PCR direct sequencing. Effects of senescence on gene expression were assessed by transcriptome microarrays and by RT-qPCR studies. During the study, previously established methods for the investigation of global DNA methylation (LUMA) and site-specific methylation (MeDIP) were improved. Whilst global DNA methylation changes were detectable in senescence and after short-term dietary energy restriction, DNMTs/Dnmts expression changes were observed in senescence, ageing and following dietary energy restriction and were tissue- and treatment-specific. In parallel, site-specific aberrant DNA methylation was found in the promoters of the genes CTTN, GLIPR2, NPTX1 and SLC39A14 in replicative senescent MRC-5 human fibroblasts. These changes were validated by pyrosequencing and were accompanied by changes in expression of the corresponding genes. Also, in a pilot study, promoter methylation of several cell cycle genes was altered in replicative senescence with associated changes in gene expression. Concordant methylation changes were found in the promoters of 47 gene in ageing mouse and heart tissues including the genes Wnt5a, Map4k5, Apcdd1, Chp2 and Rasgrp2. In addition, dietary energy restriction counteracted the age-related DNA methylation changes in the promoters of 40 genes, including Aifm1, Irf8, Rarg, Nmi, Maf1, Rab33a and Fxn in mouse liver. Finally, mitochondrial DNA methylation studies revealed that senescence affected the DNA methylation patterns of the MT-COI and MT-ND1 gene coding sequences in MRC-5 fibroblasts whilst ageing affected the DNA methylation pattern of the D-Loop region in mouse liver, but this was not reversed by dietary energy restriction. Pathway analysis revealed that senescence- and age-related aberrant DNA methylation affected genes involved in inter-cellular communication, stress response, malignant transformation, cellular development/proliferation control, cell growth/differentiation and survival, apoptosis and immune response. As these genes contribute to the maintenance of cellular and tissue homeostasis, these findings suggest a potential role for altered DNA methylation in the aetiology of senescence and ageing. On the other hand, short-term dietary energy restriction modulated some of the age-related aberrant DNA methylation patterns of the ageing mouse liver, in particular those in promoters of genes involved in apoptosis regulation, inflammatory and immune response to viral infections, transcription regulation, vesicle trafficking and mitochondrial iron transport and respiration. Finally, mitochondrial DNA aberrant methylation - found to occur at genes belonging to Complex IV and to Complex I - may contribute to the accumulation of hazardous superoxide species in senescent cells whilst DNA aberrant methylation at the D-Loop mitochondrial regulatory region may contribute to age-related mitochondrial dysfunction. In conclusion, these findings suggest that altered DNA methylation may have a role in the aetiology of senescence and ageing and that some of the effects of dietary energy restriction in slowing down the ageing process and also delaying the onset of age-related diseases may occur via epigenetic mechanisms, including amelioration of age-related aberrations in patterns of DNA methylation

    New approaches to unveil the Transcriptional landscape of dopaminergic neurons

    Get PDF
    Recent advances in studying the mammalian transcriptome arised new questions about how genes are organized and what is the function of noncoding RNAs. Furthermore, the discovery of large amounts of polyA- transcripts and antisense transcription proved that a portion of the transcriptome has still to be characterized. The complex anatomo-functional organization of the brain has prevented a comprehensive analysis of the transcriptional landscape of this tissue. New techniques must be developed to approach neuronal heterogeneity. In this study we combined Laser Capture Microdissection (LCM) and nanoCAGE, based on Cap Analysis of Gene Expression (CAGE), to describe expressed genes and map their transcription start sites (TSS) in two specific populations, A9 and A10, of mouse mesencephalic dopaminergic cells. Although sharing common dopaminergic marker genes, these two populations are part of different midbrain anatomical structures, substantia nigra (SN) for A9 and ventral tegmental area (VTA) for A10, project to relatively distinct areas, participate to distinct ascending dopaminergic pathways, exhibit different electrophysiological properties and different susceptibility to neurodegeneration in Parkinson`s disease. Specific neurons were identified by the expression of Green Fluorescent Protein driven by a celltype specific promoter in transgenic mice. High-quality RNAs were purified from 1000-2500 cells collected by LCM. We adapted the CAGE technique to analyze limiting amounts of RNAs (nanoCAGE). We took advantage of the cap-switching properties of the reverse transcriptase to specifically tag the 5`end of transcripts with a sequence containing a class III restriction site for EcoP15I. By creating 32bp 5`tags, we considerably improved the TSS mapping rate on the genome. A semi-suppressive PCR strategy was used to prevent primer dimers formation. The use of random priming in the 1st strand synthesis allowed to capture poly(A)- RNAs. 5`tags were sequenced with Illumina-Solexa platform. Here we show that this new nanoCAGE technology ensures a true high-throughput coverage of the transcriptome of a small number of identified neurons and can be used as an effective mean for gene discovery in the noncoding RNAs, to uncover putative alternative promoters associated to variants of protein coding transcripts and to detect potentially regulatory antisense transcripts. A further experimental validation by 5`RACE (Rapid Amplification of cDNA Ends) and RT-PCR on few candidate genes, have confirmed the existence in vivo of alternative TSS in the case of key regulatory genes involved in specifying and maintaining the dopaminergic phenotype of these neurons such as \u3b1-synuclein (Snca), dopamine transporter (Dat), vescicular monoamine transporter 2 (Vmat2), catechol-O-methyltransferase (Comt). Furthermore the differential expression of an antisense transcript overlapping to the polyubiquitin (Ubc) gene was detected as potentially interesting candidate gene accounting for differences in the ubiquitin-proteasome system (UPS) function in the two neuron populations. The potential implications deriving from these newly discovered alternative promoters and transcripts are discussed, considering also the potential consequences for the corresponding protein isoforms

    Genomewide analysis of gene expression in Vitis vinifera ssp.

    Get PDF
    • โ€ฆ
    corecore