HAL-Ecole des Ponts ParisTech

Hal-Diderot

HAL - UPEC / UPEM

Plant-RRBS, a bisulfite and next-generation sequencing-based methylome profiling method enriching for coverage of cytosine positions

Author: A Akalin
A Meissner
A Verkest
AJ Bewick
AR Elhamamsy
AR Quinlan
B Langmead
Bram Slabbinck
C Becker
CA Ibarra
Cindy Martens
D Meng
D Pignatta
EJ Finnegan
F Johannes
Frederik Coppens
G Gremme
H Parkinson
H Saze
H Schöb
H Stroud
H Zhang
H-Q Wang
HJ Xie
International Rice Genome Sequencing Project
J Du
J Yu
JI Gent
K Manning
K Okonechnikov
M Block De
M Choi
M Hauben
M Schmidt
Magdalena Woloszynska
Marc De Block
Martin Schmidt
MD Schultz
MG Murray
Michiel Van Bel
Mieke Van Lijsebettens
P Cubas
PF Gugger
PS Schnable
R Lister
RJ Schmitz
SE Jacobsen
SJ Cokus
T-F Hsieh
The Arabidopsis Genome Initiative
TJ Treangen
TP Gurp van
W Guo
X Cao
X Chen
X Li
X Wang
X Zhang
ZD Smith
ZL Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Background: Cytosine methylation in plant genomes is important for the regulation of gene transcription and transposon activity. Genome-wide methylomes are studied upon mutation of the DNA methyltransferases, adaptation to environmental stresses or during development. However, from basic biology to breeding programs, there is a need to monitor multiple samples to determine transgenerational methylation inheritance or differential cytosine methylation. Methylome data obtained by sodium hydrogen sulfite (bisulfite)-conversion and next-generation sequencing (NGS) provide genome-wide information on cytosine methylation. However, a profiling method that detects cytosine methylation state dispersed over the genome would allow high-throughput analysis of multiple plant samples with distinct epigenetic signatures. We use specific restriction endonucleases to enrich for cytosine coverage in a bisulfite and NGS-based profiling method, which was compared to whole-genome bisulfite sequencing of the same plant material. Methods: We established an effective methylome profiling method in plants, termed plant-reduced representation bisulfite sequencing (plant-RRBS), using optimized double restriction endonuclease digestion, fragment end repair, adapter ligation, followed by bisulfite conversion, PCR amplification and NGS. We report a performant laboratory protocol and a straightforward bioinformatics data analysis pipeline for plant-RRBS, applicable for any reference-sequenced plant species. Results: As a proof of concept, methylome profiling was performed using an Oryza sativa ssp. indica pure breeding line and a derived epigenetically altered line (epiline). Plant-RRBS detects methylation levels at tens of millions of cytosine positions deduced from bisulfite conversion in multiple samples.
To evaluate the method, the coverage of cytosine positions, the intra-line similarity and the differential cytosine methylation levels between the pure breeding line and the epiline were determined. Plant-RRBS reproducibly covers commonly up to one fourth of the cytosine positions in the rice genome when using MspI-DpnII within a group of five biological replicates of a line. The method predominantly detects cytosine methylation in putative promoter regions and not-annotated regions in rice. Conclusions: Plant-RRBS offers high-throughput and broad, genome-dispersed methylation detection by effective read number generation obtained from reproducibly covered genome fractions using optimized endonuclease combinations, facilitating comparative analyses of multi-sample studies for cytosine methylation and transgenerational stability in experimental material and plant breeding populations.
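
The double restriction digestion step can be illustrated in silico. The sketch below is not the authors' pipeline: it assumes the standard recognition sites of MspI (C^CGG) and DpnII (^GATC), treats the input as one linear sequence, and uses hypothetical function names. The size window is left as a parameter because the abstract does not state one.

```python
import re

# Recognition site and cut offset within the site.
# MspI cuts C^CGG (offset 1); DpnII cuts ^GATC (offset 0).
# Both sites are palindromic, so scanning the forward strand is sufficient.
ENZYMES = {"MspI": ("CCGG", 1), "DpnII": ("GATC", 0)}


def cut_positions(seq, enzymes=ENZYMES):
    """Return the sorted cut coordinates of all enzymes on seq."""
    cuts = set()
    for site, offset in enzymes.values():
        # A lookahead pattern reports overlapping sites as well.
        for match in re.finditer(f"(?={site})", seq):
            cuts.add(match.start() + offset)
    return sorted(cuts)


def digest(seq, min_len=0, max_len=None):
    """Split seq at every cut and keep fragments inside the size window.

    The first and last fragments are bounded by a sequence end rather
    than by two cut sites; a real analysis may want to drop them.
    """
    inner = [c for c in cut_positions(seq) if 0 < c < len(seq)]
    bounds = [0] + inner + [len(seq)]
    fragments = [seq[a:b] for a, b in zip(bounds, bounds[1:])]
    return [
        f for f in fragments
        if len(f) >= min_len and (max_len is None or len(f) <= max_len)
    ]


# Toy sequence (hypothetical, for illustration only).
toy = "AAACCGGTTTTGATCAAAACCGGTT"
fragments = digest(toy)
print(fragments)
```

Running the same function over each chromosome of a reference genome and counting cytosines inside the retained fragments would give a rough estimate of the fraction of cytosine positions an enzyme pair could cover.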

Ghent University Academic Bibliography

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

GenomeBlast: a web tool for small genome comparison

Author: AL Delcher
DD Womble
DL Swofford
Etsuko N Moriyama
Guoqing Lu
J Felsenstein
JO Korbel
KA Frazer
KP O'Brien
L Florea
Liying Jiang
Luwen Zhang
M Berriman
M Remm
MD Hendy
MG Montague
MM Alba
RD Page
Resa MK Helikar
RL Tatusov
S Kurtz
S Schwartz
S Yang
SF Altschul
T Treangen
T Xie
Thaine W Rowley
TJ Carver
Xianfeng Chen
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Comparative genomics has become an essential approach for identifying homologous gene candidates and their functions, and for studying genome evolution. There are many tools available for genome comparisons. Unfortunately, most of them are not applicable for the identification of unique genes and the inference of phylogenetic relationships in a given set of genomes. RESULTS: GenomeBlast is a Web tool developed for comparative analysis of multiple small genomes. A new parameter called "coverage" was introduced and used along with sequence identity to evaluate global similarity between genes. With GenomeBlast, the following results can be obtained: (1) unique genes in each genome; (2) homologous gene candidates among compared genomes; (3) 2D plots of homologous gene candidates across all pairwise genome comparisons; and (4) a table of gene presence/absence information and a genome phylogeny. We demonstrated the functions in GenomeBlast with an example of multiple herpesviral genome analysis and illustrated how GenomeBlast is useful for small genome comparison. CONCLUSION: We developed a Web tool for comparative analysis of small genomes, which allows the user not only to identify unique genes and homologous gene candidates among multiple genomes, but also to view their graphical distributions on genomes, and to reconstruct genome phylogeny. GenomeBlast runs on a Linux server with 4 CPUs and 4 GB memory. The online version of GenomeBlast is available to the public by using a Web browser with the URL
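
How a "coverage" parameter can be combined with sequence identity is sketched below. This is an illustration, not GenomeBlast's code: the abstract does not define coverage, so the sketch assumes it is the aligned fraction of the query gene, and the thresholds, the `Hit` record and the function names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    query: str           # gene ID in the query genome
    subject_genome: str  # genome in which the match was found
    query_len: int       # full length of the query gene
    aligned_len: int     # part of the query covered by the alignment
    identity: float      # percent identity over the aligned region


def coverage(hit):
    """Assumed definition: fraction of the query gene that is aligned."""
    return hit.aligned_len / hit.query_len


def is_homolog(hit, min_identity=50.0, min_coverage=0.5):
    # Both thresholds are illustrative placeholders, not the tool's defaults.
    return hit.identity >= min_identity and coverage(hit) >= min_coverage


def presence_table(genes, genomes, hits):
    """Map each query gene to {genome: present?} using the passing hits."""
    table = {gene: {name: False for name in genomes} for gene in genes}
    for hit in hits:
        if is_homolog(hit):
            table[hit.query][hit.subject_genome] = True
    return table


hits = [
    Hit("g1", "B", query_len=300, aligned_len=280, identity=90.0),
    # High identity but short alignment: fails the coverage filter.
    Hit("g2", "B", query_len=300, aligned_len=60, identity=95.0),
]
table = presence_table(["g1", "g2"], ["B"], hits)
unique_genes = [g for g, row in table.items() if not any(row.values())]
print(unique_genes)
```

Requiring both criteria keeps a short, highly identical local match (such as a shared domain) from being counted as a homolog of the whole gene, which is why `g2` ends up in the unique-gene list here.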

DigitalCommons@University of Nebraska

The University of Nebraska, Omaha

A New Rhesus Macaque Assembly and Annotation for Next-Generation Sequencing Analyses

Author: Bosinger Steven E.
Cornish Adam S.
Ferguson Betsy
Fox Howard S.
Gibbs Robert M.
Johnson Zachary P.
Marçais Guillaume
Maudhoo Mnirnal D.
Meehan Daniel T.
Norgren Robert B.
Pandey Sanjit
Roberts Michael
Salzberg Steven L.
Tharp Gregory K.
Treangen Todd
Wipfler Kristin
Yorke James A.
Zhang Xiongfei
Zimin Aleksey V.
Publication venue: DigitalCommons@UNMC
Publication date: 01/01/2014

BACKGROUND: The rhesus macaque (Macaca mulatta) is a key species for advancing biomedical research. Like all draft mammalian genomes, the draft rhesus assembly (rheMac2) has gaps, sequencing errors and misassemblies that have prevented automated annotation pipelines from functioning correctly. Another rhesus macaque assembly, CR_1.0, is also available but is substantially more fragmented than rheMac2 with smaller contigs and scaffolds. Annotations for these two assemblies are limited in completeness and accuracy. High quality assembly and annotation files are required for a wide range of studies including expression, genetic and evolutionary analyses. RESULTS: We report a new de novo assembly of the rhesus macaque genome (MacaM) that incorporates both the original Sanger sequences used to assemble rheMac2 and new Illumina sequences from the same animal. MacaM has a weighted average (N50) contig size of 64 kilobases, more than twice the size of the rheMac2 assembly and almost five times the size of the CR_1.0 assembly. The MacaM chromosome assembly incorporates information from previously unutilized mapping data and preliminary annotation of scaffolds. Independent assessment of the assemblies using Ion Torrent read alignments indicates that MacaM is more complete and accurate than rheMac2 and CR_1.0. We assembled messenger RNA sequences from several rhesus tissues into transcripts which allowed us to identify a total of 11,712 complete proteins representing 9,524 distinct genes. Using a combination of our assembled rhesus macaque transcripts and human transcripts, we annotated 18,757 transcripts and 16,050 genes with complete coding sequences in the MacaM assembly. Further, we demonstrate that the new annotations provide greatly improved accuracy as compared to the current annotations of rheMac2. Finally, we show that the MacaM genome provides an accurate resource for alignment of reads produced by RNA sequence expression studies. 
CONCLUSIONS: The MacaM assembly and annotation files provide a substantially more complete and accurate representation of the rhesus macaque genome than rheMac2 or CR_1.0 and will serve as an important resource for investigators conducting next-generation sequencing studies with nonhuman primates. REVIEWERS: This article was reviewed by Dr. Lutz Walter, Dr. Soojin Yi and Dr. Kateryna Makova.
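
The N50 statistic quoted for contig size can be computed as in the minimal sketch below. It uses the common definition (the length L such that contigs of length at least L contain half or more of the assembled bases); the contig lengths are made-up toy values, not data from any of the assemblies named above.

```python
def n50(lengths):
    """Return L such that contigs of length >= L hold at least half the bases."""
    total = sum(lengths)
    running = 0
    # Walk from the longest contig down, accumulating assembled bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("empty list of contig lengths")


# Toy contig lengths (hypothetical).
contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(contigs))
```

Because the statistic weights each contig by its length, a handful of long contigs can raise N50 even when the assembly still contains many short fragments.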

Digital Repository at the University of Maryland

University of Nebraska Medical Center Research: DigitalCommons@UNMC

The influence of the accessory genome on bacterial pathogen evolution

Author: Abu-Ali GS
Adiba S
Alfano JR
Arnold DL
Asadulghani M
Baharoglu Z
Baquero F
Barash I
Blondel CJ
Brüssow H
Cambray G
Chen J
Chen Y
Choi J
Colinon C
Croucher NJ
Dawes FE
De Gelder L
Diard M
Dillon SC
Douard G
Doyle M
Elsaied H
Fondi M
Freeman VJ
Gartemann KH
Gillings MR
Godfrey SAC
Govind R
Greenberg JT
Grillot-Courvalin C
Groisman EA
Guerin E
Hacker J
Halary S
Hassan F
Hazen TH
Hegstad K
Heinemann JA
Heringa S
Hochhut B
Holden MT
Huang L
Imamovic L
Jackson RW
Jenner C
Joss M
Jové T
Kearney B
Kers JA
Kiiru JN
Koenig JE
Krauland MG
Landgraf A
Larsson P
Leplae Rl
León G
Lipps HJ
Lloyd AL
Loftie-Eaton W
Lovell HC
Manning SD
Marchetti M
Matz C
Maurelli AT
Mazel D
Michael CA
Morris CE
Moura A
Naas T
Nadarasah G
Naka H
Nawaz M
Ogura Y
Paauw A
Partridge SR
Pitman A
Poirel L
Ramirez MS
Rankin DJ
Rezzonico F
Rivas LA
Rodriguez-Martinez JM
Rohmer L
Rosewarne CP
Sajjad A
Salanoubat M
Salzberg SL
Seth-Smith H
Shaheen BW
Siguier P
Smillie C
Smith AB
Song H
Sota M
Steinberg KM
Sundin GW
Tao L
Treangen TJ
van der Meer JR
van der Veen EL
van Essen-Zandbergen A
Wagner A
Waldor MK
Wiesner M
Woodford N
Yang H
Zhou Z
Zupan J
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2011

Bacterial pathogens exhibit significant variation in their genomic content of virulence factors. This reflects the abundance of strategies pathogens evolved to infect host organisms by suppressing host immunity. Molecular arms-races have been a strong driving force for the evolution of pathogenicity, with pathogens often encoding overlapping or redundant functions, such as type III protein secretion effectors, and hosts encoding ever more sophisticated immune systems. The pathogens’ frequent exposure to other microbes, either in their host or in the environment, provides opportunities for the acquisition or interchange of mobile genetic elements. These DNA elements accessorise the core genome and can play major roles in shaping genome structure and altering the complement of virulence factors. Here, we review the different mobile genetic elements, focusing on the more recent discoveries and highlighting their role in shaping bacterial pathogen evolution.

Central Archive at the University of Reading

UWE Bristol Research Repository

Academica-e (Univ. Pública de Navarra)

CoCoNUT: an efficient system for the comparison and analysis of genomes

Author: A Darling
A Kasprzyk
B Haas
B Ma
B Mau
B Morgenstern
B Raphael
C Wawra
DR Bentley
E Mardis
E Ohlebusch
E Passarge
E Sonnhammer
Enno Ohlebusch
G Bourque
G Gremme
I Ovcharenko
J Krumsiek
J Peterson
J Thompson
L Florea
M Abouelhoda
M Blanchette
M Brudno
M Clamp
M Höhl
M Kellis
M Margulies
Mohamed I Abouelhoda
P Chain
R Staden
S Altschul
S Karlin
S Kurtz
S Ranganathan
S Schwartz
S Shibuya
Stefan Kurtz
T Treangen
T Vision
T Wu
The Arabidopsis Genome Initiative
W Kent
Publication venue: BioMed Central
Publication date: 01/01/2008

Public Library of Science (PLOS)

Insertion Sequence Inversions Mediated by Ectopic Recombination between Terminal Inverted Repeats

Author: A Barzel
Alison Ling
C Feschotte
C Vitte
DJ Hedges
DW Martin
ES Lander
F Yang
G Marais
G Santoyo
HM Arends
J Filee
J Foster
J Parkhill
L Klasson
M Chandler
M Rosenberg
M Wu
Mark A. Batzer
P Siguier
PC Weber
R Belshaw
R Cordaux
Richard Cordaux
S Leclercq
S Pichon
SG Andersson
SL Salzberg
T Wicker
TA Hall
TJ Treangen
WS Reznikoff
Z Nagy
Publication venue: Public Library of Science
Publication date: 01/01/2010

Transposable elements are widely distributed and diverse in both eukaryotes and prokaryotes, as exemplified by DNA transposons. As a result, they represent a considerable source of genomic variation, for example through ectopic (i.e. non-allelic homologous) recombination events between transposable element copies, resulting in genomic rearrangements. Ectopic recombination may also take place between homologous sequences located within transposable element sequences. DNA transposons are typically bounded by terminal inverted repeats (TIRs). Ectopic recombination between TIRs is expected to result in DNA transposon inversions. However, such inversions have barely been documented. In this study, we report natural inversions of the most common prokaryotic DNA transposons: insertion sequences (IS). We identified natural TIR-TIR recombination-mediated inversions in 9% of IS insertion loci investigated in Wolbachia bacteria, which suggests that recombination between IS TIRs may be a quite common, albeit largely overlooked, source of genomic diversity in bacteria. We suggest that inversions may impede IS survival and proliferation in the host genome by altering transpositional activity. They may also alter genomic instability by modulating the outcome of ectopic recombination events between IS copies in various orientations. This study represents the first report of TIR-TIR recombination within bacterial IS elements and it thereby uncovers a novel mechanism of structural variation for this class of prokaryotic transposable elements.
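
Why recombination between TIRs is expected to invert the element can be seen in a toy sequence model. The sketch below is only an illustration under simplifying assumptions: the TIRs are perfect inverted repeats, the inversion spans the whole element, and all sequences and function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA string over the alphabet ACGT."""
    return seq.translate(COMPLEMENT)[::-1]


def build_element(tir, interior):
    """Model an element as left TIR + interior + inverted copy of the TIR."""
    return tir + interior + revcomp(tir)


def invert(element):
    """An inversion replaces the segment by its reverse complement in place."""
    return revcomp(element)


tir = "GGCTA"                # hypothetical terminal inverted repeat
interior = "ATGCCCTTTAAA"    # hypothetical interior, e.g. a transposase gene
element = build_element(tir, interior)
inverted = invert(element)

print(element)
print(inverted)
```

With perfect TIRs, the inverted element keeps the same terminal sequences while its interior is flipped, so the event is invisible at the element boundaries and only detectable from the orientation of the interior relative to the flanking genome. This may help explain how such inversions could be overlooked.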

CiteSeerX

M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species

Author: A Darling
A Delcher
AE Darling
B Morgenstern
B Raphael
C Grasso
C Notredame
D Ferre
DA Nix
EP Rocha
I Ovcharenko
J Choudhuri
J Deogun
JD Thompson
K Katoh
K Liolos
K Rutherford
L Florea
L Wang
M Blanchette
M Brudno
M Hohl
M Margulies
M Waterman
N Bray
NT Perna
P Chain
RL Tatusov
S Batzoglou
S Schwartz
T Carver
Todd J Treangen
W Huang
Xavier Messeguer
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Due to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organism's DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in-depth studies of evolution in closely related species through multiple whole genome comparisons. RESULTS: To facilitate such comparisons, we present an interactive multiple genome comparison and alignment tool, M-GCAT, that can efficiently construct multiple genome comparison frameworks in closely related species. M-GCAT is able to compare and identify highly conserved regions in up to 20 closely related bacterial species in minutes on a standard computer, and as many as 90 (containing 75 cloned genomes from a set of 15 published enterobacterial genomes) in an hour. M-GCAT also incorporates a novel comparative genomics data visualization interface allowing the user to globally and locally examine and inspect the conserved regions and gene annotations. CONCLUSION: M-GCAT is an interactive comparative genomics tool well suited for quickly generating multiple genome comparison frameworks and alignments among closely related species. M-GCAT is freely available for download for academic and non-commercial use at:

Elsevier - Publisher Connector

A roadmap for gene system development in Clostridium

Author: Al-Hinai
Anne M. Henstra
Argyros
Boeke
Buffie
Carter
Cartman
Christopher M. Humphreys
Croux
Cui
Davis
de Kok
Dong
Dupuy
Ehsaan
Engler
Fagan
Feil
Fichot
Fungmin Liew
Gareth T. Little
Gibson
Hartman
Heap
Herbert
Humphreys
Jennert
Jonathan Baker
Katrin Schwarz
Kelly
Kolek
Koren
Kovacs
Kuehne
Köpke
Lesiak
Li
Lili Sheng
Meaney
Mermelstein
Michelle L. Kelly
Minton
Muhammad Ehsaan
Ng
Nigel P. Minton
Nolling
Oultram
Purdy
Pyne
Quail
Roberts
Sandoval
Scott
Stefka
Treangen
van Eijk
Wang
Warrens
Williams
Xu
Yang
Ying Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/10/2016

Clostridium species are both heroes and villains. Some cause serious human and animal diseases, those present in the microbiota contribute to health and wellbeing, while others represent useful industrial chassis for the production of chemicals and fuels. To understand, counter or exploit, there is a fundamental requirement for effective systems that may be used for directed or random genome modifications. We have formulated a simple roadmap whereby the necessary gene systems may be developed and deployed. At its heart is the use of 'pseudo-suicide' vectors and the creation of a pyrE mutant (a uracil auxotroph), initially aided by ClosTron technology, but ultimately made using a special form of allelic exchange termed ACE (Allele-Coupled Exchange). All mutants, regardless of the mutagen employed, are made in this host. This is because through the use of ACE vectors, mutants can be rapidly complemented concomitant with correction of the pyrE allele and restoration of uracil prototrophy. This avoids the phenotypic effects frequently observed with high copy number plasmids and dispenses with the need to add antibiotic to ensure plasmid retention. Once available, the pyrE host may be used to stably insert all manner of application-specific modules. Examples include a sigma factor to allow deployment of a mariner transposon, hydrolases involved in biomass deconstruction and therapeutic genes in cancer delivery vehicles. To date, provided DNA transfer is obtained, we have not encountered any clostridial species where this technology cannot be applied. These include Clostridium difficile, Clostridium acetobutylicum, Clostridium beijerinckii, Clostridium botulinum, Clostridium perfringens, Clostridium sporogenes, Clostridium pasteurianum, Clostridium ljungdahlii, Clostridium autoethanogenum and even Geobacillus thermoglucosidasius.

Nottingham ePrints

Nottingham eTheses

Repository@Nottingham

Public Library of Science (PLOS)

progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement

Multiple genome alignment remains a challenging problem. Effects of recombination including rearrangement, segmental duplication, gain, and loss can create a mosaic pattern of homology even among closely related organisms. We describe a new method to align two or more genomes that have undergone rearrangements due to recombination and substantial amounts of segmental gain and loss (flux). We demonstrate that the new method can accurately align regions conserved in some, but not all, of the genomes, an important case not handled by our previous work. The method uses a novel alignment objective score called a sum-of-pairs breakpoint score, which facilitates accurate detection of rearrangement breakpoints when genomes have unequal gene content. We also apply a probabilistic alignment filtering method to remove erroneous alignments of unrelated sequences, which are commonly observed in other genome alignment methods. We describe new metrics for quantifying genome alignment accuracy which measure the quality of rearrangement breakpoint predictions and indel predictions. The new genome alignment algorithm demonstrates high accuracy in situations where genomes have undergone biologically feasible amounts of genome rearrangement, segmental gain and loss. We apply the new algorithm to a set of 23 genomes from the genera Escherichia, Shigella, and Salmonella. Analysis of whole-genome multiple alignments allows us to extend the previously defined concepts of core- and pan-genomes to include not only annotated genes, but also non-coding regions with potential regulatory roles. The 23 enterobacteria have an estimated core-genome of 2.46Mbp conserved among all taxa and a pan-genome of 15.2Mbp. We document substantial population-level variability among these organisms driven by segmental gain and loss.
Interestingly, much variability lies in intergenic regions, suggesting that the Enterobacteriaceae may exhibit regulatory divergence. The multiple genome alignments generated by our software provide a platform for comparative genomic and population genomic studies. Free, open-source software implementing the described genome alignment approach is available from http://gel.ahabs.wisc.edu/mauve
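
The extension of core- and pan-genome sizes from gene counts to base pairs can be sketched from a list of alignment blocks. This is a simplified illustration, not progressiveMauve's procedure: it assumes each block is summarised by one length and the set of genomes it occurs in, and the block values and function name are hypothetical.

```python
def core_and_pan(blocks, n_genomes):
    """Return (core_bp, pan_bp) from (length_bp, genome_ids) alignment blocks.

    Core: bases in blocks shared by every genome.
    Pan: bases in all blocks, each homologous block counted once.
    """
    blocks = list(blocks)  # allow any iterable, since we scan it twice
    core = sum(length for length, members in blocks if len(members) == n_genomes)
    pan = sum(length for length, _ in blocks)
    return core, pan


# Toy blocks for three genomes A, B and C (hypothetical values).
blocks = [
    (1000, {"A", "B", "C"}),  # conserved in all genomes
    (400, {"A", "B"}),        # shared by a subset only
    (250, {"C"}),             # genome-specific segment
]
core_bp, pan_bp = core_and_pan(blocks, n_genomes=3)
print(core_bp, pan_bp)
```

Because the blocks come from a whole-genome alignment rather than a gene catalogue, intergenic segments contribute to both totals, which is the point of extending the two concepts beyond annotated genes.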

CiteSeerX

OPUS - University of Technology Sydney