Search CORE

HAL Clermont Université

Infoscience - École polytechnique fédérale de Lausanne

Drosophila P element: transposition, regulation and evolution

Author: Coen D.
Delattre M.
Higuet D.
Lehmann M.
Lemaitre B.
Montchamp C.
Nouaud D.
Quesneville H.
Ronsseray S.
Simonelig M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/09/2010
Field of study

High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development

Author: Aubourg S.
Becker C.
Bianco L.
Bucher E.
Celton J. M.
Choisne N.
Daccord N.
Di Pierro E. A.
Durel C. E.
Gaillard S.
Gouzy J.
Guérif P.
Jasper D.
Laurens F.
Lespinasse Y.
Linsmith Gareth
Micheletti D.
Muranty H.
Quesneville H.
Rees G.
Schijlen E.
Troggio M.
van de Geest H.
van de Weg E.
Velasco R.
Weigel D.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Using the latest sequencing and optical mapping technologies, we have produced a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. Repeat sequences, which represented over half of the assembly, provided an unprecedented opportunity to investigate the uncharacterized regions of a tree genome; we identified a new hyper-repetitive retrotransposon sequence that was over-represented in heterochromatic regions and estimated that a major burst of different transposable elements (TEs) occurred 21 million years ago. Notably, the timing of this TE burst coincided with the uplift of the Tian Shan mountains, which is thought to be the center of the location where the apple originated, suggesting that TEs and associated processes may have contributed to the diversification of the apple ancestor and possibly to its divergence from pear. Finally, genome-wide DNA methylation data suggest that epigenetic marks may contribute to agronomically relevant aspects, such as apple fruit development

HAL-UNICE

Archivio istituzionale della ricerca - Fondazione Edmund Mach

Linkage disequilibrium in young genetically isolated Dutch population

Author: A Collins
A Kong
A Wright
B Devlin
B Muller-Myhsok
C Zapata
D Fallin
D Zaykin
DE Reich
DJ Schaid
DJ Schaid
E Sobel
ES Lander
GR Abecasis
H Quesneville
JC Venter
KM Weiss
L Kruglyak
M Abney
M Boehnke
MD Teare
N Risch
P Zavattari
RC Lewontin
SK Service
T Varilo
YS Aulchenko
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2004
Field of study

The design and feasibility of genetic studies of complex diseases are critically dependent on the extent and distribution of linkage disequilibrium (LD) across the genome and between different populations. We have examined genomewide and region-specific LD in a young genetically isolated population identified in the Netherlands by genotyping approximately 800 Short Tandem Repeat markers distributed genomewide across 58 individuals. Several regions were an

Erasmus University Digital Repository

Correlation of LNCR rasiRNAs Expression with Heterochromatin Formation during Development of the Holocentric Insect Spodoptera frugiperda

Author: A Criniti
A Murakami
A Pelisson
A Verdel
AA Aravin
AC Chueh
B Czech
BY Lu
C Klattenhoff
CN Topp
D Fagegaltier
DM Carone
E d'Alençon
Emmanuelle d'Alençon
Emmanuelle Permal
ER Havecker
François Cousserans
G Jagadeeswaran
H Quesneville
Hadi Quesneville
HB Megosh
HH Kazazian Jr
HR Lee
J Brennecke
J Brennecke
J Brosius
JH Bergmann
K Saito
KA Senti
LH Wong
LS Gunawardane
M Gerbal
M Halic
M Mandrioli
M Mandrioli
M Wassenegger
MA Matzke
Michael Freitag
MS Klenov
N Rhind
Philippe Fournier
PY Chen
R Santoro
S Desset
S Houwing
S Kawaoka
S Shpiz
Slavica Stanojcic
Sylvie Gimenez
T Wicker
TA Farazi
TA Volpe
VV Vagin
Y Du
Y Kawamura
YJ Lu
Z Lippman
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Repeat-associated small interfering RNAs (rasiRNAs) are derived from various genomic repetitive elements and ensure genomic stability by silencing endogenous transposable elements. Here we describe a novel subset of 46 rasiRNAs named LNCR rasiRNAs due to their homology with one long non-coding RNA (LNCR) of Spodoptera frugiperda. LNCR operates as the intermediate of an unclassified transposable element (TE-LNCR). TE-LNCR is a very invasive transposable element, present in high copy numbers in the S. frugiperda genome. LNCR rasiRNAs are single-stranded RNAs without a prominent nucleotide motif, which are organized in two distinct, strand-specific clusters. The expression of LNCR and LNCR rasiRNAs is developmentally regulated. Formation of heterochromatin in the genomic region where three copies of the TE-LNCR are embedded was followed by chromatin immunoprecipitation (ChIP) and we observed this chromatin undergo dynamic changes during development. In summary, increased LNCR expression in certain developmental stages is followed by the appearance of a variety of LNCR rasiRNAs which appears to correlate with subsequent accumulation of a heterochromatic histone mark and silencing of the genomic region with TE-LNCR. These results support the notion that a repeat-associated small interfering RNA pathway is linked to heterochromatin formation and/or maintenance during development to establish repression of the TE-LNCR transposable element. This study provides insights into the rasiRNA silencing pathway and its role in the formation of fluctuating heterochromatin during the development of one holocentric organism

CiteSeerX

Public Library of Science (PLOS)

ProdInra

A high-quality sequence of Rosa chinensis to elucidate genome structure and ornamental traits

Author: A. Berard
A. Chastellier
C. Maliepaard
D. Lakhwani
D. Schulz
E. Bucher
E. Neu
E. Schijlen
F. Foucher
H. Quesneville
H. Van de Geest
I. Kirov
J. Clotault
J. De Riek
J. Jeauffre
K. Kawamura
K. Van Laere
L. Hamama
L. Hibrand-Saint Oyant
L. Leus
L. Voisine
M. Linde
M.C. Le Paslier
N. Choisne
N. Daccord
N.N. Zhou
P. Arens
P.M. Bourke
R. Bounon
R. Smulder
R. Voorrips
S. Aubourg
S. Balzergue
S. Gaillard
S. Sakr
T. Borm
T. Debener
T. Hesselink
T. Ruttink
T. Thouroude
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/01/2018
Field of study

Rose is the worlds most important ornamental plant with economic, cultural and symbolic value. Roses are cultivated worldwide and sold as garden roses, cut flowers and potted plants. Rose has a complex genome with high heterozygosity and various ploidy levels. Our objectives were (i) to develop the first high-quality reference genome sequence for the genus Rosa by sequencing a doubled haploid, combining long and short read sequencing, and anchoring to a high-density genetic map and (ii) to study the genome structure and the genetic basis of major ornamental traits. We produced a haploid rose line from R. chinensis "Old Blush" and generated the first rose genome sequence at the pseudo-molecule scale (512 Mbp with N50 of 3.4 Mb and L75 of 97). The sequence was validated using high-density diploid and tetraploid genetic maps. We delineated hallmark chromosomal features including the pericentromeric regions through annotation of TE families and positioned centromeric repeats using FISH. Genetic diversity was analysed by resequencing eight Rosa species. Combining genetic and genomic approaches, we identified potential genetic regulators of key ornamental traits, including prickle density and number of flower petals. A rose APETALA2 homologue is proposed to be the major regulator of petals number in rose. This reference sequence is an important resource for studying polyploidisation, meiosis and developmental processes as we demonstrated for flower and prickle development. This reference sequence will also accelerate breeding through the development of molecular markers linked to traits, the identification of the genes underlying them and the exploitation of synteny across Rosaceae

Novel transposable elements from Anopheles gambiae

Abstract Background Transposable elements (TEs) are DNA sequences, present in the genome of most eukaryotic organisms that hold the key characteristic of being able to mobilize and increase their copy number within chromosomes. These elements are important for eukaryotic genome structure and evolution and lately have been considered as potential drivers for introducing transgenes into pathogen-transmitting insects as a means to control vector-borne diseases. The aim of this work was to catalog the diversity and abundance of TEs within the <it>Anopheles gambiae </it>genome using the PILER tool and to consolidate a database in the form of a hyperlinked spreadsheet containing detailed and readily available information about the TEs present in the genome of <it>An. gambiae</it>. Results Here we present the spreadsheet named AnoTExcel that constitutes a database with detailed information on most of the repetitive elements present in the genome of the mosquito. Despite previous work on this topic, our approach permitted the identification and characterization both of previously described and novel TEs that are further described in detailed. Conclusions Identification and characterization of TEs in a given genome is important as a way to understand the diversity and evolution of the whole set of TEs present in a given species. This work contributes to a better understanding of the landscape of TEs present in the mosquito genome. It also presents a novel platform for the identification, analysis, and characterization of TEs on sequenced genomes.</p

Springer - Publisher Connector

Context-driven discovery of gene cassettes in mobile integrons using a computational grammar

Author: A Moura
ACE Darling
AL Delcher
AL Delcher
CJ van Rijsbergen
D Frishman
DA Rowe-Magnus
DB Searls
E Rivas
Enrico Coiera
F Baquero
F Meyer
F Meyer
Guy Tsafnat
H Quesneville
HW Stokes
HW Stokes
IT Paulsen
J Fleiss
J Landis
Jaron Schaeffer
Jon R Iredell
K Rutherford
L Stein
M Ashburner
M Kanehisa
MA Andrade
MJ Joss
R Overbeek
RM Hall
RS Levings
S Ji
S Leung
Sally R Partridge
SF Altschul
SR Partridge
U Bohnebeck
WR Pearson
Y Boucher
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Gene discovery algorithms typically examine sequence data for low level patterns. A novel method to computationally discover higher order DNA structures is presented, using a context sensitive grammar. The algorithm was applied to the discovery of gene cassettes associated with integrons. The discovery and annotation of antibiotic resistance genes in such cassettes is essential for effective monitoring of antibiotic resistance patterns and formulation of public health antibiotic prescription policies. Results We discovered two new putative gene cassettes using the method, from 276 integron features and 978 GenBank sequences. The system achieved <it>κ </it>= 0.972 annotation agreement with an expert gold standard of 300 sequences. In rediscovery experiments, we deleted 789,196 cassette instances over 2030 experiments and correctly relabelled 85.6% (<it>α </it>≥ 95%, <it>E </it>≤ 1%, mean sensitivity = 0.86, specificity = 1, F-score = 0.93), with no false positives. Error analysis demonstrated that for 72,338 missed deletions, two adjacent deleted cassettes were labeled as a single cassette, increasing performance to 94.8% (mean sensitivity = 0.92, specificity = 1, F-score = 0.96). Conclusion Using grammars we were able to represent heuristic background knowledge about large and complex structures in DNA. Importantly, we were also able to use the context embedded in the model to discover new putative antibiotic resistance gene cassettes. The method is complementary to existing automatic annotation systems which operate at the sequence level.</p

Springer - Publisher Connector

Macquarie University ResearchOnline

Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

Author: A Nekrutenko
A. P. Jason de Koning
AFA Smit
AL Price
AR Quinlan
C Feschotte
DA Ray
David D. Pollock
E Lerat
EE Eichler
EF Kirkness
G Achaz
G Benson
G Lunter
Gregory P. Copenhaver
H Quesneville
HH Kazazian Jr
J Brosius
J Jurka
J Jurka
J Jurka
JS Mattick
JU Pontius
K Lindblad-Toh
M Pheasant
MA Batzer
Mark A. Batzer
MC Frith
R Li
RC Edgar
RM Kuhn
S Karlin
S Kurtz
SF Altschul
TA Castoe
Todd A. Castoe
TS Mikkelsen
W Gu
Wanjun Gu
WC Warren
Z Bao
Publication venue: Public Library of Science
Publication date: 01/12/2011
Field of study

Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed

Public Library of Science (PLOS)