Search CORE

20 research outputs found

Catastrophic chromosomal restructuring during genome elimination in plants.

Author: Bradnam Keith R
Chan Simon Wl
Comai Luca
Henry Isabelle M
Korf Ian
Lysak Martin A
Mandakova Terezie
Marimuthu Mohan Pa
Ravi Maruthachalam
Tan Ek Han
Publication venue: eScholarship, University of California
Publication date: 01/05/2015
Field of study

Genome instability is associated with mitotic errors and cancer. This phenomenon can lead to deleterious rearrangements, but also genetic novelty, and many questions regarding its genesis, fate and evolutionary role remain unanswered. Here, we describe extreme chromosomal restructuring during genome elimination, a process resulting from hybridization of Arabidopsis plants expressing different centromere histones H3. Shattered chromosomes are formed from the genome of the haploid inducer, consistent with genomic catastrophes affecting a single, laggard chromosome compartmentalized within a micronucleus. Analysis of breakpoint junctions implicates breaks followed by repair through non-homologous end joining (NHEJ) or stalled fork repair. Furthermore, mutation of required NHEJ factor DNA Ligase 4 results in enhanced haploid recovery. Lastly, heritability and stability of a rearranged chromosome suggest a potential for enduring genomic novelty. These findings provide a tractable, natural system towards investigating the causes and mechanisms of complex genomic rearrangements similar to those associated with several human disorders

PubMed Central

eScholarship - University of California

Comparative Analysis of Tandem Repeats from Hundreds of Species Reveals Unique Insights into Centromere Evolution

Author: Bradnam Keith R.
Chan Simon W. -L.
DeRisi Joseph L.
Eid John
Garcia José Fernando
Korf Ian F.
May Michael R.
Melters Daniël P.
Peluso Paul
Rank David
Ross-Ibarra Jeffrey
Ruby J. Graham
Sebra Robert
Smith Timothy
Telis Natalie
Tobias Christian
Young Hugh A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 22/09/2012
Field of study

Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. The assumption that the most abundant tandem repeat is the centromere DNA was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and in length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond ~50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution, including the appearance of higher order repeat structures in which several polymorphic monomers make up a larger repeating unit. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animals and plants. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes

arXiv.org e-Print Archive

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Longer First Introns Are a General Property of Eukaryotic Gene Structure

Author: A Rogers
A Sakurai
AB Rose
AB Rose
AB Rose
AE Vinogradov
Alan Christoffels
BY Chung
CB Russell
D Mascarenhas
D Swarbreck
DA Benson
DJ Gaffney
DL Halligan
E Gazave
EV Kriventseva
G Marais
H Akashi
H Nielsen
HK Stenoien
Ian Korf
J Majewski
J Spieth
JC Venter
JD Hawkins
JJ Jonsson
JS Jeon
JV Chamary
K Lin
Keith R. Bradnam
KR Kalari
L Collins
L Duret
L Morello
M Deutsch
M Stanke
MG Reese
MW Smith
PD Keightley
RD Palmiter
RJ Wilson
S Levy
SH Ho
SW Li
T Blumenthal
W Gilbert
X Hong
XY Ren
Y Chen
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

While many properties of eukaryotic gene structure are well characterized, differences in the form and function of introns that occur at different positions within a transcript are less well understood. In particular, the dynamics of intron length variation with respect to intron position has received relatively little attention. This study analyzes all available data on intron lengths in GenBank and finds a significant trend of increased length in first introns throughout a wide range of species. This trend was found to be even stronger when using high-confidence gene annotation data for three model organisms (Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster) which show that the first intron in the 5′ UTR is - on average - significantly longer than all downstream introns within a gene. A partial explanation for increased first intron length in A. thaliana is suggested by the increased frequency of certain motifs that are present in first introns. The phenomenon of longer first introns can potentially be used to improve gene prediction software and also to detect errors in existing gene annotations

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species

Author: \uc9l\ue9nie Godzaridis
Adam M. Phillippy
Alexey Sergushichev
Anton Alexandrov
Benedict Paten
Binghang Liu
Bruno M. Vieira
Carson Qu
Daniel S. Rokhsar
Dariusz Przybylski
David B. Jaffe
David C. Schwartz
David Haussler
DEL FABBRO Cristian
Delphine Naquin
Dent Earl
Dominique Lavenier
Erich D. Jarvis
Fedor Tsarev
Filipe J. Ribeiro
Fran\ue7ois Laviolette
Francisco Pina Martins
Ganeshkumar Ganapathy
Giles Hall
Guillaume Chapuis
Guojie Zhang
Hamidreza Chitsaz
Hao Zhang
Henry Song
Huaiyang Jiang
Iain Maccallum
Ian F. Korf
Inan\ue7 Birol
Isaac Y. Ho
J. Ruby
Jacob O. Kitzman
Jacques Corbeil
James R. Knight
Jared T. Simpson
Jarrod A. Chapman
Jason Howard
Jay Shendure
Jianying Yuan
Joseph B. Hiatt
Joseph N. Fass
Jun Wang
Keith R. Bradnam
Kim C. Worley
Martin Hunt
Matthew D. Macmanes
Matthias Haimel
Michael C. Schatz
Michael Bechner
Michael Place
Nicolas Maillet
Nuno A. Fonseca
Oct\ue1vio S. Paulo
Paul J. Kersey
Paul Baranay
Pavel Fedotov
Rayan Chikhi
Richard A. Gibbs
Richard Durbin
Ruibang Luo
S\ue9bastien Boisvert
Sante Gnerre
Scalabrin Simone
Scott Emrich
Sergey Kazakov
Sergey Koren
Sergey Melnikov
Shaun D. Jackman
Shiguo Zhou
Shuangye Yin
Siu Ming Yiu
Stephen Richards
Steve Goldstein
T. Docking
Tak Wah Lam
Ted Sharpe
Thomas D. Otto
Timothy I. Shaw
Vezzi Francesco
Vicedomini Riccardo
Wen Chi Chou
Xiang Qin
Yingrui Li
Yue Liu
Yujian Shi
Zemin Ning
Zhenyu Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions: Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another

Archivio istituzionale della ricerca - Università degli Studi di Udine

Longer first introns are a general property of eukaryotic gene structure.

Author: Bradnam Keith R,
Publication venue
Publication date: 19/05/2020
Field of study

Ezid

Recommended from our members

Longer first introns are a general property of eukaryotic gene structure.

Author: Bradnam Keith R
Korf Ian
Publication venue: eScholarship, University of California
Publication date: 01/08/2008
Field of study

While many properties of eukaryotic gene structure are well characterized, differences in the form and function of introns that occur at different positions within a transcript are less well understood. In particular, the dynamics of intron length variation with respect to intron position has received relatively little attention. This study analyzes all available data on intron lengths in GenBank and finds a significant trend of increased length in first introns throughout a wide range of species. This trend was found to be even stronger when using high-confidence gene annotation data for three model organisms (Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster) which show that the first intron in the 5' UTR is--on average--significantly longer than all downstream introns within a gene. A partial explanation for increased first intron length in A. thaliana is suggested by the increased frequency of certain motifs that are present in first introns. The phenomenon of longer first introns can potentially be used to improve gene prediction software and also to detect errors in existing gene annotations

eScholarship - University of California

First introns are the longest introns in most species.

Author: Ian Korf (3780)
Keith R. Bradnam (80686)
Publication venue
Publication date
Field of study

Results shown for all species in GenBank release 164 which have at least 500 CDSs that specify multiple introns. Z-tests were used to determine significance and color denotes level of significance (see legend, N.S. = not significant).</p

The Francis Crick Institute

Incorrect C. elegans gene annotation determined by inspection of intron lengths.

Author: Ian Korf (3780)
Keith R. Bradnam (80686)
Publication venue
Publication date
Field of study

This gene prediction contained an incorrect in-frame intron sequence in the first exon. Transcript evidence, homology evidence from C. briggsae, and an alternative gene prediction (Twinscan) suggested that the first intron is an annotation error. Image taken from Genome Browser display of WormBase release WS180 (<a href="http://ws180.wormbase.org" target="_blank">http://ws180.wormbase.org</a>).</p

The Francis Crick Institute

Intron size variation for selected species with different numbers of introns.

Author: Ian Korf (3780)
Keith R. Bradnam (80686)
Publication venue
Publication date
Field of study

Intron lengths are shown for species with CDSs that contain 4, 6, 7 or 9 introns (in D. melanogaster, A. thaliana, C. elegans, and H. sapiens respectively). Bars on graph show standard error of the mean. Numbers of CDSs used for each species are shown.</p

The Francis Crick Institute

Intron length variation in three model organisms.

Author: Ian Korf (3780)
Keith R. Bradnam (80686)
Publication venue
Publication date
Field of study

Mean intron length is calculated for the first intron in the 5′ UTR (position −1, in blue) and for the first eight introns of the coding sequence (in red) for three named species. Error bars indicate standard error of the mean. Bottom right panel shows the occurrence of a potential IME motif (pictured) in A. thaliana introns. %Motif density is calculated by concatenating together all introns in each category, and then calculating what fraction of the total sequence is occupied by the motif.</p

The Francis Crick Institute