64 research outputs found

Genome of Mycoplasma haemofelis, unraveling its strategies for survival and persistence

Author: do Nascimento Naíla C
Guimaraes Ana MS
Martin Samuel W
Messick Joanne B
SanMiguel Phillip J
Santos Andrea P
Publication venue: BioMed Central
Publication date: 01/01/2011

Mycoplasma haemofelis is a mycoplasmal pathogen (hemoplasma) that attaches to the host's erythrocytes. Distributed worldwide, it has a significant impact on the health of cats, causing acute disease and, despite treatment, establishing chronic infection. It might also have a role as a zoonotic agent, especially in immunocompromised patients. Whole genome sequencing and analysis of M. haemofelis strain Ohio2 were undertaken as a step toward understanding its survival and persistence. Metabolic pathways are reduced, and the organism relies on the host to supply many of the nutrients and metabolites needed for survival. M. haemofelis must import glucose for ATP generation and ribose derivatives for RNA/DNA synthesis. Hypoxanthine, adenine, guanine, uracil and CMP are scavenged from the environment to support purine and pyrimidine synthesis. In addition, nicotinamide, amino acids and any vitamins needed for growth must be acquired from its environment. The core proteome of M. haemofelis contains an abundance of paralogous gene families, corresponding to 70.6% of all the CDSs. This "paralog pool" is a rich source of different antigenic epitopes that can be varied to elude the host's immune system and establish chronic infection. M. haemofelis also appears to be capable of phase variation, which is particularly relevant to the cyclic bacteremia and persistence that are characteristic of the infection in the cat. The data generated herein should be of great use for understanding the mechanisms of M. haemofelis infection. Further, they will provide new insights into its pathogenicity and clues needed to formulate media to support the in vitro cultivation of M. haemofelis.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

Gene content and distribution in the nuclear genome of Fragaria vesca

Author: Bennetzen Jeffrey L.
Davis Thomas M.
Folta Kevin M.
Pontaroli Ana Clara
Qian Zhang
Rogers Rebekah L.
SanMiguel Phillip
Shields Melanie E.
Publication venue: 'Crop Science Society of America'
Publication date: 01/03/2009

Thirty fosmids were randomly selected from a library of Fragaria vesca subsp. americana (cv. Pawtuckaway) DNA. These fosmid clones were individually sheared, and ∼4- to 5-kb fragments were subcloned. Subclones on a single 384-well plate were sequenced bidirectionally for each fosmid. Assembly of these data yielded 12 fosmid inserts completely sequenced, 14 inserts as 2 to 3 contiguous sequences (contigs), and 4 inserts with 5 to 9 contigs. In most cases, a single unambiguous contig order and orientation was determined, so no further finishing was required to identify genes and their relative arrangement. One hundred fifty-eight genes were identified in the ∼1.0 Mb of nuclear genomic DNA that was assembled. Because these fosmids were randomly chosen, this allowed prediction of the genetic content of the entire ∼200 Mb F. vesca genome as about 30,500 protein-encoding genes, plus >4700 truncated gene fragments. The genes are mostly arranged in gene-rich regions, to a variable degree intermixed with transposable elements (TEs). The most abundant TEs in F. vesca were found to be long terminal repeat (LTR) retrotransposons, and these comprised about 13% of the DNA analyzed. Over 30 new repeat families were discovered, mostly TEs, and the total TE content of F. vesca is predicted to be at least 16%.
EEA Balcarce
Affiliation: Pontaroli, Ana Clara. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Balcarce; Argentina. University of Georgia. Department of Genetics; United States
Affiliation: Rogers, Rebekah L. Harvard University. Department of Organismic and Evolutionary Biology; United States. University of Georgia. Department of Genetics; United States
Affiliation: Qian, Zhang. University of New Hampshire. Department of Biological Sciences; United States
Affiliation: Shields, Melanie E. University of New Hampshire. Department of Biological Sciences; United States
Affiliation: Davis, Thomas M. University of New Hampshire. Department of Biological Sciences; United States
Affiliation: Folta, Kevin M. University of Florida. Horticultural Sciences Department; United States
Affiliation: SanMiguel, Phillip. Purdue University. Department of Horticulture and Landscape Architecture; United States
Affiliation: Bennetzen, Jeffrey L. University of Georgia. Department of Genetics; United States
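The genome-wide gene estimate in this abstract rests on a simple density extrapolation from a random sample. A minimal sketch of that arithmetic follows, assuming only the rounded figures quoted in the abstract (158 genes, ∼1.0 Mb sampled, ∼200 Mb genome); the function name is hypothetical and this is not the authors' actual procedure.

```python
def extrapolate_gene_count(genes_in_sample: int, sample_mb: float, genome_mb: float) -> float:
    """Scale the gene count seen in a random genomic sample up to the whole genome.

    Assumes the sample is representative, i.e. gene density is uniform on average.
    """
    if sample_mb <= 0:
        raise ValueError("sample_mb must be positive")
    genes_per_mb = genes_in_sample / sample_mb
    return genes_per_mb * genome_mb


# Rounded inputs taken from the abstract; the exact sampled length is not given there.
estimate = extrapolate_gene_count(genes_in_sample=158, sample_mb=1.0, genome_mb=200.0)
print(f"Estimated genes in genome: {estimate:.0f}")
```

With these rounded inputs the result is roughly 31,600, close to but not identical to the reported figure of about 30,500; the difference is presumably due to the approximate (∼) sizes quoted in the abstract.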

Directory of Open Access Journals

Repositorio Institucional – Biblioteca Digital

Exceptional lability of a genomic complex in rice and its close relatives revealed by interspecific and intraspecific comparison and population analysis

Author: Jackson Scott A
Lin Feng
Ma Jianxin
McCouch Susan R
SanMiguel Phillip J
Tian Zhixi
Wing Rod A
Yu Yanjun
Yu Yeisoo
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Extensive DNA rearrangement of genic colinearity, as revealed by comparison of orthologous genomic regions, has been shown to be a general concept describing evolutionary dynamics of plant genomes. However, the nature, timing, lineages and adaptation of local genomic rearrangement in closely related species (e.g., within a genus) and haplotype variation of genomic rearrangement within populations have not been well documented.
Results: We previously identified a hotspot for genic rearrangement and transposon accumulation in the Orp region of Asian rice (Oryza sativa, AA) by comparison with its orthologous region in sorghum. Here, we report the comparative analysis of this region with its orthologous regions in the wild progenitor species (O. nivara, AA) of Asian rice and African rice (O. glaberrima) using the BB genome Oryza species (O. punctata) as an outgroup, and investigation of transposon insertion sites and a segmental inversion event in the AA genomes at the population level. We found that the Orp region was primarily and recently expanded in the Asian rice species O. sativa and O. nivara. LTR-retrotransposons shared by the three AA-genomic regions have been fixed in all the 94 varieties that represent different populations of the AA-genome species/subspecies, indicating their adaptive role in genome differentiation. However, LTR-retrotransposons unique to either O. nivara or O. sativa regions exhibited dramatic haplotype variation regarding their presence or absence between or within populations/subpopulations.
Conclusions: The LTR-retrotransposon insertion hotspot in the Orp region was formed recently, independently and concurrently in different AA-genome species, and the genic rearrangements detected in different species appear to be differentially triggered by transposable elements. This region is located near the end of the short arm of chromosome 8 and contains a high proportion of LTR-retrotransposons similar to that observed in the centromeric region of this same chromosome, and thus may represent a genomic region that has recently switched from euchromatic to heterochromatic states. The haplotype variation of LTR-retrotransposon insertions within this region reveals substantial admixture among various subpopulations as established by molecular markers at the whole genome level, and can be used to develop retrotransposon junction markers for simple and rapid classification of O. sativa germplasm.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Arizona

Purdue E-Pubs

University of Queensland eSpace

An examination of targeted gene neighborhoods in strawberry

Author: Bennetzen Jeffrey L
Davis Thomas M
Folta Kevin M
Pontaroli Ana C
SanMiguel Phillip
Shields Melanie E
Tombolato-Terzić Denise
Wang Hao
Yao Qin
Zhang Qian
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Strawberry (Fragaria spp.) is the familiar name of a group of economically important crop plants and wild relatives that also represent an emerging system for the study of gene and genome evolution. Its small stature, rapid seed-to-seed cycle, transformability and minuscule basic genome make strawberry an attractive system to study processes related to plant physiology, development and crop production; yet it lacks substantial genomics-level resources. This report addresses this deficiency by characterizing 0.71 Mbp of gene space from a diploid species (F. vesca). The twenty large genomic tracks (30-52 kb) captured as fosmid inserts comprise gene regions with roles in flowering, disease resistance, and metabolism.
Results: A detailed description of the studied regions reveals 131 Blastx-supported gene sites and eight additional EST-supported gene sites. Only 15 genes have complete EST coverage, enabling gene modelling, while 76 lack EST support. Instances of microcolinearity with Arabidopsis thaliana were identified in twelve inserts. A relatively high portion (25%) of targeted genes were found in unanticipated tandem duplications. The effectiveness of six FGENESH training models was assessed via comparisons among ab initio predictions and homology-based gene and start/stop codon identifications. Fourteen transposable-element-related sequences and 158 simple sequence repeat loci were delineated.
Conclusions: This report details the structure and content of targeted regions of the strawberry genome. The data indicate that the strawberry genome is gene-dense, with an average of one protein-encoding gene or pseudogene per 5.9 kb. Current overall EST coverage is sparse. The unexpected gene duplications and their differential patterns of EST support suggest possible subfunctionalization or pseudogenization of these sequences. This report provides a high-resolution depiction of targeted gene neighborhoods that will aid whole-genome sequence assembly, provide valuable tools for plant breeders and advance the understanding of strawberry genome evolution.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

UNH Scholars' Repository

The TIGR Maize Database

Author: Barbazuk William Brad
Bennetzen Jeffrey
Chan Agnes P.
Cheung Foo
Lee Dan
Pertea Geo
Pontaroli Ana C.
Quackenbush John
Rabinowicz Pablo D.
SanMiguel Phillip
Whitelaw Cathy
Yuan Yinan
Zheng Li
Publication venue: Oxford University Press
Publication date: 28/12/2005

Maize is a staple crop of the grass family and also an excellent model for plant genetics. Owing to the large size and repetitiveness of its genome, we previously investigated two approaches to accelerate gene discovery and genome analysis in maize: methylation filtration and high C(0)t selection. These techniques allow the construction of gene-enriched genomic libraries by minimizing repeat sequences due to either their methylation status or their copy number, yielding a 7-fold enrichment in genic sequences relative to a random genomic library. Approximately 900 000 gene-enriched reads from maize were generated and clustered into Assembled Zea mays (AZM) sequences. Here we report the current AZM release, which consists of ∼298 Mb representing 243 807 sequence assemblies and singletons. In order to provide a repository of publicly available maize genomic sequences, we have created the TIGR Maize Database. In this resource, we have assembled and annotated the AZMs and used available sequenced markers to anchor AZMs to maize chromosomes. We have constructed a maize repeat database and generated draft sequence assemblies of 287 maize bacterial artificial chromosome (BAC) clone sequences, which we annotated along with 172 additional publicly available BAC clones. All sequences, assemblies and annotations are available at the project website via web interfaces and FTP downloads.

Crossref

PubMed Central

Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

Author: Ammiraju Jetty SS
Bennetzen Jeffrey L
Bharti Arvind K
Collura Kristi
Estep Matt
Estill James
He Ruifeng
Kim HyeRan
Kudrna David
Luo Meizhong
Ma Jianxin
Messing Joachim
Nelson William
SanMiguel Phillip
Sisneros Nicholas
Soderlund Carol
Talag Jayson
Wing Rod A
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends.
Results: A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary.
Conclusion: MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Arizona

Purdue E-Pubs

University of Queensland eSpace

Construction, alignment and analysis of twelve framework physical maps that represent the ten genome types of the genus Oryza

Author: Braidotti Michele
Collura Kristi
Gill Navdeep
Goicoechea José Luis
Hurwitz Bonnie
Jackson Scott A
Kim HyeRan
Kudrna David
Maher Christopher
Mullikin James C
Nelson William
SanMiguel Phillip
Soderlund Carol
Stein Lincoln
Ware Doreen
Wing Rod A
Wissotski Marina
Yu Yeisoo
Publication venue: BioMed Central
Publication date: 01/01/2008

Bacterial artificial chromosome (BAC) fingerprint and end-sequenced physical maps representing the ten genome types of Oryza are presented.

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Springer - Publisher Connector

PubMed Central

The University of Arizona

NSU Works

University of Queensland eSpace

Exceptional Diversity, Non-Random Distribution, and Rapid Evolution of Retroelements in the B73 Maize Genome

Author: AFA Smit
Ansuya Jogi
AP Tikhonov
B McClintock
B Piegu
BA Kronmiller
BC Meyers
BJM Zonneveld
C Vitte
Cristian Chaparro
DA Kramerov
DC Howell
DE Berg
EM McCarthy
H Chou
HA Schmidt
Harmit S. Malik
HH Fu
J Felsenstein
James C. Estill
JC Estill
JD Thompson
Jean-Marc Deragon
Jeffrey L. Bennetzen
JL Bennetzen
JM Deragon
JS Hawkins
JX Ma
K Fengler
KJ Edwards
KM Devos
M Umeda
M Yamazaki
MA Grandbastien
MA Johns
MJ Varagona
MV Mendiola
N Galtier
Naadira Upshaw
O Jaillon
P SanMiguel
Phillip J. SanMiguel
PJ SanMiguel
PL Deininger
PRJ Leeton
PS Schnable
RC Edgar
Regina S. Baucom
Richard P. Westerman
RS Baucom
RY Liu
S Brunner
S Tsukahara
SB Hedges
SR Wessler
T Wicker
TE Bureau
VV Kapitonov
W Gilbert
W Wang
XW Gai
Y Yasui
Y Yoshioka
YK Jin
Z Yang
Publication venue: Public Library of Science
Publication date: 01/11/2009

Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and ∼35,000 copies, respectively, or a combined ∼1% of the B73 nuclear genome. With regard to fully intact elements, the median copy number for all retroelement families in maize was 2, because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to differentially occupy and exploit this genomic diversity.
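The abstract's observation that the median intact copy number is 2 even though some families are very abundant can be illustrated with a toy calculation. The sketch below uses invented per-family counts (hypothetical, not data from the study), chosen only so that more than half of 400 families have one or two intact members, as the abstract describes.

```python
from statistics import mean, median

# Hypothetical intact-copy counts per LTR retrotransposon family.
# These numbers are illustrative only and are NOT taken from the study:
# 260 of 400 families have one or two intact members, the rest have more.
intact_copies = [1] * 120 + [2] * 140 + [10] * 100 + [500] * 40


def summarize(counts):
    """Return (median, mean) of the per-family intact copy numbers."""
    return median(counts), mean(counts)


med, avg = summarize(intact_copies)
print(f"families: {len(intact_copies)}, median: {med}, mean: {avg}")
```

Because most families sit at one or two copies, the median stays at 2 while the mean is pulled far upward by a few high-copy families, which is why the median is the more informative summary for a distribution this skewed.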

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

Complete Genome Sequence of Mycoplasma suis and Insights into Its Biology and Adaption to an Erythrocyte Niche

Author: A Brownback
A D'Alessandro
A Haldimann
A Iddar
AL Delcher
AM Guimaraes
Ana M. S. Guimaraes
Andrea P. Santos
AS Juncker
AW Murray
C Citti
C Yuan
CJ Chang
CL Yuan
CM Cordova
CM Fraser
CV Bizarro
DA Boyd
DG Edward
DR Brown
DY Arutyunov
E Yus
EE Wanker
EJ Splitter
Elankumaran Subbiah
G Ben-Menachem
G Furness
GM Zinn
GW Clark
H Neimark
I Chambaud
J Lewthwaite
J Pei
J Renaudin
JB Messick
JC Dunning Hotopp
JD Bendtsen
JD Jaffe
JD Pollack
JF Zachary
Joanne B. Messick
Jorge Timenetsky
JP Henderson
JW Moulder
K Caspersen
K Dybvig
K Groebel
K Heinritzi
K Oshima
KM Abdullah
KM Felder
L Papazisi
LE Hoelzle
LM Berent
M Mayer
M Ritzmann
MB Nicolás
MJ Downie
O Rahman
PC Hu
Phillip SanMiguel
R Chopra-Dewasthaly
RD Oberst
RN McElhaney
S Commans
S Razin
S Rottem
SC Henry
SW Lee
T Hsu
TA Baker
Thomas Walter
TM Lowe
V Schilling
W Saurin
W Viratyosin
X Zheng
Y Maede
Y Rikihisa
Y Sasaki
Publication venue: Public Library of Science
Publication date: 01/01/2011

Mycoplasma suis, the causative agent of porcine infectious anemia, has never been cultured in vitro and the mechanisms by which it causes disease are poorly understood. Thus, the objective herein was to use whole genome sequencing and analysis of M. suis to define pathogenicity mechanisms and biochemical pathways. M. suis was harvested from the blood of an experimentally infected pig. Following DNA extraction and construction of a paired end library, whole-genome sequencing was performed using GS-FLX (454) and Titanium chemistry. Reads on paired-end constructs were assembled using GS De Novo Assembler and gaps closed by primer walking; assembly was validated by PFGE. Glimmer and Manatee Annotation Engine were used to predict and annotate protein-coding sequences (CDS). The M. suis genome consists of a single, 742,431 bp chromosome with low G+C content of 31.1%. A total of 844 CDS, 3 single-copy, unlinked rRNA genes and 32 tRNAs were identified. Gene homologies and the GC skew graph show that M. suis has a typical Mollicutes oriC. The predicted metabolic pathway is concise, showing evidence of adaptation to the blood environment. M. suis is a glycolytic species, obtaining energy through sugar fermentation and ATP synthase. The pentose-phosphate pathway, metabolism of cofactors and vitamins, pyruvate dehydrogenase and NAD+ kinase are missing. Thus, ribose, NADH, NADPH and coenzyme A are possibly essential for its growth. M. suis can generate purines from hypoxanthine, which is secreted by RBCs, and cytidine nucleotides from uracil. Toxin orthologs were not identified. We suggest that M. suis may cause disease by scavenging and competing for host nutrients, leading to a decreased life span of RBCs. In summary, genome analysis shows that M. suis is dependent on host cell metabolism and this characteristic is likely to be linked to its pathogenicity. The prediction of essential nutrients will aid the development of in vitro cultivation systems.
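The G+C content and GC skew mentioned in this abstract are simple base-composition statistics. The following is a minimal sketch of how they are conventionally computed, with GC skew taken as (G − C)/(G + C) per window; the function names, window scheme and toy sequences are assumptions for illustration and do not reproduce the tools used in the study.

```python
from typing import List


def gc_content(seq: str) -> float:
    """Fraction of A/C/G/T bases that are G or C (ambiguous bases are ignored)."""
    seq = seq.upper()
    g, c = seq.count("G"), seq.count("C")
    total = g + c + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (g + c) / total


def gc_skew(seq: str, window: int) -> List[float]:
    """GC skew, (G - C) / (G + C), in consecutive non-overlapping windows.

    Windows without any G or C get a skew of 0.0; a trailing partial window is dropped.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    seq = seq.upper()
    skews = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        g, c = chunk.count("G"), chunk.count("C")
        skews.append((g - c) / (g + c) if (g + c) else 0.0)
    return skews


# Toy sequences for illustration only, not M. suis data.
print(gc_content("ATGCATAT"))
print(gc_skew("GGGCATCC", window=4))
```

On a real bacterial chromosome the windowed skew is usually plotted along the genome, and a change of sign is commonly used as a hint to the location of the replication origin; that interpretation step is not shown here.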

Public Library of Science (PLOS)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

PubMed Central

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Purdue E-Pubs

Universidade de São Paulo

Detailed Analysis of a Contiguous 22-Mb Region of the Maize Genome

Author: A Esen
A Kalyanaraman
A Smit
AA Salamov
AH Paterson
Ananth Kalyanaraman
Angelina Angelova
AP Tikhonov
Apurva Narechania
B Gaut
B McClintock
B Meyers
BA Kronmiller
Blake C. Meyers
C Liang
C Soderlund
CA Whitelaw
Catrina Fronick
Cheng-Ting Yeh
Chengzhi Liang
Cm Vitte
Cristian Chaparro
D Austin
D Bubeck
D Lisch
David C. Schwartz
David Kudrna
Dawn H. Nagel
DN Duvick
Doreen Ware
E Allen
E Kellogg
EM McCarthy
Emanuele De Paoli
F Liu
F Wei
Fusheng Wei
G Haberer
G Zabala
Gabriel Scara
H Fu
H Yao
HB Mann
HS Malik
HyeRan Kim
I Goldman
J Besemer
J Lai
J Ma
J Messing
JD Thompson
JE Stajich
Jean-Marc Deragon
Jeffrey L. Bennetzen
Jennifer Currie
Jianwei Zhang
Jinke Lin
JL Bennetzen
JN Volff
Joseph R. Ecker
Joshua C. Stein
K Ilic
K Lahners
K Nobuta
K Vandepoele
Kai Ying
KJ Edwards
KM Devos
Kristi Collura
L Veldboom
L Yang
L Zhang
Laura Courtney
Lifang Zhang
Lixing Yang
Lori Spiegel
Lucinda A. Fulton
Lydia Nascimento
M Alleman
M Bohn
M Chen
M Gale
M Kimura
M Morgante
M Spannagl
MA Gore
Marina Wissotski
Melissa Kramer
MR Woodhouse
N Alexandrov
N Jiang
N Rostoks
N Springer
Ning Jiang
P Byrne
P SanMiguel
Pamela J. Green
Patrick S. Schnable
Phillip San Miguel
PS Schnable
Q Li
Q Zhou
R Bruggmann
R Liu
RA Martienssen
RD Finn
Regina S. Baucom
Richard K. Wilson
RK Slotkin
Robert A. Martienssen
Robert S. Fulton
Rod A. Wing
RS Baucom
S Ahn
S Kurtz
S Liu
S Ouyang
S Schwartz
S Takahashi
S Zhou
Sandra W. Clifton
Scott Kruchowski
SE Lewis
SH Hulbert
Shiguo Zhou
Shiran Pasternak
Srinivas Aluru
Stephanie Adams
Susan M. Rock
Susan R. Wessler
T Wicker
TAGI AGI
Tina A. Graves
V Curwen
VV Kapitonov
W Beavis
W Gilbert
W Ramakrishna
W. Richard McCombie
WA Wilson
William Courtney
WJ Kent
Wolfgang Golser
X Cui
X Gao
XF Wang
XY Lin
Y Jia
Yeisoo Yu
YK Jin
Yujun Han
Z Swigonova
Z Yang
Publication venue: Public Library of Science
Publication date: 01/01/2009

Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These analyses, performed on ∼1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses.

Public Library of Science (PLOS)

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Archivio istituzionale della ricerca - Università degli Studi di Udine

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Purdue E-Pubs

University of Queensland eSpace