The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes

Author: Bennetzen Jeffrey L
Estill James C
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: High-quality annotation of the genes and transposable elements in complex genomes requires a human-curated integration of multiple sources of computational evidence. These evidences include results from a diversity of ab initio prediction programs as well as homology-based searches. Most of these programs operate on a single contiguous sequence at a time, and the results are generated in a diverse array of readable formats that must be translated to a standardized file format. These translated results must then be concatenated into a single source, and then presented in an integrated form for human curation. Results: We have designed, implemented, and assessed a Perl-based workflow named DAWGPAWS for the generation of computational results for human curation of the genes and transposable elements in plant genomes. The use of DAWGPAWS was found to accelerate annotation of 80–200 kb wheat DNA inserts in bacterial artificial chromosome (BAC) vectors by approximately twenty-fold and to also significantly improve the quality of the annotation in terms of completeness and accuracy. Conclusion: The DAWGPAWS genome annotation pipeline fills an important need in the annotation of plant genomes by generating computational evidences in a high-throughput manner, translating these results to a common file format, and facilitating the human curation of these computational results. We have verified the value of DAWGPAWS by using this pipeline to annotate the genes and transposable elements in 220 BAC insertions from the hexaploid wheat genome (Triticum aestivum L.). DAWGPAWS can be applied to annotation efforts in other plant genomes with minor modifications of program-specific configuration files, and the modular design of the workflow facilitates integration into existing pipelines.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Tuberculosis in the Western Pacific Region: Estimating the burden of disease and return on investment 2020–2030 in four countries

Author: Estill Janne
Houben Rein M.G.J.
Islam Tauhid
Keiser Olivia
McBryde Emma S.
Morishita Fukushi
Nguyen Anh Tuan
Oh Kyung Hyun
Orel Erol
Ragonnet Romain
Rahevar Kalpeshsinh
Raviglione Mario C.
Rudman Jamie
Trauer James M.
Publication venue: Elsevier BV
Publication date: 01/01/2021

Background: We aimed to estimate the disease burden of tuberculosis (TB) and return on investment of TB care in selected high-burden countries of the Western Pacific Region (WPR) until 2030. Methods: We projected the TB epidemic in Viet Nam and Lao People's Democratic Republic (PDR) 2020–2030 using a mathematical model under various scenarios: counterfactual (no TB care); baseline (TB care continues at current levels); and 12 different diagnosis and treatment interventions. We retrieved previous modeling results for China and the Philippines. We pooled the new and existing information on incidence and deaths in the four countries, covering >80% of the TB burden in WPR. We estimated the return on investment of TB care and interventions in Viet Nam and Lao PDR using a Solow model. Findings: In the baseline scenario, TB incidence in the four countries decreased from 97.0/100,000/year (2019) to 90.1/100,000/year (2030), and TB deaths from 83,300/year (2019) to 71,100/year (2030). Active case finding (ACF) strategies (screening people not seeking care for respiratory symptoms) were the most effective single interventions. Return on investment (2020–2030) for TB care in Viet Nam and Lao PDR ranged US$4-US$49/dollar spent; additional interventions brought up to US$2.7/dollar spent. Interpretation: In the modeled countries, TB incidence will only modestly decrease without additional interventions. Interventions that include ACF can reduce TB burden but achieving the End TB incidence and mortality targets will be difficult without new transformational tools (e.g. vaccine, new diagnostic tools, shorter treatment). However, TB care, even at its current level, can bring a multiple-fold return on investment.

LSHTM Research Online

ResearchOnline at James Cook University

PubMed Central

Archive ouverte UNIGE

Exceptional Diversity, Non-Random Distribution, and Rapid Evolution of Retroelements in the B73 Maize Genome

Author: AFA Smit
Ansuya Jogi
AP Tikhonov
B McClintock
B Piegu
BA Kronmiller
BC Meyers
BJM Zonneveld
C Vitte
Cristian Chaparro
DA Kramerov
DC Howell
DE Berg
EM McCarthy
H Chou
HA Schmidt
Harmit S. Malik
HH Fu
J Felsenstein
James C. Estill
JC Estill
JD Thompson
Jean-Marc Deragon
Jeffrey L. Bennetzen
JL Bennetzen
JM Deragon
JS Hawkins
JX Ma
K Fengler
KJ Edwards
KM Devos
M Umeda
M Yamazaki
MA Grandbastien
MA Johns
MJ Varagona
MV Mendiola
N Galtier
Naadira Upshaw
O Jaillon
P SanMiguel
Phillip J. SanMiguel
PJ SanMiguel
PL Deininger
PRJ Leeton
PS Schnable
RC Edgar
Regina S. Baucom
Richard P. Westerman
RS Baucom
RY Liu
S Brunner
S Tsukahara
SB Hedges
SR Wessler
T Wicker
TE Bureau
VV Kapitonov
W Gilbert
W Wang
XW Gai
Y Yasui
Y Yoshioka
YK Jin
Z Yang
Publication venue: Public Library of Science
Publication date: 01/11/2009

Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and ∼35,000 copies, respectively, or a combined ∼1% of the B73 nuclear genome. With regard to fully intact elements, the median copy number across all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity.
These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to differentially occupy and exploit this genomic diversity.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

A physical map for the Amborella trichopoda genome sheds light on the evolution of angiosperm genome structure

Author: Albert Victor A
Ayyampalayam Saravanaraj
Banks Jody
Barbazuk Brad
Bowers John E
Carlson John E
Chamala Srikar
Chanderbali Andre
Collura Kristi
dePamphilis Claude W
Duarte Jill
Estill James C
Goicoechea José Luis
Jiao Yuannian
Kudrna Dave
Leebens-Mack Jim
Luo Meizhong
Ma Hong
Mandoli Dina
Paterson Andrew H
Pires J Chris
Rounsley Steve
Sebastian Aswathy
Soltis Douglas E
Soltis Pamela S
Tang Haibao
Tomkins Jeffrey
Wing Rod A
Xiong Zhiyong
Yu Yeisoo
Zuccolo Andrea
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Recent phylogenetic analyses have identified Amborella trichopoda, an understory tree species endemic to the forests of New Caledonia, as sister to a clade including all other known flowering plant species. The Amborella genome is a unique reference for understanding the evolution of angiosperm genomes because it can serve as an outgroup to root comparative analyses. A physical map, BAC end sequences and sample shotgun sequences provide a first view of the 870 Mbp Amborella genome. Results: Analysis of Amborella BAC ends sequenced from each contig suggests that the density of long terminal repeat retrotransposons is negatively correlated with that of protein coding genes. Syntenic, presumably ancestral, gene blocks were identified in comparisons of the Amborella BAC contigs and the sequenced Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa genomes. Parsimony mapping of the loss of synteny corroborates previous analyses suggesting that the rate of structural change has been more rapid on lineages leading to Arabidopsis and Oryza compared with lineages leading to Populus and Vitis. The gamma paleohexaploidy event identified in the Arabidopsis, Populus and Vitis genomes is shown to have occurred after the divergence of all other known angiosperms from the lineage leading to Amborella. Conclusions: When placed in the context of a physical map, BAC end sequences representing just 5.4% of the Amborella genome have facilitated reconstruction of gene blocks that existed in the last common ancestor of all flowering plants. The Amborella genome is an invaluable reference for inferences concerning the ancestral angiosperm and subsequent genome evolution.

Crossref

Springer - Publisher Connector

PubMed Central

The University of Arizona

Archivio della ricerca della Scuola Superiore Sant'Anna

University of Queensland eSpace

An SNP Resource for Rice Genetics and Breeding Based on Subspecies Indica and Japonica Genome Alignments

Author: Estill James C.
Feltus F. Alex
Jiang Ning
Paterson Andrew H.
Schulze Stefan R.
Wan Jun
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/09/2004

Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp

Crossref

PubMed Central

Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy

Author: Bowers John E.
Estill James C.
Paterson Andrew H.
Pierce Gary J.
Rogers Carl J.
Rong Junkang
Schulze Stefan R.
Waghmare Vijay N.
Zhang Hua
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/09/2005

Both ancient and recent polyploidy, together with post-polyploidization loss of many duplicated gene copies, complicate angiosperm comparative genomics. To explore an approach by which these challenges might be mitigated, genetic maps of extant diploid and tetraploid cottons (Gossypium spp.) were used to infer the approximate order of 3016 loci along the chromosomes of their hypothetical common ancestor. The inferred Gossypium gene order corresponded more closely than the original maps did to a similarly inferred ancestral gene order predating an independent paleopolyploidization (α) in Arabidopsis. At least 59% of the cotton map and 53% of the Arabidopsis transcriptome showed correspondence in multilocus gene arrangements based on one or both of two software packages (CrimeStatII, FISH). Genomic regions in which chromosome structural rearrangement has been rapid (obscuring gene order correspondence) have also been subject to greater divergence of individual gene sequences. About 26%-44% of corresponding regions involved multiple Arabidopsis or cotton chromosomes, in some cases consistent with known, more ancient, duplications. The genomic distributions of multiple-locus probes provided early insight into the consequences for chromosome structure of an ancient large-scale duplication in cotton. Inferences that mitigate the consequences of ancient duplications improve leveraging of genomic information for model organisms in the study of more complex genomes.

Crossref

PubMed Central

Mapping social vulnerability to enhance housing and neighborhood resilience

Author: Aguirre Benigno E.
Bates Frederick L.
Blaikie Piers
Blanchard-Boehm R. Denise.
Bolin R. C.
Bolin Robert C.
Bunyan Bryant Paul Mohai
Cochrane Hal C.
Comerio Mary C.
Cutter Susan L.
Dash Nicole
Deyle Robert E.
Drabek Thomas E.
Edwards Margie L.
Enarson Elaine
Farley Reynolds.
Galindo Kimberly B.
Gladwin Hugh
Highfield W.
Lindell M.K.
Lindell Michael K.
Lindell Michael. K.
Logan John R.
Massey Douglas D.
Mileti Dennis.
Mitchell James K.
Moore Harry
Moore Harry Estill
Moore Harry Estill.
Morrow Betty Hearn
Morrow Betty Hearn.
Olshansky Robert B.
Peacock
Peacock Walter Gillis
Perry Ronald W.
Phillips Brenda D.
Rossi Peter H.
Rubin Claire. B.
Tierney Kathleen J.
Turner Ralph H.
Vaughn Elaine
Publication venue: Informa UK Limited

Crossref