8 research outputs found

Whole genome single nucleotide polymorphism based phylogeny of Francisella tularensis and its application to the development of a strain typing assay

Author: Fleischmann Robert D
Holmes Michael H
Jones Marcus
Karamycheva Svetlana A
Molins Claudia
Pandya Gagan A
Petersen Jeannine M
Peterson Scott N
Pradhan Sonal
Schriefer Martin E
Wolcott Mark J
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: A low genetic diversity in Francisella tularensis has been documented. Current DNA based genotyping methods for typing F. tularensis offer a limited and varying degree of subspecies, clade and strain level discrimination power. Whole genome sequencing is the most accurate and reliable method to identify, type and determine phylogenetic relationships among strains of a species. However, lower cost typing schemes are necessary in order to enable typing of hundreds or even thousands of isolates. Results: We have generated a high-resolution phylogenetic tree from 40 Francisella isolates, including 13 F. tularensis subspecies holarctica (type B) strains, 26 F. tularensis subsp. tularensis (type A) strains and a single F. novicida strain. The tree was generated from global multi-strain single nucleotide polymorphism (SNP) data collected using a set of six Affymetrix GeneChip® resequencing arrays with the non-repetitive portion of LVS (type B) as the reference sequence complemented with unique sequences of SCHU S4 (type A). Global SNP based phylogenetic clustering was able to resolve all non-related strains. The phylogenetic tree was used to guide the selection of informative SNPs specific to major nodes in the tree for development of a genotyping assay for identification of F. tularensis subspecies and clades. We designed and validated an assay that uses these SNPs to accurately genotype 39 additional F. tularensis strains as type A (A1, A2, A1a or A1b) or type B (B1 or B2). Conclusion: Whole-genome SNP based clustering was shown to accurately identify SNPs for differentiation of F. tularensis subspecies and clades, emphasizing the potential power and utility of this methodology for selecting SNPs for typing of F. tularensis to the strain level. Additionally, whole genome sequence based SNP information gained from a representative population of strains may be used to perform evolutionary or phylogenetic comparisons of strains, or selection of unique strains for whole-genome sequencing projects.
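
The idea of selecting SNPs that are specific to a node of the tree can be illustrated with a minimal sketch. It is not the authors' pipeline: the strain names, the toy allele strings, the clade labels and the function `clade_specific_snps` are all hypothetical, and the rule used here (one allele shared by every strain in the clade and absent from every other strain) is only one simple way to define an informative SNP.

```python
from collections import defaultdict


def clade_specific_snps(calls, clades):
    """Return {clade: [positions]} where the clade has one allele that no other strain carries.

    calls  -- hypothetical mapping of strain name to a string of allele calls,
              one character per SNP position (all strings the same length)
    clades -- hypothetical mapping of strain name to clade label
    """
    n_positions = len(next(iter(calls.values())))
    result = defaultdict(list)
    for clade in sorted(set(clades.values())):
        inside = [s for s in calls if clades[s] == clade]
        outside = [s for s in calls if clades[s] != clade]
        for pos in range(n_positions):
            in_alleles = {calls[s][pos] for s in inside}
            out_alleles = {calls[s][pos] for s in outside}
            # Informative: fixed within the clade and never seen outside it.
            if len(in_alleles) == 1 and not (in_alleles & out_alleles):
                result[clade].append(pos)
    return dict(result)


# Toy data, invented for illustration only.
calls = {
    "strainA1": "ACGTA",
    "strainA2": "ACGTC",
    "strainB1": "GCTTA",
    "strainB2": "GCTTA",
}
clades = {"strainA1": "A", "strainA2": "A", "strainB1": "B", "strainB2": "B"}

diagnostic = clade_specific_snps(calls, clades)
print(diagnostic)
```

With the toy data, positions 0 and 2 separate the two clades, while position 4 is rejected because it varies inside clade A.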

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Local Admixture of Amplified and Diversified Secreted Pathogenesis Determinants Shapes Mosaic Toxoplasma gondii Genomes

Toxoplasma gondii is among the most prevalent parasites worldwide, infecting many wild and domestic animals and causing zoonotic infections in humans. T. gondii differs substantially in its broad distribution from closely related parasites that typically have narrow, specialized host ranges. To elucidate the genetic basis for these differences, we compared the genomes of 62 globally distributed T. gondii isolates to several closely related coccidian parasites. Our findings reveal that tandem amplification and diversification of secretory pathogenesis determinants is the primary feature that distinguishes the closely related genomes of these biologically diverse parasites. We further show that the unusual population structure of T. gondii is characterized by clade-specific inheritance of large conserved haploblocks that are significantly enriched in tandemly clustered secretory pathogenesis determinants. The shared inheritance of these conserved haploblocks, which show a different ancestry than the genome as a whole, may thus influence transmission, host range and pathogenicity.

Crossref

HAL-UNILIM

PubMed Central

University of Kentucky

eScholarship - University of California

ScholarlyCommons@Penn

Monitoring the Long-Term Molecular Epidemiology of the Pneumococcus and Detection of Potential ‘Vaccine Escape’ Strains

Author: AC Springman
AH Tu
B Beall
B Beall
B Ewing
B Flannery
B Henriques-Normark
B Snel
Behnam Jarrahi
C Blomberg
C Mollet
C Obert
CC Daniels
CG Whitney
CJ Orihuela
CL Byington
DA Robinson
DE Briles
DL Hava
DP Martin
F Ronquist
F Tajima
F Tajima
GA Pandya
GA Pandya
Gagan A. Pandya
GG Sutton
GS Nabors
H Roche
H Tettelin
IM Feavers
IS Aaberge
J Yother
J Yother
JA Lanie
JE Clarridge 3rd
JE Cooper
Jia Liu
JJ Bijlsma
JP Claverys
JP Huelsenbeck
L McGee
LA Hicks
M Darrieux
M Drancourt
M Nei
M. Catherine McEllistrem
MC Maiden
MC McEllistrem
MC McEllistrem
Michael H. Holmes
MJ Jedrzejas
MR Moore
NA Silva
Pratap Venepally
Ravi Sanka
RC Edgar
RJ Singleton
Robert D. Fleischmann
S Rozen
Scott N. Peterson
Shabir Ahmed Madhi
SJ Rehm
SK Hollingshead
Svetlana A. Karamycheva
T Do
T van der Poll
TJ Mitchell
TM File
W Xin
WP Hanage
Y Chen
Yun Bai
Publication venue: Public Library of Science
Publication date: 01/01/2011

While the pneumococcal protein conjugate vaccines reduce the incidence of invasive pneumococcal disease (IPD), serotype replacement remains a major concern. Thus, serotype-independent protection with vaccines targeting virulence genes, such as PspA, has been pursued. PspA is comprised of diverse clades that arose through recombination. Therefore, multi-locus sequence typing (MLST)-defined clones could conceivably include strains from multiple PspA clades. As a result, a method is needed which can both monitor the long-term epidemiology of the pneumococcus among a large number of isolates, and analyze vaccine-candidate genes, such as pspA, for mutations and recombination events that could result in 'vaccine escape' strains. We developed a resequencing array consisting of five conserved and six variable genes to characterize 72 pneumococcal strains. The phylogenetic analysis of the 11 concatenated genes was performed with the MrBayes program, the single nucleotide polymorphism (SNP) analysis with the DNA Sequence Polymorphism program (DnaSP), and the recombination event analysis with the recombination detection package (RDP). The phylogenetic analysis correlated with MLST, and identified clonal strains with unique PspA clades. The DnaSP analysis correlated with the serotype-specific diversity detected using MLST. Serotypes associated with more than one ST complex had a larger degree of sequence polymorphism than a serotype associated with one ST complex. The RDP analysis confirmed the high frequency of recombination events in the pspA gene. The phylogenetic tree correlated with MLST, and detected multiple PspA clades among clonal strains. The genetic diversity of the strains and the frequency of recombination events in the mosaic gene pspA were accurately assessed using the DnaSP and RDP programs, respectively. These data provide proof-of-concept that resequencing arrays could play an important role within research and clinical laboratories in both monitoring the molecular epidemiology of the pneumococcus and detecting 'vaccine escape' strains among vaccine-candidate genes.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Predicted highly derived class 1 CRISPR-Cas system in Haloarchaea containing diverged Cas5 and Cas7 homologs but no CRISPR array

Author: Garrett Roger A.
Karamycheva Svetlana
Koonin Eugene V.
Makarova Kira S.
Shah Shiraz A.
Vestergaard Gisle
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2019

Copenhagen University Research Information System

Online Research Database In Technology

Comparative Analyses of Potato Expressed Sequence Tag Libraries

Author: Ascenzi Robert A.
Baker Barbara J.
Bougri Oleg
Buell C. Robin
Cho Jennifer
Fry William E.
Griffiths Helen M.
Hart Amy L.
Jin Hailing
Karamycheva Svetlana
Lee Yuandan
Pertea Geo M.
Quackenbush John
Restrepo Silvia
Riedmuller Steve B.
Ronning Catherine M.
Smart Christine D.
Stegalkina Svetlana S.
Sultana Razvan
Tanksley Steve
Tsai Jennifer
Utterback Teresa R.
van der Hoeven Rutger
Vanaken Susan E.
White Joseph A.
Yamamoto Miki L.
Zhang Peifen
Publication venue: American Society of Plant Biologists
Publication date: 01/02/2003

The cultivated potato (Solanum tuberosum) shares similar biology with other members of the Solanaceae, yet has features unique within the family, such as modified stems (stolons) that develop into edible tubers. To better understand potato biology, we have undertaken a survey of the potato transcriptome using expressed sequence tags (ESTs) from diverse tissues. A total of 61,940 ESTs were generated from aerial tissues, below-ground tissues, and tissues challenged with the late-blight pathogen (Phytophthora infestans). Clustering and assembly of these ESTs resulted in a total of 19,892 unique sequences with 8,741 tentative consensus sequences and 11,151 singleton ESTs. We were able to identify a putative function for 43.7% of these sequences. A number of sequences (48) were expressed throughout the libraries sampled, representing constitutively expressed sequences. Other sequences (13,068, 21%) were uniquely expressed and were detected only in a single library. Using hierarchical and k-means clustering of the EST sequences, we were able to correlate changes in gene expression with major physiological events in potato biology. Using pair-wise comparisons of tuber-related tissues, we were able to associate genes with tuber initiation, dormancy, and sprouting. We also were able to identify a number of characterized as well as novel sequences that were unique to the incompatible interaction of late-blight pathogen, thereby providing a foundation for further understanding the mechanism of resistance.
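
The counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only figures from the abstract; the variable names are illustrative, and reading the 21% as a share of the 61,940 total ESTs is an assumption, since the abstract does not state the denominator.

```python
# Figures taken from the abstract above.
total_ests = 61_940
tentative_consensus = 8_741
singletons = 11_151
library_specific = 13_068

# Unique sequences are the tentative consensus sequences plus the singleton ESTs.
unique_sequences = tentative_consensus + singletons

# Assumption: the quoted 21% is measured against the total number of ESTs.
library_specific_share = library_specific / total_ests * 100

print(unique_sequences)
print(round(library_specific_share, 1))
```

The sum reproduces the 19,892 unique sequences reported, and the share rounds to 21%, which is consistent with the assumed denominator.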

Crossref

PubMed Central

Sequence Evaluation of Four Pooled-Tissue Normalized Bovine cDNA Libraries and Construction of a Gene Index for Cattle

Author: Bennett Gary L.
Casas Eduardo
Chitko-McKown Carol G.
Cho Jennifer
Fahrenkrug Scott C.
Freking Brad A.
Grosse William M.
Heaton Michael P.
Holt Ingeborg
Karamycheva Svetlana
Keele John W.
Laegreid William W.
Liang Feng
Pertea Geo
Quackenbush John
Roberts Andrew J.
Rohrer Gary A.
Smith Timothy P.L.
Stone Roger T.
White Joseph
Wray James E.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/04/2001

An essential component of functional genomics studies is the sequence of DNA expressed in tissues of interest. To provide a resource of bovine-specific expressed sequence data and facilitate this powerful approach in cattle research, four normalized cDNA libraries were produced and arrayed for high-throughput sequencing. The libraries were made with RNA pooled from multiple tissues to increase efficiency of normalization and maximize the number of independent genes for which sequence data were obtained. Target tissues included those with highest likelihood to have impact on production parameters of animal health, growth, reproductive efficiency, and carcass merit. Success of normalization and inter- and intralibrary redundancy were assessed by collecting 6000–23,000 sequences from each of the libraries (68,520 total sequences deposited in GenBank). Sequence comparison and assembly of these sequences were performed in combination with 56,500 other bovine EST sequences present in the GenBank dbEST database to construct a cattle Gene Index (available from The Institute for Genomic Research at http://www.tigr.org/tdb/tgi.shtml). The 124,381 bovine ESTs present in GenBank at the time of the analysis form 16,740 assemblies that are listed and annotated on the Web site. Analysis of individual library sequence data indicates that the pooled-tissue approach was highly effective in preparing libraries for efficient deep sequencing.

Crossref

PubMed Central

Local admixture of amplified and diversified secreted pathogenesis determinants shapes mosaic Toxoplasma gondii genomes

Author: Ajioka James W.
Ajzenberg Daniel
Behnke Michael S.
Boothroyd John
Boyle Jon P.
Brunk Brian P.
Dardé Marie-Laure
Sibley L. David
Diaz-Miranda Maria A.
Dubey Jitender P.
Fritz Heather M.
Gennari Solange M.
Gregory Brian D.
Grigg Michael E.
Hadjithomas Michalis
Howe Daniel K.
Karamycheva Svetlana
Khan Asis
Kim Kami
Kissinger Jessica C.
Liu L
Lorenzi L
Namasivayam Sivaranjani
Parkinson John
Pinney Deborah
Roos David S.
Rosenthal Benjamin M.
Saeij Jeroen P. J.
Su Chunlei
Swapna L
White Michael W.
Zhu Xing-Quan
Publication venue: Nature Publishing Group
Publication date: 01/01/2016

HAL-UNILIM

eScholarship - University of California

Araport: the Arabidopsis Information Portal

Author: Altschul
Balakrishnan
Benjamin D. Rosen
Brady
Chia-Yi Cheng
Christopher D. Town
Dooley
Eilbeck
Erik S. Ferlanti
Fielding
Gene Ontology Consortium.
Goff
Gomez
Goodstein
Gos Micklem
Haas
Haas
International Arabidopsis Informatics Consortium
Jason R. Miller
Joseph Stubbs
Julie M. Sullivan
Kalderimis
Konstantinos Krampis
Lamesch
Lee
Lyne
Lyons
Maglott
Maria Kim
Matthew R. Hanlon
Matthew Vaughn
Mi
Orchard
Sanderson
Sergio Contrino
Shannon
Skinner
Smith
Stephen A. Mock
Svetlana Karamycheva
UniProt Consortium.
Vivek Krishnakumar
Walter Moreira
Winter
Zhou
Publication venue: Oxford University Press (OUP)

Crossref