Search CORE

4 research outputs found

VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data

Author: Adkins Joshua N
Ansong Charles
Cannon William R
Jensen Jeffrey L
Kobold Markus A
McCue Lee Ann
Payne Samuel H
Peterson Elena S
Schrimpe-Rutledge Alexandra C
Walker Hyunjoo
Webb Samantha R
Webb-Robertson Bobbie-Jo M
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. Results VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (<it>Yersinia pestis </it>Pestoides F and <it>Synechococcus </it>sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. Conclusions VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at <url>https://www.biopilot.org/docs/Software/Vespa.php</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Sequential projection pursuit principal component analysis – dealing with missing data associated with new -omics technologies

Author: Bobbie-Jo M. Webb-Robertson
Hyunjoo Walker
Jason E. McDermott
Joel G. Pounds
Karin D. Rodland
Katrina M. Waters
Melissa M. Matzke
Thomas O. Metz
Publication venue: Future Science Ltd
Publication date: 01/03/2013
Field of study

Principal Component Analysis (PCA) is a common exploratory tool used to evaluate large complex data sets. The resulting lower-dimensional representations are often valuable for pattern visualization, clustering, or classification of the data. However, PCA cannot be applied directly to many -omics data sets generated by newer technologies such as label-free mass spectrometry due to large numbers of non-random missing values. Here we present a sequential projection pursuit PCA (sppPCA) method for defining principal components in the presence of missing data. Our results demonstrate that this approach generates robust and informative low-dimensional data representations compared to commonly used imputation approaches

Crossref

Directory of Open Access Journals

Ribotype variability of Clostridioides difficile strains in patients with hospital-acquired C. difficile infections, community-acquired C. difficile infections, and colonization with toxigenic and non-toxigenic strains of C. difficile

Author: Barbut
Bidet
Bongyoung Kim
Clabots
Cohen
Crobach
Curry
Eyre
Freeman
Gravel
Hyunjoo Pai
Jieun Kim
Kelly
Kim
Kim
L Ogra
Lanzas
Martin
McDonald
McDonald
Mi-Ran Seo
Razavi
Schäffler
Stoesser
Walker
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Clinical Characteristics and Treatment Outcomes of Clostridium difficile Infections by PCR Ribotype 017 and 018 Strains

Author: A Goorhuis
AS Walker
DA Collins
DN Gerding
FC Tenover
G Vedantam
H Kato
Hyunjoo Pai
I See
J Freeman
J Huelsenbeck
J Kim
J Kim
J Kim
J Kim
J Kim
Jieun Kim
KE Dingle
M Charlson
M Komatsu
M Merrigan
M Senoh
Michel R. Popoff
MM Awad
MP Bauer
P Spigaglia
PM Hawkey
R Baldan
S Erb
S Johnson
S Khanna
SH Cohen
ST Walk
VC Cheng
Yeonjae Kim
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref