Search CORE

Digital Commons@Becker

University of Kentucky

Analysis of retinoids by high-performance liquid chromatography using programmed gradient separation

Author: Annesley Thomas M.
Ellis Charles N.
Giacherio Donald A.
Grekin Roy C.
Wilkerson Karen
Publication venue: 'Elsevier BV'
Publication date: 01/01/1984
Field of study

Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/24984/1/0000411.pd

Deep Blue Documents at the University of Michigan

SigFuge: Single gene clustering of RNA-seq reveals differential isoform usage among cancer samples

Author: Cabanski Christopher R
Hayes D. Neil
Johnson Amy R
Kimes Patrick K
Liu Yufeng
Maher Christopher A
Makowski Liza
Marron J. S
Perou Charles M
Wilkerson Matthew D
Zhao Ni
Publication venue: Digital Commons@Becker
Publication date: 01/01/2014
Field of study

High-throughput sequencing technologies, including RNA-seq, have made it possible to move beyond gene expression analysis to study transcriptional events including alternative splicing and gene fusions. Furthermore, recent studies in cancer have suggested the importance of identifying transcriptionally altered loci as biomarkers for improved prognosis and therapy. While many statistical methods have been proposed for identifying novel transcriptional events with RNA-seq, nearly all rely on contrasting known classes of samples, such as tumor and normal. Few tools exist for the unsupervised discovery of such events without class labels. In this paper, we present SigFuge for identifying genomic loci exhibiting differential transcription patterns across many RNA-seq samples. SigFuge combines clustering with hypothesis testing to identify genes exhibiting alternative splicing, or differences in isoform expression. We apply SigFuge to RNA-seq cohorts of 177 lung and 279 head and neck squamous cell carcinoma samples from the Cancer Genome Atlas, and identify several cases of differential isoform usage including CDKN2A, a tumor suppressor gene known to be inactivated in a majority of lung squamous cell tumors. By not restricting attention to known sample stratifications, SigFuge offers a novel approach to unsupervised screening of genetic loci across RNA-seq cohorts. SigFuge is available as an R package through Bioconductor

Digital Commons@Becker

ABRA: improved coding indel detection via assembly-based realignment

Author: Mose Lisle E.
Neil Hayes D.
Parker Joel S.
Perou Charles M.
Wilkerson Matthew D.
Publication venue
Publication date: 01/01/2014
Field of study

Motivation: Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germ-line genotypes. Existing NGS read mappers have difficulty accurately mapping short reads containing complex variation (i.e. more than a single base change), thus making identification of such variants difficult or impossible. Insertions and deletions (indels) in particular have been an area of great difficulty. Indels are frequent and can have substantial impact on function, which makes their detection all the more imperative.Results: We present ABRA, an assembly-based realigner, which uses an efficient and flexible localized de novo assembly followed by global realignment to more accurately remap reads. This results in enhanced performance for indel detection as well as improved accuracy in variant allele frequency estimation.Availability and implementation: ABRA is implemented in a combination of Java and C/C++ and is freely available for download at https://github.com/mozack/abra.Contact: [email protected]; [email protected] information: Supplementary data are available at Bioinformatics online

Integrated RNA and DNA sequencing improves mutation detection in low purity tumors

Author: Cabanski Christopher R.
Hammerman Peter S.
Hayes D. Neil
Hoadley Katherine A.
Mose Lisle E.
Parker Joel S.
Perou Charles M.
Sun Wei
Troester Melissa A.
Walter Vonn
Wilkerson Matthew D.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2014
Field of study

Identifying somatic mutations is critical for cancer genome characterization and for prioritizing patient treatment. DNA whole exome sequencing (DNA-WES) is currently the most popular technology; however, this yields low sensitivity in low purity tumors. RNA sequencing (RNA-seq) covers the expressed exome with depth proportional to expression. We hypothesized that integrating DNA-WES and RNA-seq would enable superior mutation detection versus DNA-WES alone. We developed a first-of-its-kind method, called UNCeqR, that detects somatic mutations by integrating patient-matched RNA-seq and DNA-WES. In simulation, the integrated DNA and RNA model outperformed the DNA-WES only model. Validation by patient-matched whole genome sequencing demonstrated superior performance of the integrated model over DNA-WES only models, including a published method and published mutation profiles. Genome-wide mutational analysis of breast and lung cancer cohorts (n = 871) revealed remarkable tumor genomics properties. Low purity tumors experienced the largest gains in mutation detection by integrating RNA-seq and DNA-WES. RNA provided greater mutation signal than DNA in expressed mutations. Compared to earlier studies on this cohort, UNCeqR increased mutation rates of driver and therapeutically targeted genes (e.g. PIK3CA, ERBB2 and FGFR2). In summary, integrating RNA-seq with DNA-WES increases mutation detection performance, especially for low purity tumors

Harvard University - DASH

Digital Commons@Becker

Public Library of Science (PLOS)

SWISS MADE: Standardized WithIn Class Sum of Squares to Evaluate Methodologies and Dataset Elements

Author: Chad Creighton
Charles M. Perou
Cheng Fan
Christopher R. Cabanski
D. Neil Hayes
Eric Bair
J. S. Marron
Jianying Li
Matthew D. Wilkerson
Michele C. Hayward
Xiaoying Yin
Yuan Qi
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Contemporary high dimensional biological assays, such as mRNA expression microarrays, regularly involve multiple data processing steps, such as experimental processing, computational processing, sample selection, or feature selection (i.e. gene selection), prior to deriving any biological conclusions. These steps can dramatically change the interpretation of an experiment. Evaluation of processing steps has received limited attention in the literature. It is not straightforward to evaluate different processing methods and investigators are often unsure of the best method. We present a simple statistical tool, Standardized WithIn class Sum of Squares (SWISS), that allows investigators to compare alternate data processing methods, such as different experimental methods, normalizations, or technologies, on a dataset in terms of how well they cluster a priori biological classes. SWISS uses Euclidean distance to determine which method does a better job of clustering the data elements based on a priori classifications. We apply SWISS to three different gene expression applications. The first application uses four different datasets to compare different experimental methods, normalizations, and gene sets. The second application, using data from the MicroArray Quality Control (MAQC) project, compares different microarray platforms. The third application compares different technologies: a single Agilent two-color microarray versus one lane of RNA-Seq. These applications give an indication of the variety of problems that SWISS can be helpful in solving. The SWISS analysis of one-color versus two-color microarrays provides investigators who use two-color arrays the opportunity to review their results in light of a single-channel analysis, with all of the associated benefits offered by this design. Analysis of the MACQ data shows differential intersite reproducibility by array platform. SWISS also shows that one lane of RNA-Seq clusters data by biological phenotypes as well as a single Agilent two-color microarray

Directory of Open Access Journals

ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data

Author: Bizon Chris
Cabanski Christopher R
Cavin Keary
Hayes D
Marron JS
Parker Joel
Perou Charles
Wilhelmsen Kirk
Wilkerson Matthew D
Publication venue: BioMed Central Ltd
Publication date: 04/09/2012
Field of study

AbstractBackgroundNext-generation sequencing technologies have become important tools for genome-wide studies. However, the quality scores that are assigned to each base have been shown to be inaccurate. If the quality scores are used in downstream analyses, these inaccuracies can have a significant impact on the results.ResultsHere we present ReQON, a tool that recalibrates the base quality scores from an input BAM file of aligned sequencing data using logistic regression. ReQON also generates diagnostic plots showing the effectiveness of the recalibration. We show that ReQON produces quality scores that are both more accurate, in the sense that they more closely correspond to the probability of a sequencing error, and do a better job of discriminating between sequencing errors and non-errors than the original quality scores. We also compare ReQON to other available recalibration tools and show that ReQON is less biased and performs favorably in terms of quality score accuracy.ConclusionReQON is an open source software package, written in R and available through Bioconductor, for recalibrating base quality scores for next-generation sequencing data. ReQON produces a new BAM file with more accurate quality scores, which can improve the results of downstream analysis, and produces several diagnostic plots showing the effectiveness of the recalibration

A national survey of medical education fellowships

Author: Britta M. Thompson
Charles J. Hatem
Elizabeth Nelson
Epstein RM
Fleming VM
Gruppen LD
Gruppen LD
Hackbarth GM
Hatem CJ
Hatem CJ
Jolly BC
Larry D. Gruppen
Lown BA
Moses AS
Nancy S. Searle
Searle NS
Searle NS
Steinert Y
Wilkerson L
Publication venue: Medical Education Online
Publication date: 01/01/2011
Field of study

Purpose: The purpose of our study was to determine the prevalence, focus, time commitment, graduation requirements and programme evaluation methods of medical education fellowships throughout the United States. Medical education fellowships are defined as a single cohort of medical teaching faculty who participate in an extended faculty development programme. Methods: A 26-item online questionnaire was distributed to all US medical schools (n=127) in 2005 and 2006. The questionnaire asked each school if it had a medical education fellowship and the characteristics of the fellowship programme. Results: Almost half (n=55) of the participating schools (n=120, response rate 94.5 %) reported having fellowships. Duration (10–584 hours) and length (<1 month–48 months) varied; most focused on teaching skills, scholarly dissemination and curriculum design, and required the completion of a scholarly project. A majority collected participant satisfaction; few used other programme evaluation strategies. Conclusions: The number of medical education fellowships increased rapidly during the 1990s and 2000s. Across the US, programmes are similar in participant characteristics and curricular focus but unique in completion requirements. Fellowships collect limited programme evaluation data, indicating a need for better outcome data. These results provide benchmark data for those implementing or revising existing medical education fellowships

Harvard University - DASH

Directory of Open Access Journals

ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data

Author: Charles M Perou
Chris Bizon
Christopher R Cabanski
D Hayes
Joel S Parker
JS Marron
Keary Cavin
Kirk C Wilhelmsen
Matthew D Wilkerson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Background Next-generation sequencing technologies have become important tools for genome-wide studies. However, the quality scores that are assigned to each base have been shown to be inaccurate. If the quality scores are used in downstream analyses, these inaccuracies can have a significant impact on the results. Results Here we present ReQON, a tool that recalibrates the base quality scores from an input BAM file of aligned sequencing data using logistic regression. ReQON also generates diagnostic plots showing the effectiveness of the recalibration. We show that ReQON produces quality scores that are both more accurate, in the sense that they more closely correspond to the probability of a sequencing error, and do a better job of discriminating between sequencing errors and non-errors than the original quality scores. We also compare ReQON to other available recalibration tools and show that ReQON is less biased and performs favorably in terms of quality score accuracy. Conclusion ReQON is an open source software package, written in R and available through Bioconductor, for recalibrating base quality scores for next-generation sequencing data. ReQON produces a new BAM file with more accurate quality scores, which can improve the results of downstream analysis, and produces several diagnostic plots showing the effectiveness of the recalibration

Springer - Publisher Connector