25 research outputs found
Allocating HIV Prevention Funds in the United States: Recommendations from an Optimization Model
The Centers for Disease Control and Prevention (CDC) had an annual budget of approximately $327 million to fund health departments and community-based organizations for core HIV testing and prevention programs domestically between 2001 and 2006. Annual HIV incidence has been relatively stable since the year 2000 [1] and was estimated at 48,600 cases in 2006 and 48,100 in 2009 [2]. Using estimates on HIV incidence, prevalence, prevention program costs and benefits, and current spending, we created an HIV resource allocation model that can generate a mathematically optimal allocation of the Division of HIV/AIDS Prevention’s extramural budget for HIV testing, and counseling and education programs. The model’s data inputs and methods were reviewed by subject matter experts internal and external to the CDC via an extensive validation process. The model projects the HIV epidemic for the United States under different allocation strategies under a fixed budget. Our objective is to support national HIV prevention planning efforts and inform the decision-making process for HIV resource allocation. Model results can be summarized into three main recommendations. First, more funds should be allocated to testing and these should further target men who have sex with men and injecting drug users. Second, counseling and education interventions ought to provide a greater focus on HIV positive persons who are aware of their status. And lastly, interventions should target those at high risk for transmitting or acquiring HIV, rather than lower-risk members of the general population. The main conclusions of the HIV resource allocation model have played a role in the introduction of new programs and provide valuable guidance to target resources and improve the impact of HIV prevention efforts in the United States
Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin
Recent genomic analyses of pathologically defined tumor types identify “within-a-tissue” disease subtypes. However, the extent to which genomic signatures are shared across tissues is still unclear. We performed an integrative analysis using five genome-wide platforms and one proteomic platform on 3,527 specimens from 12 cancer types, revealing a unified classification into 11 major subtypes. Five subtypes were nearly identical to their tissue-of-origin counterparts, but several distinct cancer types were found to converge into common subtypes. Lung squamous, head and neck, and a subset of bladder cancers coalesced into one subtype typified by TP53 alterations, TP63 amplifications, and high expression of immune and proliferation pathway genes. Of note, bladder cancers split into three pan-cancer subtypes. The multiplatform classification, while correlated with tissue-of-origin, provides independent information for predicting clinical outcomes. All data sets are available for data-mining from a unified resource to support further biological discoveries and insights into novel therapeutic strategie
Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin
Recent genomic analyses of pathologically-defined tumor types identify “within-a-tissue” disease subtypes. However, the extent to which genomic signatures are shared across tissues is still unclear. We performed an integrative analysis using five genome-wide platforms and one proteomic platform on 3,527 specimens from 12 cancer types, revealing a unified classification into 11 major subtypes. Five subtypes were nearly identical to their tissue-of-origin counterparts, but several distinct cancer types were found to converge into common subtypes. Lung squamous, head & neck, and a subset of bladder cancers coalesced into one subtype typified by TP53 alterations, TP63 amplifications, and high expression of immune and proliferation pathway genes. Of note, bladder cancers split into three pan-cancer subtypes. The multi-platform classification, while correlated with tissue-of-origin, provides independent information for predicting clinical outcomes. All datasets are available for data-mining from a unified resource to support further biological discoveries and insights into novel therapeutic strategies
Recommended from our members
Prediction of cancer phenotypes through the integration of multi-omic data and prior information
High-throughput data have become ubiquitous in the study of biological phenomena. We can now query cellular state at higher resolution, giving us better insight into complex diseases. For example, there are currently tens of thousands of cancer patients with simultaneous copy number, mutation, methylation, mRNA, miRNA and protein level profiles. Furthermore, cellular perturbations are increasingly characterized on the multi-omic level. These experiments uncover important dependencies among genes, their products and environmental conditions - relationships that accumulate in a growing number of databases. However, the integration of such prior pathway knowledge with new heterogeneous genomic measurements in an interpretable model remains a formidable challenge that is still not fully solved.My thesis presents three different approaches which incrementally address that problem. First, I present a feature engineering method (hVIPER) that infers kinase protein activity levels in a pathway-informed manner. Next, I develop one of the joint winners of the DREAM9 Gene Essentiality Prediction Challenge - a Multiple Kernel Learning algorithm with multi-omic pathway-derived kernel functions (MPL). Finally, I improve upon the DREAM9 winner by introducing empirical kernel functions computed through Random Forest tree ensembles (AKLIMATE). AKLIMATE outperforms state-of-the-art methods in diverse phenotype learning tasks, including predicting microsatellite instability in endometrial and colorectal cancer, survival in breast cancer and shRNA knockdown response in CCLE cell lines. In conclusion, I briefly demonstrate how AKLIMATE can be adapted to the development of multi-omic minimum-feature predictors for patient subtypes
Accurate cancer phenotype prediction with AKLIMATE, a stacked kernel learner integrating multimodal genomic data and pathway knowledge.
Advancements in sequencing have led to the proliferation of multi-omic profiles of human cells under different conditions and perturbations. In addition, many databases have amassed information about pathways and gene "signatures"-patterns of gene expression associated with specific cellular and phenotypic contexts. An important current challenge in systems biology is to leverage such knowledge about gene coordination to maximize the predictive power and generalization of models applied to high-throughput datasets. However, few such integrative approaches exist that also provide interpretable results quantifying the importance of individual genes and pathways to model accuracy. We introduce AKLIMATE, a first kernel-based stacked learner that seamlessly incorporates multi-omics feature data with prior information in the form of pathways for either regression or classification tasks. AKLIMATE uses a novel multiple-kernel learning framework where individual kernels capture the prediction propensities recorded in random forests, each built from a specific pathway gene set that integrates all omics data for its member genes. AKLIMATE has comparable or improved performance relative to state-of-the-art methods on diverse phenotype learning tasks, including predicting microsatellite instability in endometrial and colorectal cancer, survival in breast cancer, and cell line response to gene knockdowns. We show how AKLIMATE is able to connect feature data across data platforms through their common pathways to identify examples of several known and novel contributors of cancer and synthetic lethality
Recommended from our members
Accurate cancer phenotype prediction with AKLIMATE, a stacked kernel learner integrating multimodal genomic data and pathway knowledge.
Advancements in sequencing have led to the proliferation of multi-omic profiles of human cells under different conditions and perturbations. In addition, many databases have amassed information about pathways and gene "signatures"-patterns of gene expression associated with specific cellular and phenotypic contexts. An important current challenge in systems biology is to leverage such knowledge about gene coordination to maximize the predictive power and generalization of models applied to high-throughput datasets. However, few such integrative approaches exist that also provide interpretable results quantifying the importance of individual genes and pathways to model accuracy. We introduce AKLIMATE, a first kernel-based stacked learner that seamlessly incorporates multi-omics feature data with prior information in the form of pathways for either regression or classification tasks. AKLIMATE uses a novel multiple-kernel learning framework where individual kernels capture the prediction propensities recorded in random forests, each built from a specific pathway gene set that integrates all omics data for its member genes. AKLIMATE has comparable or improved performance relative to state-of-the-art methods on diverse phenotype learning tasks, including predicting microsatellite instability in endometrial and colorectal cancer, survival in breast cancer, and cell line response to gene knockdowns. We show how AKLIMATE is able to connect feature data across data platforms through their common pathways to identify examples of several known and novel contributors of cancer and synthetic lethality
A model for allocating CDC’s HIV prevention resources in the United States
HIV/AIDS, Resource allocation, Optimization model,
New infections and costs by budget amount given model allocation.
a<p>Refers to the number of infections averted relative to the next annual budget increment.</p>b<p>Refers to the number of infections averted relative to no investment of funds.</p>c<p>Calculated as the total budget over 5 years divided by the total infections averted.</p
Allocated proportion of budget.
a<p>May not total 100%. Includes funds reaching to the 3 risk groups only, not the general population.</p>b<p>Undiagnosed positive persons are not distinguishable from susceptible persons so they are targeted together.</p