161 research outputs found

    Phenotype Prediction Using Regularized Regression on Genetic Data in the DREAM5 Systems Genetics B Challenge

    Get PDF
    A major goal of large-scale genomics projects is to enable the use of data from high-throughput experimental methods to predict complex phenotypes such as disease susceptibility. The DREAM5 Systems Genetics B Challenge solicited algorithms to predict soybean plant resistance to the pathogen Phytophthora sojae from training sets including phenotype, genotype, and gene expression data. The challenge test set was divided into three subcategories, one requiring prediction based on only genotype data, another on only gene expression data, and the third on both genotype and gene expression data. Here we present our approach, primarily using regularized regression, which received the best-performer award for subchallenge B2 (gene expression only). We found that despite the availability of 941 genotype markers and 28,395 gene expression features, optimal models determined by cross-validation experiments typically used fewer than ten predictors, underscoring the importance of strong regularization in noisy datasets with far more features than samples. We also present substantial analysis of the training and test setup of the challenge, identifying high variance in performance on the gold standard test sets.National Science Foundation (U.S.). Graduate Research Fellowship ProgramNational Defense Science and Engineering Graduate Fellowshi

    Detection of regulator genes and eQTLs in gene networks

    Full text link
    Genetic differences between individuals associated to quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated to differences in expression levels of nearby genes (they are "expression quantitative trait loci" or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and softwares to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico.Comment: minor revision with typos corrected; review article; 24 pages, 2 figure

    QCD and strongly coupled gauge theories : challenges and perspectives

    Get PDF
    We highlight the progress, current status, and open challenges of QCD-driven physics, in theory and in experiment. We discuss how the strong interaction is intimately connected to a broad sweep of physical problems, in settings ranging from astrophysics and cosmology to strongly coupled, complex systems in particle and condensed-matter physics, as well as to searches for physics beyond the Standard Model. We also discuss how success in describing the strong interaction impacts other fields, and, in turn, how such subjects can impact studies of the strong interaction. In the course of the work we offer a perspective on the many research streams which flow into and out of QCD, as well as a vision for future developments.Peer reviewe

    smt: a Matlab structured matrices toolbox

    Full text link
    We introduce the smt toolbox for Matlab. It implements optimized storage and fast arithmetics for circulant and Toeplitz matrices, and is intended to be transparent to the user and easily extensible. It also provides a set of test matrices, computation of circulant preconditioners, and two fast algorithms for Toeplitz linear systems.Comment: 19 pages, 1 figure, 1 typo corrected in the abstrac

    Delineation of prognostic biomarkers in prostate cancer

    Full text link
    Prostate cancer is the most frequently diagnosed cancer in American men(1,2). Screening for prostate-specific antigen (PSA) has led to earlier detection of prostate cancer(3), but elevated serum PSA levels may be present in non-malignant conditions such as benign prostatic hyperlasia (BPH). Characterization of gene-expression profiles that molecularly distinguish prostatic neoplasms may identify genes involved in prostate carcinogenesis, elucidate clinical biomarkers, and lead to an improved classification of prostate cancer(4-6). Using microarrays of complementary DNA, we examined gene-expression profiles of more than 50 normal and neoplastic prostate specimens and three common prostate-cancer cell lines. Signature expression profiles of normal adjacent prostate (NAP), BPH, localized prostate cancer, and metastatic, hormone-refractory prostate cancer were determined. Here we establish many associations between genes and prostate cancer. We assessed two of these genes-hepsin, a transmembrane serine protease, and pim-1, a serine/threonine kinase-at the protein level using tissue microarrays consisting of over 700 clinically stratified prostate-cancer specimens. Expression of hepsin and pim-1 proteins was significantly correlated with measures of clinical outcome. Thus, the integration of cDNA microarray, high-density tissue microarray, and linked clinical and pathology data is a powerful approach to molecular profiling of human cancer.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62849/1/412822a0.pd

    Personalized medicine in psoriasis: developing a genomic classifier to predict histological response to Alefacept

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Alefacept treatment is highly effective in a select group patients with moderate-to-severe psoriasis, and is an ideal candidate to develop systems to predict who will respond to therapy. A clinical trial of 22 patients with moderate to severe psoriasis treated with alefacept was conducted in 2002-2003, as a mechanism of action study. Patients were classified as responders or non-responders to alefacept based on histological criteria. Results of the original mechanism of action study have been published. Peripheral blood was collected at the start of this clinical trial, and a prior analysis demonstrated that gene expression in PBMCs differed between responders and non-responders, however, the analysis performed could not be used to predict response.</p> <p>Methods</p> <p>Microarray data from PBMCs of 16 of these patients was analyzed to generate a treatment response classifier. We used a discriminant analysis method that performs sample classification from gene expression data, via "nearest shrunken centroid method". Centroids are the average gene expression for each gene in each class divided by the within-class standard deviation for that gene.</p> <p>Results</p> <p>A disease response classifier using 23 genes was created to accurately predict response to alefacept (12.3% error rate). While the genes in this classifier should be considered as a group, some of the individual genes are of great interest, for example, cAMP response element modulator (CREM), v-MAF avian musculoaponeurotic fibrosarcoma oncogene family (MAFF), chloride intracellular channel protein 1 (CLIC1, also called NCC27), NLR family, pyrin domain-containing 1 (NLRP1), and CCL5 (chemokine, cc motif, ligand 5, also called regulated upon activation, normally T expressed, and presumably secreted/RANTES).</p> <p>Conclusions</p> <p>Although this study is small, and based on analysis of existing microarray data, we demonstrate that a treatment response classifier for alefacept can be created using gene expression of PBMCs in psoriasis. This preliminary study may provide a useful tool to predict response of psoriatic patients to alefacept.</p

    Nearest Template Prediction: A Single-Sample-Based Flexible Class Prediction with Confidence Assessment

    Get PDF
    Gene-expression signature-based disease classification and clinical outcome prediction has not been widely introduced in clinical medicine as initially expected, mainly due to the lack of extensive validation needed for its clinical deployment. Obstacles include variable measurement in microarray assay, inconsistent assay platform, analytical requirement for comparable pair of training and test datasets, etc. Furthermore, as medical device helping clinical decision making, the prediction needs to be made for each single patient with a measure of its reliability. To address these issues, there is a need for flexible prediction method less sensitive to difference in experimental and analytical conditions, applicable to each single patient, and providing measure of prediction confidence. The nearest template prediction (NTP) method provides a convenient way to make class prediction with assessment of prediction confidence computed in each single patient's gene-expression data using only a list of signature genes and a test dataset. We demonstrate that the method can be flexibly applied to cross-platform, cross-species, and multiclass predictions without any optimization of analysis parameters

    Impact and Cost-Effectiveness of Culture for Diagnosis of Tuberculosis in HIV-Infected Brazilian Adults

    Get PDF
    Culture of Mycobacterium tuberculosis currently represents the closest "gold standard" for diagnosis of tuberculosis (TB), but operational data are scant on the impact and cost-effectiveness of TB culture for human immunodeficiency (HIV-) infected individuals in resource-limited settings.We recorded costs, laboratory results, and dates of initiating TB therapy in a centralized TB culture program for HIV-infected patients in Rio de Janeiro, Brazil, constructing a decision-analysis model to estimate the incremental cost-effectiveness of TB culture from the perspective of a public-sector TB control program. Of 217 TB suspects presenting between January 2006 and March 2008, 33 (15%) had culture-confirmed active tuberculosis; 23 (70%) were smear-negative. Among smear-negative, culture-positive patients, 6 (26%) began TB therapy before culture results were available, 11 (48%) began TB therapy after culture result availability, and 6 (26%) did not begin TB therapy within 180 days of presentation. The cost per negative culture was US17.52(solidmedia)17.52 (solid media)-23.50 (liquid media). Per 1,000 TB suspects and compared with smear alone, TB culture with solid media would avert an estimated eight TB deaths (95% simulation interval [SI]: 4, 15) and 37 disability-adjusted life years (DALYs) (95% SI: 13, 76), at a cost of 36(9536 (95% SI: 25, 50)perTBsuspector50) per TB suspect or 962 (95% SI: 469,469, 2642) per DALY averted. Replacing solid media with automated liquid culture would avert one further death (95% SI: -1, 4) and eight DALYs (95% SI: -4, 23) at 2751perDALY(952751 per DALY (95% SI: 680, dominated). The cost-effectiveness of TB culture was more sensitive to characteristics of the existing TB diagnostic system than to the accuracy or cost of TB culture.TB culture is potentially effective and cost-effective for HIV-positive patients in resource-constrained settings. Reliable transmission of culture results to patients and integration with existing systems are essential

    Deregulation upon DNA damage revealed by joint analysis of context-specific perturbation data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Deregulation between two different cell populations manifests itself in changing gene expression patterns and changing regulatory interactions. Accumulating knowledge about biological networks creates an opportunity to study these changes in their cellular context.</p> <p>Results</p> <p>We analyze re-wiring of regulatory networks based on cell population-specific perturbation data and knowledge about signaling pathways and their target genes. We quantify deregulation by merging regulatory signal from the two cell populations into one score. This joint approach, called JODA, proves advantageous over separate analysis of the cell populations and analysis without incorporation of knowledge. JODA is implemented and freely available in a Bioconductor package 'joda'.</p> <p>Conclusions</p> <p>Using JODA, we show wide-spread re-wiring of gene regulatory networks upon neocarzinostatin-induced DNA damage in Human cells. We recover 645 deregulated genes in thirteen functional clusters performing the rich program of response to damage. We find that the clusters contain many previously characterized neocarzinostatin target genes. We investigate connectivity between those genes, explaining their cooperation in performing the common functions. We review genes with the most extreme deregulation scores, reporting their involvement in response to DNA damage. Finally, we investigate the indirect impact of the ATM pathway on the deregulated genes, and build a hypothetical hierarchy of direct regulation. These results prove that JODA is a step forward to a systems level, mechanistic understanding of changes in gene regulation between different cell populations.</p

    Edwardsiella Comparative Phylogenomics Reveal the New Intra/Inter-Species Taxonomic Relationships, Virulence Evolution and Niche Adaptation Mechanisms

    Get PDF
    Edwardsiella bacteria are leading fish pathogens causing huge losses to aquaculture industries worldwide. E. tarda is a broad-host range pathogen that infects more than 20 species of fish and other animals including humans while E. ictaluri is host-adapted to channel catfish causing enteric septicemia of catfish (ESC). Thus, these two species consist of a useful comparative system for studying the intricacies of pathogen evolution. Here we present for the first time the phylogenomic comparisons of 8 genomes of E. tarda and E. ictaluri isolates. Genome-based phylogenetic analysis revealed that E. tarda could be separate into two kinds of genotypes (genotype I, EdwGI and genotype II, EdwGII) based on the sequence similarity. E. tarda strains of EdwGI were clustered together with the E. ictaluri lineage and showed low sequence conservation to E. tarda strains of EdwGII. Multilocus sequence analysis (MLSA) of 48 distinct Edwardsiella strains also supports the new taxonomic relationship of the lineages. We identified the type III and VI secretion systems (T3SS and T6SS) as well as iron scavenging related genes that fulfilled the criteria of a key evolutionary factor likely facilitating the virulence evolution and adaptation to a broad range of hosts in EdwGI E. tarda. The surface structure-related genes may underlie the adaptive evolution of E. ictaluri in the host specification processes. Virulence and competition assays of the null mutants of the representative genes experimentally confirmed their contributive roles in the evolution/niche adaptive processes. We also reconstructed the hypothetical evolutionary pathway to highlight the virulence evolution and niche adaptation mechanisms of Edwardsiella. This study may facilitate the development of diagnostics, vaccines, and therapeutics for this under-studied pathogen
    corecore