26 research outputs found

    Lung eQTLs to Help Reveal the Molecular Underpinnings of Asthma

    Get PDF
    Genome-wide association studies (GWAS) have identified loci reproducibly associated with pulmonary diseases; however, the molecular mechanism underlying these associations are largely unknown. The objectives of this study were to discover genetic variants affecting gene expression in human lung tissue, to refine susceptibility loci for asthma identified in GWAS studies, and to use the genetics of gene expression and network analyses to find key molecular drivers of asthma. We performed a genome-wide search for expression quantitative trait loci (eQTL) in 1,111 human lung samples. The lung eQTL dataset was then used to inform asthma genetic studies reported in the literature. The top ranked lung eQTLs were integrated with the GWAS on asthma reported by the GABRIEL consortium to generate a Bayesian gene expression network for discovery of novel molecular pathways underpinning asthma. We detected 17,178 cis- and 593 trans- lung eQTLs, which can be used to explore the functional consequences of loci associated with lung diseases and traits. Some strong eQTLs are also asthma susceptibility loci. For example, rs3859192 on chr17q21 is robustly associated with the mRNA levels of GSDMA (P = 3.55 × 10(-151)). The genetic-gene expression network identified the SOCS3 pathway as one of the key drivers of asthma. The eQTLs and gene networks identified in this study are powerful tools for elucidating the causal mechanisms underlying pulmonary disease. This data resource offers much-needed support to pinpoint the causal genes and characterize the molecular function of gene variants associated with lung diseases

    Using random forest and decision tree models for a new vehicle prediction approach in computational toxicology

    Get PDF
    yesDrug vehicles are chemical carriers that provide beneficial aid to the drugs they bear. Taking advantage of their favourable properties can potentially allow the safer use of drugs that are considered highly toxic. A means for vehicle selection without experimental trial would therefore be of benefit in saving time and money for the industry. Although machine learning is increasingly used in predictive toxicology, to our knowledge there is no reported work in using machine learning techniques to model drug-vehicle relationships for vehicle selection to minimise toxicity. In this paper we demonstrate the use of data mining and machine learning techniques to process, extract and build models based on classifiers (decision trees and random forests) that allow us to predict which vehicle would be most suited to reduce a drug’s toxicity. Using data acquired from the National Institute of Health’s (NIH) Developmental Therapeutics Program (DTP) we propose a methodology using an area under a curve (AUC) approach that allows us to distinguish which vehicle provides the best toxicity profile for a drug and build classification models based on this knowledge. Our results show that we can achieve prediction accuracies of 80 % using random forest models whilst the decision tree models produce accuracies in the 70 % region. We consider our methodology widely applicable within the scientific domain and beyond for comprehensively building classification models for the comparison of functional relationships between two variables

    AMI observations of northern supernova remnants at 14-18 GHz

    Full text link
    We present observations between 14.2 and 17.9 GHz of 12 reported supernova remnants (SNRs) made with the Arcminute Microkelvin Imager Small Array (AMI SA). In conjunction with data from the literature at lower radio frequencies, we determine spectra of these objects. For well-studied SNRs (Cas A, Tycho's SNR, 3C58 and the Crab Nebula), the results are in good agreement with spectra based on previous results. For the less well-studied remnants the AMI SA observations provide higher-frequency radio observations than previously available, and better constrain their radio spectra. The AMI SA results confirm a spectral turnover at ~11 GHz for the filled-centre remnant G74.9+1.2. We also see a possible steepening of the spectrum of the filled-centre remnant G54.1+0.3 within the AMI SA frequency band compared with lower frequencies. We confirm that G84.9+0.5, which had previously been identified as a SNR, is rather an HII region and has a flat radio spectrum.Comment: 12 pages, 24 figures, accepted MNRA

    Dissection of a QTL Hotspot on Mouse Distal Chromosome 1 that Modulates Neurobehavioral Phenotypes and Gene Expression

    Get PDF
    A remarkably diverse set of traits maps to a region on mouse distal chromosome 1 (Chr 1) that corresponds to human Chr 1q21–q23. This region is highly enriched in quantitative trait loci (QTLs) that control neural and behavioral phenotypes, including motor behavior, escape latency, emotionality, seizure susceptibility (Szs1), and responses to ethanol, caffeine, pentobarbital, and haloperidol. This region also controls the expression of a remarkably large number of genes, including genes that are associated with some of the classical traits that map to distal Chr 1 (e.g., seizure susceptibility). Here, we ask whether this QTL-rich region on Chr 1 (Qrr1) consists of a single master locus or a mixture of linked, but functionally unrelated, QTLs. To answer this question and to evaluate candidate genes, we generated and analyzed several gene expression, haplotype, and sequence datasets. We exploited six complementary mouse crosses, and combed through 18 expression datasets to determine class membership of genes modulated by Qrr1. Qrr1 can be broadly divided into a proximal part (Qrr1p) and a distal part (Qrr1d), each associated with the expression of distinct subsets of genes. Qrr1d controls RNA metabolism and protein synthesis, including the expression of ∼20 aminoacyl-tRNA synthetases. Qrr1d contains a tRNA cluster, and this is a functionally pertinent candidate for the tRNA synthetases. Rgs7 and Fmn2 are other strong candidates in Qrr1d. FMN2 protein has pronounced expression in neurons, including in the dendrites, and deletion of Fmn2 had a strong effect on the expression of few genes modulated by Qrr1d. Our analysis revealed a highly complex gene expression regulatory interval in Qrr1, composed of multiple loci modulating the expression of functionally cognate sets of genes

    Prediction of the effect of formulation on the toxicity of chemicals

    Get PDF
    Two approaches for the prediction of which of two vehicles will result in lower toxicity for anticancer agents are presented. Machine-learning models are developed using decision tree, random forest and partial least squares methodologies and statistical evidence is presented to demonstrate that they represent valid models. Separately, a clustering method is presented that allows the ordering of vehicles by the toxicity they show for chemically-related compounds

    Emerging Pattern Mining To Aid Toxicological Knowledge Discovery

    No full text
    Knowledge-based systems for toxicity prediction are typically based on rules, known as structural alerts, that describe relationships between structural features and different toxic effects. The identification of structural features associated with toxicological activity can be a time-consuming process and often requires significant input from domain experts. Here, we describe an emerging pattern mining method for the automated identification of activating structural features in toxicity data sets that is designed to help expedite the process of alert development. We apply the contrast pattern tree mining algorithm to generate a set of emerging patterns of structural fragment descriptors. Using the emerging patterns it is possible to form hierarchical clusters of compounds that are defined by the presence of common structural features and represent distinct chemical classes. The method has been tested on a large public <i>in vitro</i> mutagenicity data set and a public hERG channel inhibition data set and is shown to be effective at identifying common toxic features and recognizable classes of toxicants. We also describe how knowledge developers can use emerging patterns to improve the specificity and sensitivity of an existing expert system
    corecore