115 research outputs found

    Classifying and scoring of molecules with the NGN: new datasets, significance tests, and generalization

    Get PDF
    <p>Abstract</p> <p/> <p>This paper demonstrates how a Neural Grammar Network learns to classify and score molecules for a variety of tasks in chemistry and toxicology. In addition to a more detailed analysis on datasets previously studied, we introduce three new datasets (BBB, FXa, and toxicology) to show the generality of the approach. A new experimental methodology is developed and applied to both the new datasets as well as previously studied datasets. This methodology is rigorous and statistically grounded, and ultimately culminates in a Wilcoxon significance test that proves the effectiveness of the system. We further include a complete generalization of the specific technique to arbitrary grammars and datasets using a mathematical abstraction that allows researchers in different domains to apply the method to their own work.</p> <p>Background</p> <p>Our work can be viewed as an alternative to existing methods to solve the quantitative structure-activity relationship (QSAR) problem. To this end, we review a number approaches both from a methodological and also a performance perspective. In addition to these approaches, we also examined a number of chemical properties that can be used by generic classifier systems, such as feed-forward artificial neural networks. In studying these approaches, we identified a set of interesting benchmark problem sets to which many of the above approaches had been applied. These included: ACE, AChE, AR, BBB, BZR, Cox2, DHFR, ER, FXa, GPB, Therm, and Thr. Finally, we developed our own benchmark set by collecting data on toxicology.</p> <p>Results</p> <p>Our results show that our system performs better than, or comparatively to, the existing methods over a broad range of problem types. Our method does not require the expert knowledge that is necessary to apply the other methods to novel problems.</p> <p>Conclusions</p> <p>We conclude that our success is due to the ability of our system to: 1) encode molecules losslessly before presentation to the learning system, and 2) leverage the design of molecular description languages to facilitate the identification of relevant structural attributes of the molecules over different problem domains.</p

    Social Class

    Get PDF
    Discussion of class structure in fifth-century Athens, historical constitution of theater audiences, and the changes in the comic representation of class antagonism from Aristophanes to Menander

    A reference panel of 64,976 haplotypes for genotype imputation.

    Get PDF
    We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-genome sequence data from 20 studies of predominantly European ancestry. Using this resource leads to accurate genotype imputation at minor allele frequencies as low as 0.1% and a large increase in the number of SNPs tested in association studies, and it can help to discover and refine causal loci. We describe remote server resources that allow researchers to carry out imputation and phasing consistently and efficiently

    A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants.

    Get PDF
    This is the author accepted manuscript. The final version is available from Nature Publishing Group via http://dx.doi.org/10.1038/ng.3448Advanced age-related macular degeneration (AMD) is the leading cause of blindness in the elderly, with limited therapeutic options. Here we report on a study of >12 million variants, including 163,714 directly genotyped, mostly rare, protein-altering variants. Analyzing 16,144 patients and 17,832 controls, we identify 52 independently associated common and rare variants (P < 5 × 10(-8)) distributed across 34 loci. Although wet and dry AMD subtypes exhibit predominantly shared genetics, we identify the first genetic association signal specific to wet AMD, near MMP9 (difference P value = 4.1 × 10(-10)). Very rare coding variants (frequency <0.1%) in CFH, CFI and TIMP3 suggest causal roles for these genes, as does a splice variant in SLC16A8. Our results support the hypothesis that rare coding variants can pinpoint causal genes within known genetic loci and illustrate that applying the approach systematically to detect new loci requires extremely large sample sizes.We thank all participants of all the studies included for enabling this research by their participation in these studies. Computer resources for this project have been provided by the high-performance computing centers of the University of Michigan and the University of Regensburg. Group-specific acknowledgments can be found in the Supplementary Note. The Center for Inherited Diseases Research (CIDR) Program contract number is HHSN268201200008I. This and the main consortium work were predominantly funded by 1X01HG006934-01 to G.R.A. and R01 EY022310 to J.L.H

    Expression of Tas1 Taste Receptors in Mammalian Spermatozoa: Functional Role of Tas1r1 in Regulating Basal Ca2+ and cAMP Concentrations in Spermatozoa

    Get PDF
    Background: During their transit through the female genital tract, sperm have to recognize and discriminate numerous chemical compounds. However, our current knowledge of the molecular identity of appropriate chemosensory receptor proteins in sperm is still rudimentary. Considering that members of the Tas1r family of taste receptors are able to discriminate between a broad diversity of hydrophilic chemosensory substances, the expression of taste receptors in mammalian spermatozoa was examined. Methodology/Principal Findings: The present manuscript documents that Tas1r1 and Tas1r3, which form the functional receptor for monosodium glutamate (umami) in taste buds on the tongue, are expressed in murine and human spermatozoa, where their localization is restricted to distinct segments of the flagellum and the acrosomal cap of the sperm head. Employing a Tas1r1-deficient mCherry reporter mouse strain, we found that Tas1r1 gene deletion resulted in spermatogenic abnormalities. In addition, a significant increase in spontaneous acrosomal reaction was observed in Tas1r1 null mutant sperm whereas acrosomal secretion triggered by isolated zona pellucida or the Ca2+ ionophore A23187 was not different from wild-type spermatozoa. Remarkably, cytosolic Ca2+ levels in freshly isolated Tas1r1-deficient sperm were significantly higher compared to wild-type cells. Moreover, a significantly higher basal cAMP concentration was detected in freshly isolated Tas1r1-deficient epididymal spermatozoa, whereas upon inhibition of phosphodiesterase or sperm capacitation, the amount of cAMP was not different between both genotypes. Conclusions/Significance: Since Ca2+ and cAMP control fundamental processes during the sequential process of fertilization, we propose that the identified taste receptors and coupled signaling cascades keep sperm in a chronically quiescent state until they arrive in the vicinity of the egg - either by constitutive receptor activity and/or by tonic receptor activation by gradients of diverse chemical compounds in different compartments of the female reproductive tract

    The language(s) of comedy

    Get PDF

    exomeSuite: Whole exome sequence variant filtering tool for rapid identification of putative disease causing SNVs/indels

    Get PDF
    Exome and whole-genome analyses powered by next-generation sequencing (NGS) have become invaluable tools in identifying causal mutations responsible for Mendelian disorders. Given that individual exomes contain several thousand single nucleotide variants and insertions/deletions, it remains a challenge to analyze large numbers of variants from multiple exomes to identify causal alleles associated with inherited conditions. To this end, we have developed user-friendly software that analyzes variant calls from multiple individuals to facilitate identification of causal mutations. The software, termed exomeSuite, filters for putative causative variants of monogenic diseases inherited in one of three forms: dominant, recessive caused by a homozygous variant, or recessive caused by two compound heterozygous variants. In addition, exomeSuite can perform homozygosity mapping and analyze the variant data of multiple unrelated individuals. Here we demonstrate that filtering of variants with exomeSuite reduces datasets to a fraction of a percent of their original size. To the best of our knowledge this is the first freely available software developed to analyze variant data from multiple individuals that rapidly assimilates and filters large data sets based on pattern of inheritance
    corecore