103 research outputs found

    The EM Algorithm and the Rise of Computational Biology

    Get PDF
    In the past decade computational biology has grown from a cottage industry with a handful of researchers to an attractive interdisciplinary field, catching the attention and imagination of many quantitatively-minded scientists. Of interest to us is the key role played by the EM algorithm during this transformation. We survey the use of the EM algorithm in a few important computational biology problems surrounding the "central dogma"; of molecular biology: from DNA to RNA and then to proteins. Topics of this article include sequence motif discovery, protein sequence alignment, population genetics, evolutionary models and mRNA expression microarray data analysis.Comment: Published in at http://dx.doi.org/10.1214/09-STS312 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Precision Counting of Small Black Holes

    Get PDF
    It has recently been proposed that a class of supersymmetric higher-derivative interactions in N=2 supergravity may encapsulate an infinite number of finite size corrections to the microscopic entropy of certain supersymmetric black holes. If this proposal is correct, it allows one to probe the string theory description of black-hole micro-states to far greater accuracy than has been possible before. We test this proposal for ``small'' black holes whose microscopic degeneracies can be computed exactly by counting the corresponding perturbative BPS states. We also study the ``black hole partition sum'' using general properties of of BPS degeneracies. This complements and extends our earlier work in hep-th/0502157Comment: 103 pages, uses JHEP3.cl

    Perspectives on the CAP Theorem

    Get PDF
    Almost twelve years ago, in 2000, Eric Brewer introduced the idea that there is a fundamental trade-off between consistency, availability, and partition tolerance. This trade-off, which has become known as the CAP Theorem, has been widely discussed ever since. In this paper, we review the CAP Theorem and situate it within the broader context of distributed computing theory. We then discuss the practical implications of the CAP Theorem, and explore some general techniques for coping with the inherent trade-offs that it implies

    A survey of statistical network models

    Full text link
    Networks are ubiquitous in science and have become a focal point for discussion in everyday life. Formal statistical models for the analysis of network data have emerged as a major topic of interest in diverse areas of study, and most of these involve a form of graphical representation. Probability models on graphs date back to 1959. Along with empirical studies in social psychology and sociology from the 1960s, these early works generated an active network community and a substantial literature in the 1970s. This effort moved into the statistical literature in the late 1970s and 1980s, and the past decade has seen a burgeoning network literature in statistical physics and computer science. The growth of the World Wide Web and the emergence of online networking communities such as Facebook, MySpace, and LinkedIn, and a host of more specialized professional network communities has intensified interest in the study of networks and network data. Our goal in this review is to provide the reader with an entry point to this burgeoning literature. We begin with an overview of the historical development of statistical network modeling and then we introduce a number of examples that have been studied in the network literature. Our subsequent discussion focuses on a number of prominent static and dynamic network models and their interconnections. We emphasize formal model descriptions, and pay special attention to the interpretation of parameters and their estimation. We end with a description of some open problems and challenges for machine learning and statistics.Comment: 96 pages, 14 figures, 333 reference

    Multilingual Part-of-Speech Tagging: Two Unsupervised Approaches

    Full text link
    We demonstrate the effectiveness of multilingual learning for unsupervised part-of-speech tagging. The central assumption of our work is that by combining cues from multiple languages, the structure of each becomes more apparent. We consider two ways of applying this intuition to the problem of unsupervised part-of-speech tagging: a model that directly merges tag structures for a pair of languages into a single sequence and a second model which instead incorporates multilingual context using latent variables. Both approaches are formulated as hierarchical Bayesian models, using Markov Chain Monte Carlo sampling techniques for inference. Our results demonstrate that by incorporating multilingual evidence we can achieve impressive performance gains across a range of scenarios. We also found that performance improves steadily as the number of available languages increases

    Automatic Reconfiguration for Large-Scale Reliable Storage Systems

    Get PDF
    Byzantine-fault-tolerant replication enhances the availability and reliability of Internet services that store critical state and preserve it despite attacks or software errors. However, existing Byzantine-fault-tolerant storage systems either assume a static set of replicas, or have limitations in how they handle reconfigurations (e.g., in terms of the scalability of the solutions or the consistency levels they provide). This can be problematic in long-lived, large-scale systems where system membership is likely to change during the system lifetime. In this paper, we present a complete solution for dynamically changing system membership in a large-scale Byzantine-fault-tolerant system. We present a service that tracks system membership and periodically notifies other system nodes of membership changes. The membership service runs mostly automatically, to avoid human configuration errors; is itself Byzantine-fault-tolerant and reconfigurable; and provides applications with a sequence of consistent views of the system membership. We demonstrate the utility of this membership service by using it in a novel distributed hash table called dBQS that provides atomic semantics even across changes in replica sets. dBQS is interesting in its own right because its storage algorithms extend existing Byzantine quorum protocols to handle changes in the replica set, and because it differs from previous DHTs by providing Byzantine fault tolerance and offering strong semantics. We implemented the membership service and dBQS. Our results show that the approach works well, in practice: the membership service is able to manage a large system and the cost to change the system membership is low

    A Uniform Min-Max Theorem and Characterizations of Computational Randomness

    Get PDF
    This thesis develops several tools and techniques using ideas from information theory, optimization, and online learning, and applies them to a number of highly related fundamental problems in complexity theory, pseudorandomness theory, and cryptography.Engineering and Applied Science

    Effects of Intravenous Ketamine on Explicit and Implicit Measures of Suicidality in Treatment-Resistant Depression

    Get PDF
    Background Intravenous ketamine has shown rapid antidepressant effects in early trials, making it a potentially attractive candidate for depressed patients at imminent risk of suicide. The Implicit Association Test (IAT), a performance-based measure of association between concepts, may have utility in suicide assessment. Methods Twenty-six patients with treatment-resistant depression were assessed using the suicidality item of the Montgomery-Asberg Depression Rating Scale (MADRS-SI) 2 hours before and 24 hours following a single subanesthetic dose of intravenous ketamine. Ten patients also completed IATs assessing implicit suicidal associations at comparable time points. In a second study, nine patients received thrice-weekly ketamine infusions over a 12-day period. Results Twenty-four hours after a single infusion, MADRS-SI scores were reduced on average by 2.08 points on a 0 to 6 scale (p < .001; d = 1.37), and 81% of patients received a rating of 0 or 1 postinfusion. Implicit suicidal associations were also reduced following ketamine (p = .003; d = 1.36), with reductions correlated across implicit and explicit measures. MADRS-SI reductions were sustained for 12 days by repeated-dose ketamine (p < .001; d = 2.42). Conclusions These preliminary findings support the premise that ketamine has rapid beneficial effects on suicidal cognition and warrants further study.Psycholog
    • …
    corecore