73,778 research outputs found

    Cached Sufficient Statistics for Efficient Machine Learning with Large Datasets

    Full text link
    This paper introduces new algorithms and data structures for quick counting for machine learning datasets. We focus on the counting task of constructing contingency tables, but our approach is also applicable to counting the number of records in a dataset that match conjunctive queries. Subject to certain assumptions, the costs of these operations can be shown to be independent of the number of records in the dataset and loglinear in the number of non-zero entries in the contingency table. We provide a very sparse data structure, the ADtree, to minimize memory use. We provide analytical worst-case bounds for this structure for several models of data distribution. We empirically demonstrate that tractably-sized data structures can be produced for large real-world datasets by (a) using a sparse tree structure that never allocates memory for counts of zero, (b) never allocating memory for counts that can be deduced from other counts, and (c) not bothering to expand the tree fully near its leaves. We show how the ADtree can be used to accelerate Bayes net structure finding algorithms, rule learning algorithms, and feature selection algorithms, and we provide a number of empirical results comparing ADtree methods against traditional direct counting approaches. We also discuss the possible uses of ADtrees in other machine learning methods, and discuss the merits of ADtrees in comparison with alternative representations such as kd-trees, R-trees and Frequent Sets.Comment: See http://www.jair.org/ for any accompanying file

    Recognising Desire: A psychosocial approach to understanding education policy implementation and effect

    Get PDF
    It is argued that in order to understand the ways in which teachers experience their work - including the idiosyncratic ways in which they respond to and implement mandated education policy - it is necessary to take account both of sociological and of psychological issues. The paper draws on original research with practising and beginning teachers, and on theories of social and psychic induction, to illustrate the potential benefits of this bipartisan approach for both teachers and researchers. Recognising the significance of (but somewhat arbitrary distinction between) structure and agency in teachers’ practical and ideological positionings, it is suggested that teachers’ responses to local and central policy changes are governed by a mix of pragmatism, social determinism and often hidden desires. It is the often underacknowledged strength of desire that may tip teachers into accepting and implementing policies with which they are not ideologically comfortable

    Generating entangled atom-photon pairs from Bose-Einstein condensates

    Get PDF
    We propose using spontaneous Raman scattering from an optically driven Bose-Einstein condensate as a source of atom-photon pairs whose internal states are maximally entangled. Generating entanglement between a particle which is easily transmitted (the photon) and one which is easily trapped and coherently manipulated (an ultracold atom) will prove useful for a variety of quantum-information related applications. We analyze the type of entangled states generated by spontaneous Raman scattering and construct a geometry which results in maximum entanglement

    Advanced cogeneration research study: Executive summary

    Get PDF
    This study provides a broad based overview of selected areas relevant to the development of a comprehensive Southern California Edison (SCE) advanced cogeneration project. The areas studied are: (1) Cogeneration potential in the SCE service territory; (2) Advanced cogeneration technologies; and (3) Existing cogeneration computer models. An estimated 3700 MW sub E could potentially be generated from existing industries in the Southern California Edison service territory using cogeneration technology. Of this total, current technology could provide 2600 MW sub E and advanced technology could provide 1100 MW sub E. The manufacturing sector (SIC Codes 20-39) was found to have the highest average potential for current cogeneration technology. The mining sector (SIC Codes 10-14) was found to have the highest potential for advanced technology

    Making postgraduate students and supervisors aware of the role of emotions in the PhD process

    Get PDF
    Emotions are an integral part of the PhD process. A range of emotions are common and to be expected. How do emotions affect the PhD process for both postgraduate students and their supervisors? How can we make our emotions work positively for us in the PhD process? To explore answers to these questions, three lecturers currently supervising postgraduates and three postgraduates at various stages in their doctoral studies collectively pooled their experiences. We developed an interactive workshop that was recently conducted for postgraduate students at Murdoch University and at the Australian Association for Social Research annual conference 2002. This presentation will explore the role that emotions play in the PhD process and how supervisors and postgraduates alike can benefit from reflecting on this issue. A number of practical (and humorous) tips will be provided as well as examples from others' PhD experiences. The role of emotions at the beginning, middle and end of a PhD program will be explored. The data collection and analysis phases are a time when emotions may run riot. Trepidation is especially common when fieldwork or data collection is involved, as is anger when postgraduate's views about how the world works are challenged and then sadness (and relief!) when the data collection phase is finished. We will discuss how supervisors can assist their postgraduates to make these feelings work for them. The presentation will also explore the emotions that arise from the supervisor-postgraduate partnership

    First principles theory of fluctuations in vortex liquids and solids

    Full text link
    Consistent perturbation theory for thermodynamical quantities in type II superconductors in magnetic field at low temperatures is developed. It is complementary to the existing expansion valid at high temperatures. Magnetization and specific heat are calculated to two loop order and compare well to existing Monte Carlo simulations and experiments.Comment: 3 .ps fig. In press Phys. Rev.

    Speech-plans: Generating evaluative responses in spoken dialogue

    Get PDF
    Recent work on evaluation of spoken dialogue systems indicates that better algorithms are needed for the presentation of complex information in speech. Current dialogue systems often rely on presenting sets of options and their attributes sequentially. This places a large memory burden on users, who have to remember complex trade-offs between multiple options and their attributes. To address these problems we build on previous work using multiattribute decision theory to devise speech-planning algorithms that present usertailored summaries, comparisons and recommendations that allow users to focus on critical differences between options and their attributes. We discuss the differences between speech and text planning that result from the particular demands of the speech situation.
    • 

    corecore