280,326 research outputs found

    Towards a Learning Theory of Cause-Effect Inference

    Full text link
    We pose causal inference as the problem of learning to classify probability distributions. In particular, we assume access to a collection {(Si,li)}i=1n\{(S_i,l_i)\}_{i=1}^n, where each SiS_i is a sample drawn from the probability distribution of Xi×YiX_i \times Y_i, and lil_i is a binary label indicating whether "XiYiX_i \to Y_i" or "XiYiX_i \leftarrow Y_i". Given these data, we build a causal inference rule in two steps. First, we featurize each SiS_i using the kernel mean embedding associated with some characteristic kernel. Second, we train a binary classifier on such embeddings to distinguish between causal directions. We present generalization bounds showing the statistical consistency and learning rates of the proposed approach, and provide a simple implementation that achieves state-of-the-art cause-effect inference. Furthermore, we extend our ideas to infer causal relationships between more than two variables

    Rgtsp: a generalized top scoring pairs package for class prediction

    Get PDF
    Summary: A top scoring pair (TSP) classifier consists of a pair of variables whose relative ordering can be used for accurately predicting the class label of a sample. This classification rule has the advantage of being easily interpretable and more robust against technical variations in data, as those due to different microarray platforms. Here we describe a parallel implementation of this classifier which significantly reduces the training time, and a number of extensions, including a multi-class approach, which has the potential of improving the classification performance. Availability and Implementation: Full C++ source code and R package Rgtsp are freely available from http://lausanne.isb-sib.ch/~vpopovic/research/. The implementation relies on existing OpenMP libraries. Contact: [email protected]

    Efficient Contextual Measures for Classification of Multispectral Image Data

    Get PDF
    The most common method for labeling multispectral image data classifies each pixel entirely on the basis of its own spectral signature. Such a method neither utilizes contextual information in the image nor does it incorporate secondary information related to the scene. This exclusion is generally due to the poor cost/performance efficiency of most contextual algorithms and a lack of knowledge concerning how to relate variables from different sources. In this research, several efficient spatial context measures are developed from different structural models for four-nearest-neighbor neighborhoods. Most of these measures rely on simple manipulations of label probabilities generated by a noncontextual classifier. They are efficient computationally and are effective in improving classification accuracy over the noncontextual result. Among other schemata, the measures include: average label probabilities in a neighborhood; label probabilities; combined as a function of a metric in the label probability space; and context through semantic constraints within a Bayesian framework. In addition, an efficient implementation of a contextual classifier based on compound decision theory is developed through a simplification of the structure of the contextual prior probability^ No accuracy is lost through the simplification, but computational speed is increased 15-fold. Finally, a procedure to combine label probabilities from independent data sources is proposed. A mechanism for combining the label probabilities from each of the sources as a function of their independent classification accuracies is created and evaluated

    Význam používání údajů o nutriční hodnotě na obalu výrobků k dosažení zdravějších stravovacích návyků ve Španělsku

    Get PDF
    This study aims at testing a theoretical model explaining why people follow healthy eating habits and in particular to identify how the nutritional labels use influences this behavioural pattern. The results indicate that the individuals who utilise more often the nutritional labels follow healthier eating habits, such as avoiding snacking between meals, a lower intake of salt and avoiding the fat intake. According the factors explaining the label use, the health knowledge, the bad health status and being aware of the diet-health relation are found significant. Age and household size are the sociodemographic variables which also affect the nutritional label use and eating habits. Findings provide more evidence on the consumers’ underlying motivations to pay attention to nutritional labelling, which allows evaluating the impact of the implementation of the regulation Ec 1924/2006 of the European Parliament and the council of 20 December 2006 on nutritional and health claims made on foods (regulation Ec 1924/2006). in addition, empirical results could help the local policy makers to establish appropriate market strategies to increase healthy eating habits by promoting the nutritional label use by consumers

    The Hep-CORE policy score: A European hepatitis C national policy implementation ranking based on patient organization data.

    Get PDF
    BACKGROUND content: New hepatitis C virus (HCV) treatments spurred the World Health Organization (WHO) in 2016 to adopt a strategy to eliminate HCV as a public health threat by 2030. To achieve this, key policies must be implemented. In the absence of monitoring mechanisms, this study aims to assess the extent of policy implementation from the perspective of liver patient groups. - Label: METHODS content: "Thirty liver patient organisations, each representing a country, were surveyed in October 2018 to assess implementation of HCV policies in practice. Respondents received two sets of questions based on: 1) WHO recommendations; and 2) validated data sources verifying an existing policy in their country. Academic experts selected key variables from each set for inclusion into policy scores. The similarity scores were calculated for each set with a multiple joint correspondence analysis. Proxy reference countries were included as the baseline to contextualize results. We extracted scores for each country and standardized them from 0 to 10 (best)." - Label: RESULTS content: Twenty-five countries responded. For the score based on WHO recommendations, Bulgaria had the lowest score whereas five countries (Cyprus, Netherlands, Portugal, Slovenia, and Sweden) had the highest scores. For the verified policy score, a two-dimensional solution was identified; first dimension scores pertained to whether verified policies were in place and second dimension scores pertained to the proportion of verified policies in-place that were implemented. Spain, UK, and Sweden had high scores for both dimensions. - Label: CONCLUSIONS content: Patient groups reported that the European region is not on track to meet WHO 2030 HCV goals. More action should be taken to implement and monitor HCV policies

    On the Implementation of GNU Prolog

    Get PDF
    GNU Prolog is a general-purpose implementation of the Prolog language, which distinguishes itself from most other systems by being, above all else, a native-code compiler which produces standalone executables which don't rely on any byte-code emulator or meta-interpreter. Other aspects which stand out include the explicit organization of the Prolog system as a multipass compiler, where intermediate representations are materialized, in Unix compiler tradition. GNU Prolog also includes an extensible and high-performance finite domain constraint solver, integrated with the Prolog language but implemented using independent lower-level mechanisms. This article discusses the main issues involved in designing and implementing GNU Prolog: requirements, system organization, performance and portability issues as well as its position with respect to other Prolog system implementations and the ISO standardization initiative.Comment: 30 pages, 3 figures, To appear in Theory and Practice of Logic Programming (TPLP); Keywords: Prolog, logic programming system, GNU, ISO, WAM, native code compilation, Finite Domain constraint

    Bounded Concurrent Timestamp Systems Using Vector Clocks

    Full text link
    Shared registers are basic objects used as communication mediums in asynchronous concurrent computation. A concurrent timestamp system is a higher typed communication object, and has been shown to be a powerful tool to solve many concurrency control problems. It has turned out to be possible to construct such higher typed objects from primitive lower typed ones. The next step is to find efficient constructions. We propose a very efficient wait-free construction of bounded concurrent timestamp systems from 1-writer multireader registers. This finalizes, corrects, and extends, a preliminary bounded multiwriter construction proposed by the second author in 1986. That work partially initiated the current interest in wait-free concurrent objects, and introduced a notion of discrete vector clocks in distributed algorithms.Comment: LaTeX source, 35 pages; To apper in: J. Assoc. Comp. Mac

    SafeWeb: A Middleware for Securing Ruby-Based Web Applications

    Get PDF
    Web applications in many domains such as healthcare and finance must process sensitive data, while complying with legal policies regarding the release of different classes of data to different parties. Currently, software bugs may lead to irreversible disclosure of confidential data in multi-tier web applications. An open challenge is how developers can guarantee these web applications only ever release sensitive data to authorised users without costly, recurring security audits. Our solution is to provide a trusted middleware that acts as a “safety net” to event-based enterprise web applications by preventing harmful data disclosure before it happens. We describe the design and implementation of SafeWeb, a Ruby-based middleware that associates data with security labels and transparently tracks their propagation at different granularities across a multi-tier web architecture with storage and complex event processing. For efficiency, maintainability and ease-of-use, SafeWeb exploits the dynamic features of the Ruby programming language to achieve label propagation and data flow enforcement. We evaluate SafeWeb by reporting our experience of implementing a web-based cancer treatment application and deploying it as part of the UK National Health Service (NHS)
    corecore