1,216 research outputs found

    Preference Learning

    Get PDF
    This report documents the program and the outcomes of Dagstuhl Seminar 14101 “Preference Learning”. Preferences have recently received considerable attention in disciplines such as machine learning, knowledge discovery, information retrieval, statistics, social choice theory, multiple criteria decision making, decision under risk and uncertainty, operations research, and others. The motivation for this seminar was to showcase recent progress in these different areas with the goal of working towards a common basis of understanding, which should help to facilitate future synergies

    Income Distribution and Poverty in the Republic of Haiti

    Get PDF
    After decades of stagnation and economic decline coupled with political upheavals, the Republic of Haiti is today the poorest nation in the Western Hemisphere and one of the poorest in the world. The present research reveals that this country is also where income is worst distributed in the most unequal region of the world, viz., Latin America and the Carribbean. We use the 2001 Haiti Living Conditions Survey for distributive analysis and poverty assessment to try to make manifest the potential links between household well-being and individual socio-economic characteristics. One particular finding is that access to land does not help the poor escape poverty. Complementary to the inequality and poverty profiles constructed herein, a relatively new methodology using weighted least squares for complex survey is adopted to additively decompose inequality by multiple factor components. Also, we estimate a polychotomous ordered logic to investigate the risk of being indigent or poor.Republic of Haiti, inequality, multiple factor components decomposition, poverty, stochastic dominance

    Pseudo-Marginal Bayesian Inference for Gaussian Processes

    Get PDF
    The main challenges that arise when adopting Gaussian Process priors in probabilistic modeling are how to carry out exact Bayesian inference and how to account for uncertainty on model parameters when making model-based predictions on out-of-sample data. Using probit regression as an illustrative working example, this paper presents a general and effective methodology based on the pseudo-marginal approach to Markov chain Monte Carlo that efficiently addresses both of these issues. The results presented in this paper show improvements over existing sampling methods to simulate from the posterior distribution over the parameters defining the covariance function of the Gaussian Process prior. This is particularly important as it offers a powerful tool to carry out full Bayesian inference of Gaussian Process based hierarchic statistical models in general. The results also demonstrate that Monte Carlo based integration of all model parameters is actually feasible in this class of models providing a superior quantification of uncertainty in predictions. Extensive comparisons with respect to state-of-the-art probabilistic classifiers confirm this assertion.Comment: 14 pages double colum

    An easy-to-hard learning paradigm for multiple classes and multiple labels

    Full text link
    © 2017 Weiwei Liu, Ivor W. Tsang and Klaus-Robert Müller. Many applications, such as human action recognition and object detection, can be formulated as a multiclass classification problem. One-vs-rest (OVR) is one of the most widely used approaches for multiclass classification due to its simplicity and excellent performance. However, many confusing classes in such applications will degrade its results. For example, hand clap and boxing are two confusing actions. Hand clap is easily misclassified as boxing, and vice versa. Therefore, precisely classifying confusing classes remains a challenging task. To obtain better performance for multiclass classifications that have confusing classes, we first develop a classifier chain model for multiclass classification (CCMC) to transfer class information between classifiers. Then, based on an analysis of our proposed model, we propose an easy-to-hard learning paradigm for multiclass classification to automatically identify easy and hard classes and then use the predictions from simpler classes to help solve harder classes. Similar to CCMC, the classifier chain (CC) model is also proposed by Read et al. (2009) to capture the label dependency for multi-label classification. However, CC does not consider the order of difficulty of the labels and achieves degenerated performance when there are many confusing labels. Therefore, it is non-trivial to learn the appropriate label order for CC. Motivated by our analysis for CCMC, we also propose the easy-to-hard learning paradigm for multi-label classi cation to automatically identify easy and hard labels, and then use the predictions from simpler labels to help solve harder labels. We also demonstrate that our proposed strategy can be successfully applied to a wide range of applications, such as ordinal classi cation and relationship prediction. Extensive empirical studies validate our analysis and the e-ectiveness of our proposed easy-to-hard learning strategies

    Implementation and reporting of causal mediation analysis in 2015: a systematic review in epidemiological studies

    Get PDF
    BACKGROUND: Causal mediation analysis is often used to understand the impact of variables along the causal pathway of an occurrence relation. How well studies apply and report the elements of causal mediation analysis remains unknown. METHODS: We systematically reviewed epidemiological studies published in 2015 that employed causal mediation analysis to estimate direct and indirect effects of observed associations between an exposure on an outcome. We identified potential epidemiological studies through conducting a citation search within Web of Science and a keyword search within PubMed. Two reviewers independently screened studies for eligibility. For eligible studies, one reviewer performed data extraction, and a senior epidemiologist confirmed the extracted information. Empirical application and methodological details of the technique were extracted and summarized. RESULTS: Thirteen studies were eligible for data extraction. While the majority of studies reported and identified the effects of measures, most studies lacked sufficient details on the extent to which identifiability assumptions were satisfied. Although most studies addressed issues of unmeasured confounders either from empirical approaches or sensitivity analyses, the majority did not examine the potential bias arising from the measurement error of the mediator. Some studies allowed for exposure-mediator interaction and only a few presented results from models both with and without interactions. Power calculations were scarce. CONCLUSIONS: Reporting of causal mediation analysis is varied and suboptimal. Given that the application of causal mediation analysis will likely continue to increase, developing standards of reporting of causal mediation analysis in epidemiological research would be prudent

    Risk Aversion in International Relations Theory

    Get PDF
    |When international relations theorists use the concept of risk aversion, they usually cite the economics conception involving concave utility functions. However, concavity is meaningful only when the goal is measurable on an interval scale. International decisions are usually not of this type, so that many statements appearing in the literature are formally meaningless. Applications of prospect theory face this difficulty especially, as risk aversion and acceptance are at their center. This paper gives two definitions of risk attitude that do not require an interval scale. The second and more distinctive one uses the property of submodularity in place of concavity. R. D. Luce has devised a theory of choice with features of prospect theory but not requiring on an interval scale, and the second definition in combination with this theory yields the traditional claim that decision makers are risk-averse for gains and risk-seeking for losses.risk aversion, prospect theory, international relations, joint receipts, measurement theory.

    Multilayer Networks

    Full text link
    In most natural and engineered systems, a set of entities interact with each other in complicated patterns that can encompass multiple types of relationships, change in time, and include other types of complications. Such systems include multiple subsystems and layers of connectivity, and it is important to take such "multilayer" features into account to try to improve our understanding of complex systems. Consequently, it is necessary to generalize "traditional" network theory by developing (and validating) a framework and associated tools to study multilayer systems in a comprehensive fashion. The origins of such efforts date back several decades and arose in multiple disciplines, and now the study of multilayer networks has become one of the most important directions in network science. In this paper, we discuss the history of multilayer networks (and related concepts) and review the exploding body of work on such networks. To unify the disparate terminology in the large body of recent work, we discuss a general framework for multilayer networks, construct a dictionary of terminology to relate the numerous existing concepts to each other, and provide a thorough discussion that compares, contrasts, and translates between related notions such as multilayer networks, multiplex networks, interdependent networks, networks of networks, and many others. We also survey and discuss existing data sets that can be represented as multilayer networks. We review attempts to generalize single-layer-network diagnostics to multilayer networks. We also discuss the rapidly expanding research on multilayer-network models and notions like community structure, connected components, tensor decompositions, and various types of dynamical processes on multilayer networks. We conclude with a summary and an outlook.Comment: Working paper; 59 pages, 8 figure

    Measuring Poverty as a Fuzzy and Multidimensional Concept: Theory and Evidence from the United Kingdom

    Get PDF
    Previous research shows that poor people define poverty not only in material terms, but also in psychological and social terms, though it has been consistently characterized by economic resources in social sciences. Using a method based on `fuzzy-set' theory can be uniquely placed to answer the question as it allows us not only to tackle the problem of arbitrary poverty line, but also to integrate multiple dimensions into one index in an intuitive way. It can avoid the problem of poverty line entirely by introducing the concept of `membership function' which represents a degree of inclusion in a fuzzy subgroup poor. I therefore argue that the fuzzy measures of poverty can be a strong multidimensional alternative for the measures centered around income. To support the argument, two crucial points are clarfied. Firstly, the difference between traditional measures and the fuzzy measures needs to be discussed further since the discussions on the new measures so far lean more toward the fresh insights from the measures, so that the distinction in policy-relevant information has not been emphasized enough. From the comparison, I present that the fuzzy measures can provide a richer description of the social phenomenon, enabling a more acceptable distinction between different sub-populations. Secondly, how the measures behave statistically should be considered in depth because one of the most frequent critiques for poverty measurements is that present methods depend too much on arbitrary decisions like setting a poverty line. Utilizing a Monte Carlo simulation, I find that the measures (Totally Fuzzy, Totally Fuzzy and Relative, and Integrated Fuzzy and Relative) acknowledge two points quite well: (i) poverty is a multidimensional concept, and (ii) the `poor' and `non-poor' are not two mutually exclusive sets and the distinction can be `fuzzy'. It also turns out that the sampling distribution of the fuzzy measures is well-behaved, and they are robust to arbitrary choice in the estimation as well as reliable with relatively small sample size. Besides, I show that they are robust to measurement errors. Finally, I investigate the identification performance of each measure and show that the measures have a strong consistency
    corecore